Host–pathogen interactions may result in either directional selection or in pressure for the maintenance of polymorphism at the molecular level. Hence signatures of both positive and balancing selection are expected in immune genes. Because both overall selective pressure and specific targets may differ between species, large-scale population genomic studies are useful in detecting functionally important immune genes and comparing selective landscapes between taxa. Such studies are of particular interest in amphibians, a group threatened worldwide by emerging infectious diseases. Here, we present an analysis of polymorphism and divergence of 634 immune genes in two lineages of Lissotriton newts: L. montandoni and L. vulgaris graecus. Variation in newt immune genes has been shaped predominantly by widespread purifying selection and strong evolutionary constraint, implying long-term importance of these genes for functioning of the immune system. The two evolutionary lineages differ in the overall strength of purifying selection which can partially be explained by demographic history but may also signal differences in long-term pathogen pressure. The prevalent constraint notwithstanding, 23 putative targets of positive selection and 11 putative targets of balancing selection were identified. The latter were detected by composite tests involving the demographic model and further validated in independent population samples. Putative targets of balancing selection encode proteins which may interact closely with pathogens but include also regulators of immune response. The identified candidates will be useful for testing whether genes affected by balancing selection are more prone to interspecific introgression than other genes in the genome.

Herbaria are unparalleled collections of biodiversity information representing the world’s flora. However, this treasure has remained largely inaccessible to genetic studies, frequently limited by the low yields of poor-quality DNA. Next-generation sequencing (NGS) has transformed every field of biological research. The different strategies for accessing genetic data using NGS are changing the direction of biodiversity research—we are no longer constrained by a relatively small number of markers for non-model organisms, by time and cost limited sample sizes, or by incomplete datasets due to recalcitrant DNA extractions or PCR amplification failure. Here we show that targeted enrichment through hybrid capture can be used to generate hundreds of kilobases of nuclear sequence data of the Neotropical genus Inga, from herbarium specimens as old as 180 years and using as little as 16 ng of degraded DNA.

Phylogenetic analysis of Plasmodium parasites has indicated that their modern-day distribution is a result of a series of human-mediated dispersals involving transport between Africa, Europe, America, and Asia. A major outstanding question is the phylogenetic affinity of the malaria causing parasites Plasmodium vivax and falciparum in historic southern Europe—where it was endemic until the mid-20th century, after which it was eradicated across the region. Resolving the identity of these parasites will be critical for answering several hypotheses on the malaria dispersal. Recently, a set of slides with blood stains of malaria-affected people from the Ebro Delta (Spain), dated between 1942 and 1944, have been found in a local medical collection. We extracted DNA from three slides, two of them stained with Giemsa (on which Plasmodium parasites could still be seen under the microscope) and another one consisting of dried blood spots. We generated the data using Illumina sequencing after using several strategies aimed at increasing the Plasmodium DNA yield: depletion of the human genomic (g)DNA content through hybridization with human gDNA baits, and capture-enrichment using gDNA derived from P. falciparum. Plasmodium mitochondrial genome sequences were subsequently reconstructed from the resulting data. Phylogenetic analysis of the eradicated European P. vivax mtDNA genome indicates that the European isolate is closely related to the most common present-day American haplotype and likely entered the American continent post-Columbian contact. Furthermore, the European P. falciparum mtDNA indicates a link with current Indian strains that is in agreement with historical accounts.

Genomic studies are revealing that divergence and speciation are marked by gene flow, but it is not clear whether gene flow has played a prominent role during the generation of biodiversity in species-rich regions of the world where vicariance is assumed to be the principal mode by which new species form. We revisit a well-studied organismal system in the Mexican Highlands, Aphelocoma jays, to test for gene flow among Mexican sierras. Prior results from mitochondrial DNA (mtDNA) largely conformed to the standard model of allopatric divergence, although there was also evidence for more obscure histories of gene flow in a small sample of nuclear markers. We tested for these ‘hidden histories’ using genomic markers known as ultraconserved elements (UCEs) in concert with phylogenies, clustering algorithms and newer introgression tests specifically designed to detect ancient gene flow (e.g. ABBA/BABA tests). Results based on 4303 UCE loci and 2500 informative SNPs are consistent with varying degrees of gene flow among highland areas. In some cases, gene flow has been extensive and recent (although perhaps not ongoing today), whereas in other cases there is only a trace signature of ancient gene flow among species that diverged as long as 5 million years ago. These results show how a species complex thought to be a model for vicariance can reveal a more reticulate history when a broader portion of the genome is queried. As more organisms are studied with genomic data, we predict that speciation-with-bouts-of-gene-flow will turn out to be a common mode of speciation.

Custom sequence capture experiments are becoming an efficient approach for gathering large sets of orthologous markers in nonmodel organisms. Transcriptome-based exon capture utilizes transcript sequences to design capture probes, typically using a reference genome to identify intron–exon boundaries to exclude shorter exons (<200 bp). Here, we test directly using transcript sequences for probe design, which are often composed of multiple exons of varying lengths. Using 1260 orthologous transcripts, we conducted sequence captures across multiple phylogenetic scales for frogs, including outgroups ~100 Myr divergent from the ingroup. We recovered a large phylogenomic data set consisting of sequence alignments for 1047 of the 1260 transcriptome-based loci (~561 000 bp) and a large quantity of highly variable regions flanking the exons in transcripts (~70 000 bp), the latter improving substantially by only including ingroup species (~797 000 bp). We recovered both shorter (<100 bp) and longer exons (>200 bp), with no major reduction in coverage towards the ends of exons. We observed significant differences in the performance of blocking oligos for target enrichment and nontarget depletion during captures, and differences in PCR duplication rates resulting from the number of individuals pooled for capture reactions. We explicitly tested the effects of phylogenetic distance on capture sensitivity, specificity, and missing data, and provide a baseline estimate of expectations for these metrics based on a priori knowledge of nuclear pairwise differences among samples. We provide recommendations for transcriptome-based exon capture design based on our results, cost estimates and offer multiple pipelines for data assembly and analysis.

The New Zealand acanthisittid wrens are the sister-taxon to all other “perching birds” (Passeriformes) and – including recently extinct species – represent the most diverse endemic passerine family in New Zealand. Consequently, they are important for understanding both the early evolution of Passeriformes and the New Zealand biota. However, five of the seven species have become extinct since the arrival of humans in New Zealand, complicating evolutionary analyses. The results of morphological analyses have been largely equivocal, and no comprehensive genetic analysis of Acanthisittidae has been undertaken. We present novel mitochondrial genome sequences from four acanthisittid species (three extinct, one extant), allowing us to resolve the phylogeny and revise the taxonomy of acanthisittids. Reanalysis of morphological data in light of our genetic results confirms a close relationship between the extant rifleman (Acanthisitta chloris) and an extinct Miocene wren (Kuiornis indicator), making Kuiornis a useful calibration point for molecular dating of passerines. Our molecular dating analyses reveal that the stout-legged wrens (Pachyplichas) diverged relatively recently from a more gracile (Xenicus-like) ancestor. Further, our results suggest a possible Early Oligocene origin of the basal Lyall’s wren (Traversia) lineage, which would imply that Acanthisittidae survived the Oligocene marine inundation of New Zealand and therefore that the inundation was not complete.

Evolutionary biologists from Darwin forward have dreamed of having data that would elucidate our understanding of evolutionary history and the diversity of life. Sequence capture is a relatively old DNA technology, but its use is growing rapidly due to advances in (i) massively parallel DNA sequencing approaches and instruments, (ii) massively parallel bait construction, (iii) methods to identify target regions and (iv) sample preparation. We give a little historical context to these developments, summarize some of the important advances reported in this special issue and point to further advances that can be made to help fulfill Darwin’s dream.

Gathering genomic-scale data efficiently is challenging for nonmodel species with large, complex genomes. Transcriptome sequencing is accessible for organisms with large genomes, and sequence capture probes can be designed from such mRNA sequences to enrich and sequence exonic regions. Maximizing enrichment efficiency is important to reduce sequencing costs, but relatively few data exist for exon capture experiments in nonmodel organisms with large genomes. Here, we conducted a replicated factorial experiment to explore the effects of several modifications to standard protocols that might increase sequence capture efficiency for amphibians and other taxa with large, complex genomes. Increasing the amounts of c0t-1 repetitive sequence blocker and individual input DNA used in target enrichment reactions reduced the rates of PCR duplication. This reduction led to an increase in the percentage of unique reads mapping to target sequences, essentially doubling overall efficiency of the target capture from 10.4% to nearly 19.9% and rendering target capture experiments more efficient and affordable. Our results indicate that target capture protocols can be modified to efficiently screen vertebrates with large genomes, including amphibians.

Teasing apart neutral and adaptive genomic processes and identifying loci that are targets of selection can be difficult, particularly for nonmodel species that lack a reference genome. However, identifying such loci and the factors driving selection have the potential to greatly assist conservation and restoration practices, especially for the management of species in the face of contemporary and future climate change. Here, we focus on assessing adaptive genomic variation within a nonmodel plant species, the narrow-leaf hopbush (Dodonaea viscosa ssp. angustissima), commonly used for restoration in Australia. We used a hybrid-capture target enrichment approach to selectively sequence 970 genes across 17 populations along a latitudinal gradient from 30°S to 36°S. We analysed 8462 single-nucleotide polymorphisms (SNPs) for FST outliers as well as associations with environmental variables. Using three different methods, we found 55 SNPs with significant correlations to temperature and water availability, and 38 SNPs to elevation. Genes containing SNPs identified as under environmental selection were diverse, including aquaporin and abscisic acid genes, as well as genes with ontologies relating to responses to environmental stressors such as water deprivation and salt stress. Redundancy analysis demonstrated that only a small proportion of the total genetic variance was explained by environmental variables. We demonstrate that selection has led to clines in allele frequencies in a number of functional genes, including those linked to leaf shape and stomatal variation, which have been previously observed to vary along the sampled environmental cline. Using our approach, gene regions subject to environmental selection can be readily identified for nonmodel organisms.

Sample availability limits population genetics research on many species, especially taxa from regions with high diversity. However, many such species are well represented in museum collections assembled before the molecular era. Development of techniques to recover genetic data from these invaluable specimens will benefit biodiversity science. Using a mixture of freshly preserved and historical tissue samples, and a sequence capture probe set targeting >5000 loci, we produced high-confidence genotype calls on thousands of single nucleotide polymorphisms (SNPs) in each of five South-East Asian bird species and their close relatives (N = 27–43). On average, 66.2% of the reads mapped to the pseudo-reference genome of each species. Of these mapped reads, an average of 52.7% was identified as PCR or optical duplicates. We achieved deeper effective sequencing for historical samples (122.7×) compared to modern samples (23.5×). The number of nucleotide sites with at least 8× sequencing depth was high, with averages ranging from 0.89 × 106 bp (Arachnothera, modern samples) to 1.98 × 106 bp (Stachyris, modern samples). Linear regression revealed that the amount of sequence data obtained from each historical sample (represented by per cent of the pseudo-reference genome recovered with ≥8× sequencing depth) was positively and significantly (P ≤ 0.013) related to how recently the sample was collected. We observed characteristic post-mortem damage in the DNA of historical samples. However, we were able to reduce the error rate significantly by truncating ends of reads during read mapping (local alignment) and conducting stringent SNP and genotype filtering.