The Cyclophyllidea is the most diverse order of tapeworms, encompassing species that infect all classes of terrestrial tetrapods including humans and domesticated animals. Available phylogenetic reconstructions based either on morphology or molecular data lack the resolution to allow scientists to either propose a solid taxonomy or infer evolutionary associations. Molecular markers available for the Cyclophyllidea mostly include ribosomal DNA and mitochondrial loci. In this study, we identified 3641 single-copy nuclear coding loci by comparing the genomes of Hymenolepis microstoma, Echinococcus granulosus and Taenia solium. We designed RNA baits based on the sequence of H. microstoma, and applied target enrichment and Illumina sequencing to test the utility of those baits to recover loci useful for phylogenetic analyses. We captured DNA from five species of tapeworms representing two families of cyclophyllideans. We obtained an average of 3284 (90%) of the targets from the test samples and then used captured sequences (2 181 361 bp in total; fragment size ranging from 301 to 6969 bp) to reconstruct a phylogeny for the five test species plus the three species for which genomic data are available. The results were consistent with the current consensus regarding cyclophyllidean relationships. To assess the potential for our method to yield informative genetic variation at intraspecific scales, we extracted 14 074 single nucleotide polymorphisms (SNPs) from alignments of four Arostrilepis macrocirrosa and two A. cooki and successfully inferred their relationships. The results showed that our target gene tools yield data sets that provide robust inferences at a range of taxonomic scales in the Cyclophyllidea.
Recent studies have advocated biomonitoring using DNA techniques. In this study, two high-throughput sequencing (HTS)-based methods were evaluated: amplicon metabarcoding of the cytochrome C oxidase subunit I (COI) mitochondrial gene and gene enrichment using MYbaits (targeting nine different genes including COI). The gene-enrichment method does not require PCR amplification and thus avoids biases associated with universal primers. Macroinvertebrate samples were collected from 12 New Zealand rivers. Macroinvertebrates were morphologically identified and enumerated, and their biomass determined. DNA was extracted from all macroinvertebrate samples and HTS undertaken using the illumina miseq platform. Macroinvertebrate communities were characterized from sequence data using either six genes (three of the original nine were not used) or just the COI gene in isolation. The gene-enrichment method (all genes) detected the highest number of taxa and obtained the strongest Spearman rank correlations between the number of sequence reads, abundance and biomass in 67% of the samples. Median detection rates across rare (<1% of the total abundance or biomass), moderately abundant (1–5%) and highly abundant (>5%) taxa were highest using the gene-enrichment method (all genes). Our data indicated primer biases occurred during amplicon metabarcoding with greater than 80% of sequence reads originating from one taxon in several samples. The accuracy and sensitivity of both HTS methods would be improved with more comprehensive reference sequence databases. The data from this study illustrate the challenges of using PCR amplification-based methods for biomonitoring and highlight the potential benefits of using approaches, such as gene enrichment, which circumvent the need for an initial PCR step.
Songbirds originated in Australia and have now diversified into approximately 5,000 species found across the world. Here, Moyle et al. combine phylogenomic and biogeographic analyses to show that songbird diversification was associated with the formation of island…
Obtaining sequence data from historical museum specimens has been a growing research interest, invigorated by next-generation sequencing methods that allow inputs of highly degraded DNA. We applied a target enrichment and next-generation sequencing protocol to generate ultraconserved elements (UCEs) from 51 large carpenter bee specimens (genus Xylocopa), representing 25 species with specimen ages ranging from 2–121 years. We measured the correlation between specimen age and DNA yield (pre- and post-library preparation DNA concentration) and several UCE sequence capture statistics (raw read count, UCE reads on target, UCE mean contig length and UCE locus count) with linear regression models. We performed piecewise regression to test for specific breakpoints in the relationship of specimen age and DNA yield and sequence capture variables. Additionally, we compared UCE data from newer and older specimens of the same species and reconstructed their phylogeny in order to confirm the validity of our data. We recovered 6–972 UCE loci from samples with pre-library DNA concentrations ranging from 0.06–9.8 ng/μL. All investigated DNA yield and sequence capture variables were significantly but only moderately negatively correlated with specimen age. Specimens of age 20 years or less had significantly higher pre- and post-library concentrations, UCE contig lengths, and locus counts compared to specimens older than 20 years. We found breakpoints in our data indicating a decrease of the initial detrimental effect of specimen age on pre- and post-library DNA concentration and UCE contig length starting around 21–39 years after preservation. Our phylogenetic results confirmed the integrity of our data, giving preliminary insights into relationships within Xylocopa. We consider the effect of additional factors not measured in this study on our age-related sequence capture results, such as DNA fragmentation and preservation method, and discuss the promise of the UCE approach for large-scale projects in insect phylogenomics using museum specimens.
Article
In ancient DNA (aDNA) research, evolutionary and archaeological questions are often investigated using the genomic sequences of organelles: mitochondrial and chloroplast DNA. Organellar genomes are found in multiple copies per living cell, increasing their chance of recovery from archaeological samples, and are inherited from one parent without genetic recombination, simplifying analyses. While mitochondrial genomes have played a key role in many mammalian aDNA projects, including research focused on prehistoric humans and extinct hominins, it is unclear how useful plant chloroplast genomes (plastomes) may be at elucidating questions related to plant evolution, crop domestication, and the prehistoric movement of botanical products through trade and migration. Such analyses are particularly challenging for plant species whose genomes have highly repetitive sequences and that undergo frequent genomic reorganization, notably species with high retrotransposon activity. To address this question, we explored the research potential of the grape (Vitis vinifera L.) plastome using targeted-enrichment methods and high-throughput DNA sequencing on a collection of archaeological grape pip and vine specimens from sites across Eurasia dating ca. 4000 BCE–1500 CE. We demonstrate that due to unprecedented numbers of sequence insertions into the nuclear and mitochondrial genomes, the grape plastome provides limited intraspecific phylogenetic resolution. Nonetheless, we were able to assign archaeological specimens in the Italian peninsula, Sardinia, UK, and Armenia from pre-Roman to medieval times as belonging to all three major chlorotypes A, C, and D found in modern varieties of Western Europe. Analysis of nuclear genomic DNA from these samples reveals a much greater potential for understanding ancient viticulture, including domestication events, genetic introgression from local wild populations, and the origins and histories of varietal lineages.
X-chromosome inactivation (XCI) involves major reorganization of the X chromosome as it becomes silent and heterochromatic. During female mammalian development, XCI is triggered by upregulation of the non-coding Xist RNA from one of the two X chromosomes. Xist coats the chromosome in cis and induces silencing of almost all genes via its A-repeat region, although some genes (constitutive escapees) avoid silencing in most cell types, and others (facultative escapees) escape XCI only in specific contexts. A role for Xist in organizing the inactive X (Xi) chromosome has been proposed. Recent chromosome conformation capture approaches have revealed global loss of local structure on the Xi chromosome and formation of large mega-domains, separated by a region containing the DXZ4 macrosatellite. However, the molecular architecture of the Xi chromosome, in both the silent and expressed regions, remains unclear. Here we investigate the structure, chromatin accessibility and expression status of the mouse Xi chromosome in highly polymorphic clonal neural progenitors (NPCs) and embryonic stem cells. We demonstrate a crucial role for Xist and the DXZ4-containing boundary in shaping Xi chromosome structure using allele-specific genome-wide chromosome conformation capture (Hi-C) analysis, an assay for transposase-accessible chromatin with high throughput sequencing (ATAC–seq) and RNA sequencing. Deletion of the boundary disrupts mega-domain formation, and induction of Xist RNA initiates formation of the boundary and the loss of DNA accessibility. We also show that in NPCs, the Xi chromosome lacks active/inactive compartments and topologically associating domains (TADs), except around genes that escape XCI. Escapee gene clusters display TAD-like structures and retain DNA accessibility at promoter-proximal and CTCF-binding sites. Furthermore, altered patterns of facultative escape genes in different neural progenitor clones are associated with the presence of different TAD-like structures after XCI. These findings suggest a key role for transcription and CTCF in the formation of TADs in the context of the Xi chromosome in neural progenitors.
The Ice Free Corridor has been invoked as a route for Pleistocene human and animal dispersals between eastern Beringia and more southerly areas of North America. Despite the significance of the corridor, there are limited data for when and how this corridor was used. Hypothetical uses of the corridor include: the first expansion of humans from Beringia into the Americas, northward postglacial expansions of fluted point technologies into Beringia, and continued use of the corridor as a contact route between the north and south. Here, we use radiocarbon dates and ancient mitochondrial DNA from late Pleistocene bison fossils to determine the chronology for when the corridor was open and viable for biotic dispersals. The corridor was closed after ∼23,000 until 13,400 calendar years ago (cal y BP), after which we find the first evidence, to our knowledge, that bison used this route to disperse from the south, and by 13,000 y from the north. Our chronology supports a habitable and traversable corridor by at least 13,000 cal y BP, just before the first appearance of Clovis technology in interior North America, and indicates that the corridor would not have been available for significantly earlier southward human dispersal. Following the opening of the corridor, multiple dispersals of human groups between Beringia and interior North America may have continued throughout the latest Pleistocene and early Holocene. Our results highlight the utility of phylogeographic analyses to test hypotheses about paleoecological history and the viability of dispersal routes over time.
Rapid evolutionary radiations are expected to require large amounts of sequence data to resolve. To resolve these types of relationships many systematists believe that it will be necessary to collect data by next-generation sequencing (NGS) and use multispecies coalescent (“species tree”) methods. Ultraconserved element (UCE) sequence capture is becoming a popular method to leverage the high throughput of NGS to address problems in vertebrate phylogenetics. Here we examine the performance of UCE data for gallopheasants (true pheasants and allies), a clade that underwent a rapid radiation 10–15 Ma. Relationships among gallopheasant genera have been difficult to establish. We used this rapid radiation to assess the performance of species tree methods, using ∼600 kilobases of DNA sequence data from ∼1500 UCEs. We also integrated information from traditional markers (nuclear intron data from 15 loci and three mitochondrial gene regions). Species tree methods exhibited troubling behavior. Two methods [Maximum Pseudolikelihood for Estimating Species Trees (MP-EST) and Accurate Species TRee ALgorithm (ASTRAL)] appeared to perform optimally when the set of input gene trees was limited to the most variable UCEs, though ASTRAL appeared to be more robust than MP-EST to input trees generated using less variable UCEs. In contrast, the rooted triplet consensus method implemented in Triplec performed better when the largest set of input gene trees was used. We also found that all three species tree methods exhibited a surprising degree of dependence on the program used to estimate input gene trees, suggesting that the details of likelihood calculations (e.g., numerical optimization) are important for loci with limited phylogenetic information. As an alternative to summary species tree methods we explored the performance of SuperMatrix Rooted Triple – Maximum Likelihood (SMRT-ML), a concatenation method that is consistent even when gene trees exhibit topological differences due to the multispecies coalescent. We found that SMRT-ML performed well for UCE data. Our results suggest that UCE data have excellent prospects for the resolution of difficult evolutionary radiations, though specific attention may need to be given to the details of the methods used to estimate species trees.
Glyptodonts were giant (some of them up to ~2400 kg), heavily armoured relatives of living armadillos, which became extinct during the Late Pleistocene/early Holocene alongside much of the South American megafauna. Although glyptodonts were an important component of Cenozoic South American faunas, their early evolution and phylogenetic affinities within the order Cingulata (armoured New World placental mammals) remain controversial. In this study, we used hybridization enrichment and high-throughput sequencing to obtain a partial mitochondrial genome from Doedicurus sp., the largest (1.5 m tall, and 4 m long) and one of the last surviving glyptodonts. Our molecular phylogenetic analyses revealed that glyptodonts fall within the diversity of living armadillos. Reanalysis of morphological data using a molecular ‘backbone constraint’ revealed several morphological characters that supported a close relationship between glyptodonts and the tiny extant fairy armadillos (Chlamyphorinae). This is surprising as these taxa are among the most derived cingulates: glyptodonts were generally large-bodied and heavily armoured, while the fairy armadillos are tiny (~9–17 cm) and adapted for burrowing. Calibration of our phylogeny with the first appearance of glyptodonts in the Eocene resulted in a more precise timeline for xenarthran evolution. The osteological novelties of glyptodonts and their specialization for grazing appear to have evolved rapidly during the Late Eocene to Early Miocene, coincident with global temperature decreases and a shift from wet closed forest towards drier open woodland and grassland across much of South America. This environmental change may have driven the evolution of glyptodonts, culminating in the bizarre giant forms of the Pleistocene.
Ann Arbor, MI 48103
(d/b/a Daicel Arbor Biosciences)
All Rights Reserved.