Archived specimens are highly valuable sources of DNA for retrospective genetic/genomic analysis. However, often limited effort has been made to evaluate and optimize extraction methods, which may be crucial for downstream applications. Here, we assessed and optimized the usefulness of abundant archived skeletal material from sharks as a source of DNA for temporal genomic studies. Six different methods for DNA extraction, encompassing two different commercial kits and three different protocols, were applied to material, so-called bio-swarf, from contemporary and archived jaws and vertebrae of tiger sharks (Galeocerdo cuvier). Protocols were compared for DNA yield and quality using a qPCR approach. For jaw swarf, all methods provided relatively high DNA yield and quality, while large differences in yield between protocols were observed for vertebrae. Similar results were obtained from samples of white shark (Carcharodon carcharias). Application of the optimized methods to 38 museum and private angler trophy specimens dating back to 1912 yielded sufficient DNA for downstream genomic analysis for 68% of the samples. No clear relationships between age of samples, DNA quality and quantity were observed, likely reflecting different preparation and storage methods for the trophies. Trial sequencing of DNA capture genomic libraries using 20 000 baits revealed that a significant proportion of captured sequences were derived from tiger sharks. This study demonstrates that archived shark jaws and vertebrae are potential high-yield sources of DNA for genomic-scale analysis. It also highlights that even for similar tissue types, a careful evaluation of extraction protocols can vastly improve DNA yield.
Single nucleotide polymorphisms (SNPs) are replacing microsatellites for population genetic analyses, but it is not apparent how many SNPs are needed or how well SNPs correlate with microsatellites. We used data from the gopher tortoise, Gopherus polyphemus—a species with small populations, to compare SNPs and microsatellites to estimate population genetic parameters. Specifically, we compared one SNP data set (16 tortoises from four populations sequenced at 17 901 SNPs) to two microsatellite data sets, a full data set of 101 tortoises and a partial data set of 16 tortoises previously genotyped at 10 microsatellites. For the full microsatellite data set, observed heterozygosity, expected heterozygosity and FST were correlated between SNPs and microsatellites; however, allelic richness was not. The same was true for the partial microsatellite data set, except that allelic richness, but not observed heterozygosity, was correlated. The number of clusters estimated by structure differed for each data set (SNPs = 2; partial microsatellite = 3; full microsatellite = 4). Principle component analyses (PCA) showed four clusters for all data sets. More than 800 SNPs were needed to correlate with allelic richness, observed heterozygosity and expected heterozygosity, but only 100 were needed for FST. The number of SNPs typically obtained from next-generation sequencing (NGS) far exceeds the number needed to correlate with microsatellite parameter estimates. Our study illustrates that diversity, FST and PCA results from microsatellites can mirror those obtained with SNPs. These results may be generally applicable to small populations, a defining feature of endangered and threatened species, because theory predicts that genetic drift will tend to outweigh selection in small populations.
Growing evidence supports the idea that species can diverge in the presence of gene flow. However, most methods of phylogeny estimation do not consider this process, despite the fact that ignoring gene flow is known to bias phylogenetic inference. Furthermore, studies that do consider divergence-with-gene-flow typically do so by estimating rates of gene flow using a isolation-with-migration model (IM), rather than evaluating scenarios of gene flow (such as divergence-with-gene flow or secondary contact) that represent very different types of diversification. In this investigation, we aim to infer the recent phylogenetic history of a clade of western long-eared bats while evaluating a number of different models that parameterize gene flow in a variety of ways. We utilize PHRAPL, a new tool for phylogeographic model selection, to compare the fit of a broad set of demographic models that include divergence, migration, or both among Myotis evotis, $$M$$. thysanodes and M. keenii. A genomic data set consisting of 808 loci of ultraconserved elements was used to explore such models in three steps using an incremental design where each successive set was informed by, and thus more focused than, the previous set of models. Specifically, the three steps were to (i) assess whether gene flow should be modeled and identify the best topologies, (ii) infer directionality of migration using the best topologies, and (iii) estimate the timing of gene flow. The best model (AIC model weight $${sim}0.98$$) included two divergence events (($$M$$. evotis, $$M$$. thysanodes), M. keenii) accompanied by gene flow at the initial stages of divergence. These results provide a striking example of speciation-with-gene-flow in an evolutionary lineage.
Recent genomic studies of both ancient and modern indigenous people of the Americas have shed light on the demographic processes involved during the first peopling. The Pacific Northwest Coast proves an intriguing focus for these studies because of its association with coastal migration models and genetic ancestral patterns that are difficult to reconcile with modern DNA alone. Here, we report the low-coverage genome sequence of an ancient individual known as “Shuká Káa” (“Man Ahead of Us”) recovered from the On Your Knees Cave (OYKC) in southeastern Alaska (archaeological site 49-PET-408). The human remains date to ∼10,300 calendar (cal) y B.P. We also analyze low-coverage genomes of three more recent individuals from the nearby coast of British Columbia dating from ∼6,075 to 1,750 cal y B.P. From the resulting time series of genetic data, we show that the Pacific Northwest Coast exhibits genetic continuity for at least the past 10,300 cal y B.P. We also infer that population structure existed in the late Pleistocene of North America with Shuká Káa on a different ancestral line compared with other North American individuals from the late Pleistocene or early Holocene (i.e., Anzick-1 and Kennewick Man). Despite regional shifts in mtDNA haplogroups, we conclude from individuals sampled through time that people of the northern Northwest Coast belong to an early genetic lineage that may stem from a late Pleistocene coastal migration into the Americas.
Genetic diversity within and among populations lies at the heart of evolution. Unraveling the extent to which each intrinsic or extrinsic factor determines levels of diversity among genes, populations, and species is challenging, given the difficulty of isolating any single potentially important variable from all others. Allopolyploid species provide an opportunity to disentangle external and intrinsic factors, as the two (or more) homoeologous genomes co-occur in the same nucleus, often exhibiting high collinearity along homoeologous chromosomes. Here we evaluate the pace of molecular evolution and intraspecific, intragenomic diversity in two species of allopolyploid Gossypium, G. hirsutum and G. barbadense, using several hundred genes sequenced from multiple accessions of each species. Genic diversity in both species is low, having been influenced both by the polyploid bottleneck and a domestication bottleneck (for cultivated accessions), but with a directional bias in homoeolog diversity favoring the same genome in both allopolyploids. Total diversity is remarkably similar for the two homoeologous genomes overall, but the two copies of many gene pairs have accumulated statistically different diversity levels, and in a biased fashion with respect to genome. Domesticated accessions show reduced diversity in both genomes, as expected, but with a much greater reduction in one of the two homoeologous genomes. Furthermore, this biased reduction affects opposite homoeologous genomes in the two species. Interspecific introgression has played a role in shaping diversity within each species. Introgression was only detected for certain accessions, and only from G. barbadense into G. hirsutum in one of the two co-resident genomes.
Palaeognaths, the sister group of all other living birds (neognaths), were once considered to be vicariant relics from the breakup of the Gondwanan supercontinent. However, recent molecular studies instead argue for dispersal of volant ancestors across marine barriers. Resolving this debate hinges upon accurately reconstructing their evolutionary relationships and dating their divergences, which often relies on phylogenetic information from extinct relatives and nuclear genomes. Mitogenomes from the extinct elephant birds of Madagascar have helped inform the palaeognath phylogeny; however, nuclear information has remained unavailable. Here, we use ancient DNA (aDNA) extracted from fossil eggshell, together with target enrichment and next-generation sequencing techniques, to reconstruct an additional new mitogenome from Aepyornis sp. with 33.5X coverage. We also recover the first elephant bird nuclear aDNA, represented by 12,500 bp of exonic information. While we confirm that elephant birds are sister taxa to the kiwi, our data suggests that, like neognaths, palaeognaths underwent an explosive radiation between 69 and 52 Ma—well after the break-up of Gondwana, and more rapidly than previously estimated from mitochondrial data alone. These results further support the idea that ratites primarily diversified immediately following the Cretaceous-Palaeogene mass extinction and convergently evolved flightlessness. Our study reinforces the importance of including information from the nuclear genome of extinct taxa for recovering deep evolutionary relationships. Furthermore, with approximately 3% endogenous aDNA retrieved, avian eggshell can be a valuable substrate for recovering high quality aDNA. We suggest that elephant bird whole genome recovery is ultimately achievable, and will provide future insights into the evolution these birds.
The Mediterranean Basin harbors a remarkable amount of biodiversity, a high proportion of which is endemic to this region. Here, we present an in-depth study of an angiosperm species complex, in which cryptic taxonomic diversity has been hypothesized. Specifically, we focus on four currently recognized species in the Roucela complex, a well-supported clade in the Campanulaceae/Campanuloideae: Campanula creutzburgii, C. drabifolia, C. erinus, and C. simulans. This study takes a phylogenomic approach, utilizing near-complete plastomes and 130 nuclear loci, to uncover cryptic diversity and test hypotheses regarding hybridization and polyploidy within this clade. Genome size estimates recovered tetraploid and octoploid lineages within the currently recognized, widespread species C. erinus, showing an east-west geographic pattern. Though genomic data clearly differentiate these two cytotypes, we failed to discern morphological differences. The formation of a cryptic octoploid lineage, distributed across the eastern Mediterranean, is hypothesized to be the result of an allopolyploid event in which one parental morphology is retained. The tetraploid C. erinus and C. creutzburgii (also a tetraploid) are implicated as parental lineages. Our results highlight the utility of target-enrichment approaches for obtaining genomic datasets for thorough assessments of species diversity and the importance of carefully considering gene-tree discordance within such datasets.
The phylogeny of the Phasianidae (pheasants, partridges, and allies) has been studied extensively. However, these studies have largely ignored three enigmatic genera because of scarce DNA source material and limited overlapping phylogenetic data: blood pheasants (Ithaginis), snow partridges (Lerwa), and long-billed partridges (Rhizothera). Thus, phylogenetic positions of these three genera remain uncertain in what is otherwise a well-resolved phylogeny. Previous studies using different data types place Lerwa and Ithaginis in similar positions, but the absence of overlapping data means the relationship between them could not be inferred. Rhizothera was originally described in the genus Perdix (true partridges), although a partial cytochrome b (CYB) sequence suggests it is sister to Pucrasia (koklass pheasant). To identify robust relationships among Ithaginis, Lerwa, Rhizothera, and their phasianid relatives, we used 3692 ultra-conserved element (UCE) loci and complete mitogenomes from 19 species including previously hypothesized relatives of the three focal genera and representatives from all major phasianid clades. We used DNA extracted from historical specimen toepads for species that lacked fresh tissue in museum collections. Maximum likelihood and multispecies coalescent UCE analyses strongly supported Lerwa sister to a large clade which included Ithaginis at its base, and also including turkey, grouse, typical pheasants, tragopans, Pucrasia, and Perdix. Rhizothera was also in this clade, sister to a diverse group comprising Perdix, typical pheasants, Pucrasia, turkey and grouse. Mitogenomic genealogies differed from UCEs topologies, supporting a sister relationship between Ithaginis and Lerwa rather than a grade. The position of Rhizothera using mitogenomes depended on analytical choices. Unpartitioned and codon-based analyses placed Rhizothera sister to a tragopan clade, whereas a partitioned DNA model of the mitogenome was congruent with UCE results. In all mitogenome analyses, Pucrasia was sister to a clade including Perdix and the typical pheasants with high support, in contrast to UCEs and published nuclear intron data. Due to the strong support and consistent topology provided by all UCE analyses, we have identified phylogenetic relationships of these three enigmatic, poorly-studied, phasianid taxa.
The detection and typing of Vibrio cholerae in natural aquatic environments encounter major methodological challenges related to the fact that the bacterium is often present in environmental matrices at very low abundance in nonculturable state. This study applied, for the first time to our knowledge, a whole-genome enrichment (WGE) and next-generation sequencing (NGS) approach for direct genotyping and metagenomic analysis of low abundant V. cholerae DNA (<50 genome unit/L) from natural water collected in the Morogoro river (Tanzania). The protocol is based on the use of biotinylated RNA baits for target enrichment of V. cholerae metagenomic DNA via hybridization. An enriched V. cholerae metagenome library was generated and sequenced on an Illumina MiSeq platform. Up to 1.8 × 107 bp (4.5× mean read depth) were found to map against V. cholerae reference genome sequences representing an increase of about 2500 times in target DNA coverage compared to theoretical calculations of performance for shotgun metagenomics. Analysis of metagenomic data revealed the presence of several V. cholerae virulence and virulence associated genes in river water including major virulence regions (e.g. CTX prophage and Vibrio pathogenicity island-1) and genetic markers of epidemic strains (e.g. O1-antigen biosynthesis gene cluster) that were not detectable by standard culture and molecular techniques. Overall, besides providing a powerful tool for direct genotyping of V. cholerae in complex environmental matrices, this study provides a ‘proof of concept’ on the methodological gap that might currently preclude a more comprehensive understanding of toxigenic V. cholerae emergence from natural aquatic environments.
Ann Arbor, MI 48103
(d/b/a Daicel Arbor Biosciences)
All Rights Reserved.