With the increasing availability of high-throughput sequencing, phylogenetic analyses are no longer constrained by the limited availability of a few loci. Here, we describe a sequence capture methodology, which we used to collect data for analyses of diversification within Sabal (Arecaceae), a palm genus native to the south-eastern USA, Caribbean, Bermuda and Central America. RNA probes were developed and used to enrich DNA samples for putatively low copy nuclear genes and the plastomes for all Sabal species and two outgroup species. Sequence data were generated on an Illumina MiSeq sequencer and target sequences were assembled using custom workflows. Both coalescence and supermatrix analyses of 133 nuclear genes were used to estimate species trees relationships. Plastid genomes were also analysed, yielding generally poor resolution with regard to species relationships. Species relationships described in both nuclear gene and plastome sequences largely reflect the biogeography of the group and, to a lesser extent, previous morphology-based hypotheses. Beyond the biological implications, this research validates a high-throughput methodology for generating a large number of genes for coalescence-based phylogenetic analyses in plant lineages.

In an era of ever-increasing amounts of whole-genome sequence data for individuals and populations, the utility of traditional single nucleotide polymorphisms (SNPs) array-based genome scans is uncertain. We previously performed a SNP array-based genome scan to identify candidate genes under selection in six distinct grey wolf (Canis lupus) ecotypes. Using this information, we designed a targeted capture array for 1040 genes, including all exons and flanking regions, as well as 5000 1-kb nongenic neutral regions, and resequenced these regions in 107 wolves. Selection tests revealed striking patterns of variation within candidate genes relative to noncandidate regions and identified potentially functional variants related to local adaptation. We found 27% and 47% of candidate genes from the previous SNP array study had functional changes that were outliers in sweed and bayenv analyses, respectively. This result verifies the use of genomewide SNP surveys to tag genes that contain functional variants between populations. We highlight nonsynonymous variants in APOB, LIPG and USH2A that occur in functional domains of these proteins, and that demonstrate high correlation with precipitation seasonality and vegetation. We find Arctic and High Arctic wolf ecotypes have higher numbers of genes under selection, which highlight their conservation value and heightened threat due to climate change. This study demonstrates that combining genomewide genotyping arrays with large-scale resequencing and environmental data provides a powerful approach to discern candidate functional variants in natural populations.

Museums hold most of the world’s most valuable biological specimens and tissues collected, including type material that is often decades or even centuries old. Unfortunately, traditional museum collection and storage methods were not designed to preserve the nucleic acids held within the material, often reducing its potential viability and value for many genetic applications. High-throughput sequencing technologies and associated applications offer new opportunities for obtaining sequence data from museum samples. In particular, target sequence capture offers a promising approach for recovering large numbers of orthologous loci from relatively small amounts of starting material. In the present study, we test the utility of target sequence capture for obtaining data from museum-held material from a speciose mammalian genus: the horseshoe bats (Rhinolophidae: Chiroptera). We designed a ‘bait’ for capturing > 3600 genes and applied this to 10 species of horseshoe bat that had been collected between 93 and 7 years ago and preserved using a range of methods. We found that the mean recovery rate per species was approximately 89% of target genes with partial sequence coverage, ranging from 3024 to 3186 genes recovered. On average, we recovered 1206 genes with ≥ 90% sequence coverage, per species. Our findings provide good support for the application of large-scale bait capture across congeneric species spanning approximately 15 Myr of evolution. On the other hand, we observed no clear association between the success of capture and the phylogenetic distance from the bait model, although sample sizes precluded a formal test.

The complete mitochondrial genome of the extinct musk ox Bootherium bombifrons is presented for the first time. Phylogenetic analysis supports placement of Bootherium as sister to the living musk ox, Ovibos moschatus, in agreement with morphological taxonomy. SNPs identified in the COI-5p region provide a tool for the identification of Bootherium among material, which is not morphologically diagnosable, for example postcrania, coprolites, and archaeological specimens, and/or lacks precise stratigraphic control, like many from glacial alluvium and in placer mines.

The genus Cucurbita (squashes, pumpkins, gourds) contains numerous domesticated lineages with ancient New World origins. It was broadly distributed in the past but has declined to the point that several of the crops’ progenitor species are scarce or unknown in the wild. We hypothesize that Holocene ecological shifts and megafaunal extinctions severely impacted wild Cucurbita, whereas their domestic counterparts adapted to changing conditions via symbiosis with human cultivators. First, we used high-throughput sequencing to analyze complete plastid genomes of 91 total Cucurbita samples, comprising ancient (n = 19), modern wild (n = 30), and modern domestic (n = 42) taxa. This analysis demonstrates independent domestication in eastern North America, evidence of a previously unknown pathway to domestication in northeastern Mexico, and broad archaeological distributions of taxa currently unknown in the wild. Further, sequence similarity between distant wild populations suggests recent fragmentation. Collectively, these results point to wild-type declines coinciding with widespread domestication. Second, we hypothesize that the disappearance of large herbivores struck a critical ecological blow against wild Cucurbita, and we take initial steps to consider this hypothesis through cross-mammal analyses of bitter taste receptor gene repertoires. Directly, megafauna consumed Cucurbita fruits and dispersed their seeds; wild Cucurbita were likely left without mutualistic dispersal partners in the Holocene because they are unpalatable to smaller surviving mammals with more bitter taste receptor genes. Indirectly, megafauna maintained mosaic-like landscapes ideal for Cucurbita, and vegetative changes following the megafaunal extinctions likely crowded out their disturbed-ground niche. Thus, anthropogenic landscapes provided favorable growth habitats and willing dispersal partners in the wake of ecological upheaval.

The spread of farming out of the Balkans and into the rest of Europe followed two distinct routes: An initial expansion represented by the Impressa and Cardial traditions, which followed the Northern Mediterranean coastline; and another expansion represented by the LBK (Linearbandkeramik) tradition, which followed the Danube River into Central Europe. Although genomic data now exist from samples representing the second migration, such data have yet to be successfully generated from the initial Mediterranean migration. To address this, we generated the complete genome of a 7,400-year-old Cardial individual (CB13) from Cova Bonica in Vallirana (Barcelona), as well as partial nuclear data from five others excavated from different sites in Spain and Portugal. CB13 clusters with all previously sequenced early European farmers and modern-day Sardinians. Furthermore, our analyses suggest that both Cardial and LBK peoples derived from a common ancient population located in or around the Balkan Peninsula. The Iberian Cardial genome also carries a discernible hunter–gatherer genetic signature that likely was not acquired by admixture with local Iberian foragers. Our results indicate that retrieving ancient genomes from similarly warm Mediterranean environments such as the Near East is technically feasible.

The formation of allopolyploid cotton precipitated a rapid diversification and colonization of dry coastal American tropical and subtropical regions. Previous phylogenetic analyses, combined with molecular divergence analyses, have offered a temporal framework for this radiation, but provide only weak support for some of the resolved branches. Moreover, these earlier analyses did not include the recently recognized sixth polyploid species, G. ekmanianum Wittmack. Here we use targeted sequence capture of multiple loci in conjunction with both concatenated and Bayesian concordance analyses to reevaluate the phylogeny of allopolyploid cotton species. Although phylogenetic resolution afforded by individual genes is often low, sufficient signal was attained both through the concatenated and concordance analyses to provide robust support for the Gossypium polyploid clade, which is reported here.

Based on mitochondrial DNA (mtDNA), it has been estimated that at least 15 founder haplogroups peopled the Americas. Subhaplogroup C1d3 was defined based on the mitogenome of a living individual from Uruguay that carried a lineage previously identified in hypervariable region I sequences from ancient and modern Uruguayan individuals. When complete mitogenomes were studied, additional substitutions were found in the coding region of the mitochondrial genome. Using a complete ancient mitogenome and three modern mitogenomes, we aim to clarify the ancestral state of subhaplogroup C1d3 and to better understand the peopling of the region of the Río de la Plata basin, as well as of the builders of the mounds from which the ancient individuals were recovered. The ancient mitogenome, belonging to a female dated to 1,610±46 years before present, was identical to the mitogenome of one of the modern individuals. All individuals share the mutations defining subhaplogroup C1d3. We estimated an age of 8,974 (5,748–12,261) years for the most recent common ancestor of C1d3, in agreement with the initial peopling of the geographic region. No individuals belonging to the defined lineage were found outside of Uruguay, which raises questions regarding the mobility of the prehistoric inhabitants of the country. Moreover, the present study shows the continuity of Native lineages over at least 6,000 years.

Targeted Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach to identify novel sequence variation in genomes, with the aims of investigating gene function and/or developing useful alleles for breeding. Despite recent advances in wheat genomics, most current TILLING methods are low to medium in throughput, being based on PCR amplification of the target genes. We performed a pilot-scale evaluation of TILLING in wheat by next-generation sequencing through exon capture. An oligonucleotide-based enrichment array covering ~2 Mbp of wheat coding sequence was used to carry out exon capture and sequencing on three mutagenised lines of wheat containing previously-identified mutations in the TaGA20ox1 homoeologous genes. After testing different mapping algorithms and settings, candidate SNPs were identified by mapping to the IWGSC wheat Chromosome Survey Sequences. Where sequence data for all three homoeologues were found in the reference, mutant calls were unambiguous; however, where the reference lacked one or two of the homoeologues, captured reads from these genes were mis-mapped to other homoeologues, resulting either in dilution of the variant allele frequency or assignment of mutations to the wrong homoeologue. Competitive PCR assays were used to validate the putative SNPs and estimate cut-off levels for SNP filtering. At least 464 high-confidence SNPs were detected across the three mutagenized lines, including the three known alleles in TaGA20ox1, indicating a mutation rate of ~35 SNPs per Mb, similar to that estimated by PCR-based TILLING. This demonstrates the feasibility of using exon capture for genome re-sequencing as a method of mutation detection in polyploid wheat, but accurate mutation calling will require an improved genomic reference with more comprehensive coverage of homoeologues.

An incomplete carcass of an extinct bison, Bison ex gr. priscus, was discovered in 2012 in the mouth of the Rauchua River (69°30′N, 166°49′E), Chukotka. The carcass included the rump with two hind limbs, ribs, and large flap of hide from the abdomen and sides, several vertebrae, bones of the forelimbs and anterior autopodia, stomach with its contents, and wool. The limb bones are relatively gracile, which is unusual in bison, and a SEM study of the hair microstructure suggests higher insulating capacity than in extant members of the genus. Additionally, mitochondrial DNA analyses indicate that the Rauchua bison belonged to a distinct and previously unidentified lineage of steppe bison. Two radiocarbon dates suggest a Holocene age for the bison: a traditional 14C date provided an estimate of 8030±70 14C yr ВР (SPb-743) and an AMS radiocarbon date provided an age of 9497±92 14C yr BP (AA101271). These dates make this the youngest known bison from Chukotka. Analysis of stomach contents revealed a diet of herbaceous plants (meadow grasses and sedges) and shrubs, suggesting that the early Holocene vegetation near the mouth of the Rauchua River was similar to that of the present day: tundra-associated vegetation with undersized plants.