The arrival of bison in North America marks one of the most successful large-mammal dispersals from Asia within the last million years, yet the timing and nature of this event remain poorly determined. Here, we used a combined paleontological and paleogenomic approach to provide a robust timeline for the entry and subsequent evolution of bison within North America. We characterized two fossil-rich localities in Canada’s Yukon and identified the oldest well-constrained bison fossil in North America, a 130,000-y-old steppe bison, Bison cf. priscus. We extracted and sequenced mitochondrial genomes from both this bison and from the remains of a recently discovered, ∼120,000-y-old giant long-horned bison, Bison latifrons, from Snowmass, Colorado. We analyzed these and 44 other bison mitogenomes with ages that span the Late Pleistocene, and identified two waves of bison dispersal into North America from Asia, the earliest of which occurred ∼195–135 thousand y ago and preceded the morphological diversification of North American bison, and the second of which occurred during the Late Pleistocene, ∼45–21 thousand y ago. This chronological arc establishes that bison first entered North America during the sea level lowstand accompanying marine isotope stage 6, rejecting earlier records of bison in North America. After their invasion, bison rapidly colonized North America during the last interglaciation, spreading from Alaska through continental North America; they have been continuously resident since then.

Summary Targeted enrichment of conserved genomic regions is a popular method for collecting large amounts of sequence data from non?model taxa for phylogenetic, phylogeographic and population genetic studies. For example, two available bait sets each allow enrichment of thousands of orthologous loci from >20 000 species (Faircloth et al. Systematic Biology, 61, 717?726, 2012; Molecular Ecology Resources, 15, 489?501, 2015). Unfortunately, few open?source workflows are available to identify conserved genomic elements shared among divergent taxa and to design enrichment baits targeting these regions. Those that do exist require extensive bioinformatics expertise and significant amounts of time to use. These shortcomings limit the application of targeted enrichment methods to additional organismal groups. Here, I describe a universal workflow for identifying conserved genomic regions in available genomic data and for designing targeted enrichment baits to collect data from these conserved regions. These methods require less expertise, less time and better use commonly available information to identify conserved loci and design baits to capture them. I apply this computational approach to the understudied arthropod groups Arachnida, Coleoptera, Diptera, Hemiptera or Lepidoptera to identify thousands of conserved loci in each group and design target enrichment baits to capture these loci. I then use in silico analyses to demonstrate that targeted enrichment of the conserved loci can be used to reconstruct the accepted relationships among genome sequences from the focal arthropod orders. The software workflow I created allowed me to identify thousands of conserved loci in five diverse arthropod groups and design sequence capture baits to target them. This suite of capture bait designs should enable collection of phylogenomic data from >900 000 arthropod species. Although the examples in this manuscript focus on understudied arthropod groups, the approach I describe is applicable to all organismal groups having some form of pre?existing genomic information (e.g. other invertebrates, plants, fungi and microbes). Finally, the documentation, design steps, software code and bait sets developed here are available under an open?source license for restriction?free testing, use, and additional modification by any research group.

The organization of the genome in the nucleus and the interactions of genes with their regulatory elements are key features of transcriptional control and their disruption can cause disease. Here we report a genome-wide method, genome architecture mapping (GAM), for measuring chromatin contacts and other features of three-dimensional chromatin topology on the basis of sequencing DNA from a large collection of thin nuclear sections. We apply GAM to mouse embryonic stem cells and identify enrichment for specific interactions between active genes and enhancers across very large genomic distances using a mathematical model termed SLICE (statistical inference of co-segregation). GAM also reveals an abundance of three-way contacts across the genome, especially between regions that are highly transcribed or contain super-enhancers, providing a level of insight into genome architecture that, owing to the technical limitations of current technologies, has previously remained unattainable. Furthermore, GAM highlights a role for gene-expression-specific contacts in organizing the genome in mammalian nuclei. View full text

Recent aDNA studies are progressively focusing on various Neolithic and Hunter – Gatherer (HG) populations, providing arguments in favor of major migrations accompanying European Neolithisation. The major focus was so far on the Linear Pottery Culture (LBK), which introduced the Neolithic way of life in Central Europe in the second half of 6th millennium BC. It is widely agreed that people of this culture were genetically different from local HGs and no genetic exchange is seen between the two groups. From the other hand some degree of resurgence of HGs genetic component is seen in late Neolithic groups belonging to the complex of the Funnel Beaker Cultures (TRB). Less attention is brought to various middle Neolithic cultures belonging to Late Danubian sequence which chronologically fall in between those two abovementioned groups. We suspected that genetic influx from HG to farming communities might have happened in Late Danubian cultures since archaeologists see extensive contacts between those two communities.

Article

Cytosine methylation plays an important role in the epigenetic regulation of eukaryotic gene expression. The methyl-CpG binding domain (MBD) is common to a family of eukaryotic transcriptional regulators. How MBD, a stretch of about 80 amino acids, recognizes CpGs in a methylation dependent manner, and as a function of sequence, is only partly understood. Here we show, using an Escherichia coli cell-free expression system, that MBD from the human transcriptional regulator MeCP2 performs as a specific, methylation-dependent repressor in conjunction with the BDNF (brain-derived neurotrophic factor) promoter sequence. Mutation of either base flanking the central CpG pair changes the expression level of the target gene. However, the relative degree of repression as a function of MBD concentration remains unaltered. Molecular dynamics simulations that address the DNA B fiber ratio and the handedness reveal cooperative transitions in the promoter DNA upon MBD binding that correlate well with our experimental observations. We suggest that not only steric hindrance, but also conformational changes of the BDNF promoter as a result of MBD binding are required for MBD to act as a specific inhibitory element. Our work demonstrates that the prokaryotic transcription machinery can reproduce features of epigenetic mammalian transcriptional regulatory elements.

Abstract— Wake Atoll is an isolated chain of three islets located in the Western Pacific. Included in its endemic flora is a representative of the genus Gossypium colloquially referred to as Wake Island cotton. Stanley G. Stephens pointed out that “Wake Island cotton does not resemble closely either the Caribbean or other Pacific forms.” Taking into consideration morphological distinctions, the geographic isolation of Wake Atoll, and newly generated molecular data presented here, we conclude that the cottons of Wake Atoll do in fact represent a new species of Gossypium, here named Gossypium stephensii . This name is chosen to commemorate the eminent natural historian, evolutionary geneticist, and cotton biologist, S. G. Stephens.

Population genetic studies of nonmodel organisms frequently employ reduced representation library (RRL) methodologies, many of which rely on protocols in which genomic DNA is digested by one or more restriction enzymes. However, because high molecular weight DNA is recommended for these protocols, samples with degraded DNA are generally unsuitable for RRL methods. Given that ancient and historic specimens can provide key temporal perspectives to evolutionary questions, we explored how custom-designed RNA probes could enrich for RRL loci (Restriction Enzyme-Associated Loci baits, or REALbaits). Starting with genotyping-by-sequencing (GBS) data generated on modern common ragweed (Ambrosia artemisiifolia L.) specimens, we designed 20 000 RNA probes to target well-characterized genomic loci in herbarium voucher specimens dating from 1835 to 1913. Compared to shotgun sequencing, we observed enrichment of the targeted loci at 19- to 151-fold. Using our GBS capture pipeline on a data set of 38 herbarium samples, we discovered 22 813 SNPs, providing sufficient genomic resolution to distinguish geographic populations. For these samples, we found that dilution of REALbaits to 10% of their original concentration still yielded sufficient data for downstream analyses and that a sequencing depth of ~7m reads was sufficient to characterize most loci without wasting sequencing capacity. In addition, we observed that targeted loci had highly variable rates of success, which we primarily attribute to similarity between loci, a trait that ultimately interferes with unambiguous read mapping. Our findings can help researchers design capture experiments for RRL loci, thereby providing an efficient means to integrate samples with degraded DNA into existing RRL data sets.

The performance of hybridization capture combined with next-generation sequencing (NGS) has seen limited investigation with samples from hot and arid regions until now. We applied hybridization capture and shotgun sequencing to recover DNA sequences from bone specimens of ancient-domestic dromedary (Camelus dromedarius) and its extinct ancestor, the wild dromedary from Jordan, Syria, Turkey and the Arabian Peninsula, respectively. Our results show that hybridization capture increased the percentage of mitochondrial DNA (mtDNA) recovery by an average 187-fold and in some cases yielded virtually complete mitochondrial (mt) genomes at multifold coverage in a single capture experiment. Furthermore, we tested the effect of hybridization temperature and time by using a touchdown approach on a limited number of samples. We observed no significant difference in the number of unique dromedary mtDNA reads retrieved with the standard capture compared to the touchdown method. In total, we obtained 14 partial mitochondrial genomes from ancient-domestic dromedaries with 17–95% length coverage and 1.27–47.1-fold read depths for the covered regions. Using whole-genome shotgun sequencing, we successfully recovered endogenous dromedary nuclear DNA (nuDNA) from domestic and wild dromedary specimens with 1–1.06-fold read depths for covered regions. Our results highlight that despite recent methodological advances, obtaining ancient DNA (aDNA) from specimens recovered from hot, arid environments is still problematic. Hybridization protocols require specific optimization, and samples at the limit of DNA preservation need multiple replications of DNA extraction and hybridization capture as has been shown previously for Middle Pleistocene specimens.