Museum genomics provide an opportunity to investigate population demographics of extinct species, especially valuable when research prior to extinction was minimal. The Bachman’s warbler (Vermivora bachmanii) is hypothesized to have gone extinct due to loss of its specialized habitat. However, little is known about other potential contributing factors such as natural rarity or changes to connectivity following habitat fragmentation. We examined mitochondrial DNA (mtDNA) and genome-wide SNPs using specimens collected from breeding and migration sites across the range of the Bachman’s warbler. We found no signals of strong population structuring across the breeding range of Bachman’s warblers in both mtDNA and genome-wide SNPs. Thus, long-term population isolation did not appear to be a significant contributor to the extinction of the Bachman’s warbler. Instead, our findings support the theory that Bachman’s warblers underwent a rapid decline likely driven by habitat destruction, which may have been exacerbated by the natural rarity, habitat specificity and low genetic diversity of the species.

Many CRISPR-Cas immune systems generate guide (g)RNAs using trans-activating CRISPR RNAs (tracrRNAs). Recent work revealed that Cas9 tracrRNAs could be reprogrammed to convert any RNA-of-interest into a gRNA, linking the RNA’s presence to Cas9-mediated cleavage of double-stranded (ds)DNA. Here, we reprogram tracrRNAs from diverse Cas12 nucleases, linking the presence of an RNA-of-interest to dsDNA cleavage and subsequent collateral single-stranded DNA cleavage—all without the RNA necessarily encoding a protospacer-adjacent motif (PAM). After elucidating nuclease-specific design rules, we demonstrate PAM-independent RNA detection with Cas12b, Cas12e, and Cas12f nucleases. Furthermore, rationally truncating the dsDNA target boosts collateral cleavage activity, while the absence of a gRNA reduces background collateral activity and enhances sensitivity. Finally, we apply this platform to detect 16 S rRNA sequences from five different bacterial pathogens using a universal reprogrammed tracrRNA. These findings extend tracrRNA reprogramming to diverse dsDNA-targeting Cas12 nucleases, expanding the flexibility and versatility of CRISPR-based RNA detection.

Ligilactobacillus is a diverse genus among lactobacilli with phenotypes that reflect adaptation to various hosts. CRISPR-Cas systems are highly prevalent within lactobacilli, and Ligilactobacillus salivarius, the most abundant species of Ligilactobacillus, possesses both DNA- and RNA-targeting CRISPR-Cas systems. In this study, we explore the presence and functional properties of I-B, I-C, I-E, II-A, and III-A CRISPR-Cas systems in over 500 Ligilactobacillus genomes, emphasizing systems found in L. salivarius. We examined the I-E, II-A, and III-A CRISPR-Cas systems of two L. salivarius strains and observed occurrences of split cas genes and differences in CRISPR RNA maturation in native hosts. This prompted testing of the single Cas9 and multiprotein Cascade and Csm CRISPR-Cas effector complexes in a cell-free context to demonstrate the functionality of these systems. We also predicted self-targeting spacers within L. salivarius CRISPR-Cas systems and found that nearly a third of L. salivarius genomes possess unique self-targeting spacers that generally target elements other than prophages. With these two L. salivarius strains, we performed prophage induction coupled with RNA sequencing and discovered that the prophages residing within these strains are inducible and likely active elements, despite targeting by CRISPR-Cas systems. These findings deepen our comprehension of CRISPR-Cas systems in L. salivarius, further elucidating their relationship with associated prophages and providing a functional basis for the repurposing of these Cas effectors for bacterial manipulation.

AngiospeIn the period between 5,300 and 4,900 calibrated years before present (cal. bp), populations across large parts of Europe underwent a period of demographic decline1,2. However, the cause of this so-called Neolithic decline is still debated. Some argue for an agricultural crisis resulting in the decline3, others for the spread of an early form of plague4. Here we use population-scale ancient genomics to infer ancestry, social structure and pathogen infection in 108 Scandinavian Neolithic individuals from eight megalithic graves and a stone cist. We find that the Neolithic plague was widespread, detected in at least 17% of the sampled population and across large geographical distances. We demonstrate that the disease spread within the Neolithic community in three distinct infection events within a period of around 120 years. Variant graph-based pan-genomics shows that the Neolithic plague genomes retained ancestral genomic variation present in Yersinia pseudotuberculosis, including virulence factors associated with disease outcomes. In addition, we reconstruct four multigeneration pedigrees, the largest of which consists of 38 individuals spanning six generations, showing a patrilineal social organization. Lastly, we document direct genomic evidence for Neolithic female exogamy in a woman buried in a different megalithic tomb than her brothers. Taken together, our findings provide a detailed reconstruction of plague spread within a large patrilineal kinship group and identify multiple plague infections in a population dated to the beginning of the Neolithic decline.

Angiosperms are the cornerstone of most terrestrial ecosystems and human livelihoods1,2. A robust understanding of angiosperm evolution is required to explain their rise to ecological dominance. So far, the angiosperm tree of life has been determined primarily by means of analyses of the plastid genome3,4. Many studies have drawn on this foundational work, such as classification and first insights into angiosperm diversification since their Mesozoic origins5,6,7. However, the limited and biased sampling of both taxa and genomes undermines confidence in the tree and its implications. Here, we build the tree of life for almost 8,000 (about 60%) angiosperm genera using a standardized set of 353 nuclear genes8. This 15-fold increase in genus-level sampling relative to comparable nuclear studies9 provides a critical test of earlier results and brings notable change to key groups, especially in rosids, while substantiating many previously predicted relationships. Scaling this tree to time using 200 fossils, we discovered that early angiosperm evolution was characterized by high gene tree conflict and explosive diversification, giving rise to more than 80% of extant angiosperm orders. Steady diversification ensued through the remaining Mesozoic Era until rates resurged in the Cenozoic Era, concurrent with decreasing global temperatures and tightly linked with gene tree conflict. Taken together, our extensive sampling combined with advanced phylogenomic methods shows the deep history and full complexity in the evolution of a megadiverse clade.

An excavation conducted at Harewood Cemetery to identify the unmarked grave of Samuel Washington resulted in the discovery of burials presumably belonging to George Washington’s paternal grandnephews and their mother, Lucy Payne. To confirm their identities this study examined Y-chromosomal, mitochondrial, and autosomal DNA from the burials and a living Washington descendant. The burial’s Y-STR profile was compared to FamilyTreeDNA’s database, which resulted in a one-step difference from the living descendant and an exact match to another Washington. A more complete Y-STR and Y-SNP profile from the descendant was inferred to be the Washington Y profile. Kinship comparisons performed in relation to the descendant, who is a 4th and 5th degree relative of the putative individuals, resulted in >37,000 overlapping autosomal SNPs and strong statistical support with likelihood ratios exceeding one billion. This study highlights the benefits of a multi-marker approach for kinship prediction and DNA-assisted identification of historical remains.

Parchment, the skins of animals prepared for use as writing surfaces, offers a valuable source of genetic information. Many have clearly defined provenance, allowing for the genetic findings to be evaluated in temporal and spatial context. While these documents can yield evidence of the animal sources, the DNA contained within these aged skins is often damaged and fragmented. Previously, genetic studies targeting parchment have used destructive sampling techniques and so the development and validation of non-destructive sampling methods would expand opportunities and facilitate testing of more precious documents, especially those with historical significance. Here we present genetic data obtained by non-destructive sampling of eight parchments spanning the 15th century to the modern day. We define a workflow for enriching the mitochondrial genome (mtGenome), generating next-generation sequencing reads to permit species identification, and providing interpretation guidance. Using sample replication, comparisons to destructively sampled controls, and by establishing authentication criteria, we were able to confidently assign full/near full mtGenome sequences to 56.3% of non-destructively sampled parchments, each with greater than 90% of the mtGenome reference covered. Six of eight parchments passed all four established thresholds with at least one non-destructive sample, highlighting promise for future studies.

Disease resistance genes in livestock provide health benefits to animals and opportunities for farmers to meet the growing demand for affordable, high-quality protein. Previously, researchers used gene editing to modify the porcine CD163 gene and demonstrated resistance to a harmful virus that causes porcine reproductive and respiratory syndrome (PRRS). To maximize potential benefits, this disease resistance trait needs to be present in commercially relevant breeding populations for multiplication and distribution of pigs. Toward this goal, a first of-its-kind, scaled gene editing program was established to introduce a single modified CD163 allele into four genetically diverse, elite porcine lines. This effort produced healthy pigs that resisted PRRS virus infection as determined by macrophage and animal challenges. This founder population will be used for additional disease and trait testing, multiplication, and commercial distribution upon regulatory approval. Applying CRISPR-Cas to eliminate a viral disease represents a major step toward improving animal health.

Directly observing the chronology and tempo of adaptation in response to ecological change is rarely possible in natural ecosystems. Sedimentary ancient DNA (sedaDNA) has been shown to be a tractable source of genome-scale data of long-dead organisms1,2,3 and to thereby potentially provide an understanding of the evolutionary histories of past populations.4,5 To date, time series of ecosystem biodiversity have been reconstructed from sedaDNA, typically using DNA metabarcoding or shotgun sequence data generated from less than 1 g of sediment.6,7 Here, we maximize sequence coverage by extracting DNA from ∼50× more sediment per sample than the majority of previous studies1,2,3 to achieve genotype resolution. From a time series of Late Pleistocene sediments spanning from a marine to freshwater ecosystem, we compare adaptive genotypes reconstructed from the environmental genomes of three-spined stickleback at key time points of this transition. We find a staggered temporal dynamic in which freshwater alleles at known loci of large effect in marine-freshwater divergence of three-spined stickleback (e.g., EDA)8 were already established during the brackish phase of the formation of the isolation basin. However, marine alleles were still detected across the majority of marine-freshwater divergence-associated loci, even after the complete isolation of the lake from marine ingression. Our retrospective approach to studying adaptation from environmental genomes of three-spined sticklebacks at the end of the last glacial period complements contemporary experimental approaches9,10,11 and highlights the untapped potential for retrospective “evolve and resequence” natural experiments using sedaDNA.

Investigative genetic genealogy (IGG) has emerged as a highly effective tool for tying a forensic DNA sample to an identity. While much of the attention paid to IGG has focused on cases where the DNA is from an unknown suspect, IGG has also been used to help close hundreds of unidentified human remains (UHR) cases. Genome-wide single-nucleotide polymorphism (SNP) genotype data can be obtained from forensic samples using microarray genotyping or whole-genome sequencing (WGS) with protocols optimized for degraded DNA. After bioinformatic processing, the SNP data can be uploaded to public GG databases that allow law enforcement usage, where it can be compared with other users’ data to find distant relatives. A genetic genealogist can then build the family trees of the relatives to narrow down the identity of the source of the forensic DNA sample. To date, 36 UHR identifications using IGG have been publicly announced. The same IGG techniques developed and refined for UHR cases have significant potential for disaster victim identification, where DNA is often extremely compromised, and close family references may not be available. This paper reviews the laboratory, bioinformatic, and genealogical techniques used in IGG for UHR cases and presents three case studies that demonstrate how IGG is assisting with remains identification.