Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA.
Centre for GeoGenetics, Natural History Museum of Denmark, Copenhagen 1350, Denmark; Department of Archaeology, Environment, and Community Planning, Faculty of Humanities and Social Sciences, La Trobe University, Melbourne, VIC 3086, Australia.
Am J Hum Genet. 2013 Nov 7;93(5):852-64. doi: 10.1016/j.ajhg.2013.10.002. Epub 2013 Oct 25.
Most ancient specimens contain very low levels of endogenous DNA, precluding the shotgun sequencing of many interesting samples because of cost. Ancient DNA (aDNA) libraries often contain <1% endogenous DNA, with the majority of sequencing capacity taken up by environmental DNA. Here we present a capture-based method for enriching the endogenous component of aDNA sequencing libraries. By using biotinylated RNA baits transcribed from genomic DNA libraries, we are able to capture DNA fragments from across the human genome. We demonstrate this method on libraries created from four Iron Age and Bronze Age human teeth from Bulgaria, as well as bone samples from seven Peruvian mummies and a Bronze Age hair sample from Denmark. Prior to capture, shotgun sequencing of these libraries yielded an average of 1.2% of reads mapping to the human genome (including duplicates). After capture, this fraction increased substantially, with up to 59% of reads mapped to human and enrichment ranging from 6- to 159-fold. Furthermore, we maintained coverage of the majority of regions sequenced in the precapture library. Intersection with the 1000 Genomes Project reference panel yielded an average of 50,723 SNPs (range 3,062-147,243) for the postcapture libraries sequenced with 1 million reads, compared with 13,280 SNPs (range 217-73,266) for the precapture libraries, increasing resolution in population genetic analyses. Our whole-genome capture approach makes it less costly to sequence aDNA from specimens containing very low levels of endogenous DNA, enabling the analysis of larger numbers of samples.
大多数古代标本中都含有非常低水平的内源性 DNA,由于成本原因,许多有趣的样本都无法进行 shotgun 测序。古 DNA(aDNA)文库通常含有 <1%的内源性 DNA,而大多数测序能力都被环境 DNA 占据。在这里,我们提出了一种基于捕获的方法来富集 aDNA 测序文库中的内源性成分。通过使用从基因组文库转录的生物素化 RNA 诱饵,我们能够捕获来自人类基因组的 DNA 片段。我们在来自保加利亚的四个铁器时代和青铜时代人类牙齿以及来自秘鲁的七个木乃伊和丹麦的青铜时代头发样本的文库上证明了这种方法。在捕获之前,这些文库的 shotgun 测序平均只有 1.2%的读数映射到人类基因组(包括重复序列)。捕获后,这部分显著增加,高达 59%的读数映射到人类,富集倍数从 6 倍到 159 倍不等。此外,我们保持了在预捕获文库中测序的大多数区域的覆盖度。与 1000 基因组计划参考面板的交集在使用 100 万条读数对 postcapture 文库进行测序时产生了平均 50723 个 SNP(范围为 3062-147243),而在预捕获文库中产生了 13280 个 SNP(范围为 217-73266),增加了种群遗传分析的分辨率。我们的全基因组捕获方法使得从含有非常低水平内源性 DNA 的标本中测序 aDNA 的成本更低,从而能够分析更多数量的样本。