Centre for GeoGenetics, Natural History Museum of Denmark, Copenhagen University, Copenhagen DK-1350, Denmark.
Genome Res. 2011 Oct;21(10):1705-19. doi: 10.1101/gr.122747.111. Epub 2011 Jul 29.
Second-generation sequencing platforms have revolutionized the field of ancient DNA, opening access to complete genomes of past individuals and extinct species. However, these platforms are dependent on library construction and amplification steps that may result in sequences that do not reflect the original DNA template composition. This is particularly true for ancient DNA, where templates have undergone extensive damage post-mortem. Here, we report the results of the first "true single molecule sequencing" of ancient DNA. We generated 115.9 Mb and 76.9 Mb of DNA sequences from a permafrost-preserved Pleistocene horse bone using the Helicos HeliScope and Illumina GAIIx platforms, respectively. We find that the percentage of endogenous DNA sequences derived from the horse is higher among the Helicos data than Illumina data. This result indicates that the molecular biology tools used to generate sequencing libraries of ancient DNA molecules, as required for second-generation sequencing, introduce biases into the data that reduce the efficiency of the sequencing process and limit our ability to fully explore the molecular complexity of ancient DNA extracts. We demonstrate that simple modifications to the standard Helicos DNA template preparation protocol further increase the proportion of horse DNA for this sample by threefold. Comparison of Helicos-specific biases and sequence errors in modern DNA with those in ancient DNA also reveals extensive cytosine deamination damage at the 3' ends of ancient templates, indicating the presence of 3'-sequence overhangs. Our results suggest that paleogenomes could be sequenced in an unprecedented manner by combining current second- and third-generation sequencing approaches.
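The cytosine deamination signal described in the abstract is commonly summarized as the frequency of reference-C/read-T mismatches as a function of distance from a read end. The following is a minimal sketch of that tabulation for the 3' end, not the authors' pipeline: the function name `c_to_t_by_3prime_distance`, the `window` parameter, and the toy sequences are all hypothetical, and the input is assumed to be gap-free aligned (reference, read) pairs written 5' to 3' along the read strand.

```python
from collections import Counter


def c_to_t_by_3prime_distance(pairs, window=5):
    """Return {distance_from_3prime_end: C->T mismatch frequency}.

    pairs  : iterable of (reference, read) strings of equal length,
             gap-free and oriented 5'->3' along the read (assumption).
    window : number of 3'-terminal positions to inspect (hypothetical default).
    """
    c_sites = Counter()   # reference C positions seen at each distance
    c_to_t = Counter()    # of those, positions where the read shows T
    for ref, read in pairs:
        if len(ref) != len(read):
            raise ValueError("aligned sequences must have equal length")
        n = len(ref)
        for i in range(max(0, n - window), n):
            dist = n - 1 - i  # 0 is the last base at the 3' end
            if ref[i].upper() == "C":
                c_sites[dist] += 1
                if read[i].upper() == "T":
                    c_to_t[dist] += 1
    # Report only distances where at least one reference C was observed.
    return {d: c_to_t[d] / c_sites[d] for d in range(window) if c_sites[d]}


# Toy, made-up alignments for illustration only (not data from the study).
toy_pairs = [
    ("ACGTACGC", "ACGTACGT"),
    ("TTGACCAC", "TTGACCAT"),
    ("GGCATTCC", "GGCATTCC"),
]
profile = c_to_t_by_3prime_distance(toy_pairs, window=3)
for dist in sorted(profile):
    print(dist, round(profile[dist], 3))
```

In a profile of this kind, an elevated C→T frequency at distance 0 that decays toward the read interior is the pattern usually read as terminal deamination damage. Real analyses would work from alignment files and handle indels, strand orientation, and base quality, all of which this sketch omits.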