Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark.
BGI-Shenzhen, Shenzhen 518083, China.
Genes (Basel). 2019 Jul 5;10(7):509. doi: 10.3390/genes10070509.
The taxonomical identification merely based on morphology is often difficult for ancient remains. Therefore, universal or specific PCR amplification followed by sequencing and BLAST (basic local alignment search tool) search has become the most frequently used genetic-based method for the species identification of biological samples, including ancient remains. However, it is challenging for these methods to process extremely ancient samples with severe DNA fragmentation and contamination. Here, we applied whole-genome sequencing data from 12 ancient samples with ages ranging from 2.7 to 700 kya to compare different mapping algorithms, and tested different reference databases, mapping similarities and query coverage to explore the best method and mapping parameters that can improve the accuracy of ancient mammal species identification. The selected method and parameters were tested using 152 ancient samples, and 150 of the samples were successfully identified. We further screened the BLAST-based mapping results according to the deamination characteristics of ancient DNA to improve the ability of ancient species identification. Our findings demonstrate a marked improvement to the normal procedures used for ancient species identification, which was achieved through defining the mapping and filtering guidelines to identify true ancient DNA sequences. The guidelines summarized in this study could be valuable in archaeology, paleontology, evolution, and forensic science. For the convenience of the scientific community, we wrote a software script with Perl, called AncSid, which is made available on GitHub.
基于形态学的分类鉴定通常对于古代遗骸来说具有一定难度。因此,普遍或特定的 PCR 扩增,随后进行测序和 BLAST(基本局部比对搜索工具)搜索,已成为鉴定生物样本(包括古代遗骸)物种的最常用遗传方法。然而,这些方法在处理具有严重 DNA 片段化和污染的极其古老的样本时具有挑战性。在这里,我们应用了来自 12 个古代样本的全基因组测序数据,这些样本的年龄从 2.7 到 700 kya 不等,以比较不同的映射算法,并测试了不同的参考数据库、映射相似度和查询覆盖率,以探索可以提高古代哺乳动物物种鉴定准确性的最佳方法和映射参数。使用 152 个古代样本对选定的方法和参数进行了测试,其中 150 个样本成功得到鉴定。我们根据古代 DNA 的脱氨基特征进一步筛选了基于 BLAST 的映射结果,以提高古代物种鉴定的能力。我们的研究结果表明,通过定义映射和过滤准则来识别真正的古代 DNA 序列,对古代物种鉴定的常规程序有显著改进。本研究总结的准则对于考古学、古生物学、进化和法医学可能具有重要价值。为了方便科学界使用,我们使用 Perl 编写了一个名为 AncSid 的软件脚本,并将其发布在 GitHub 上。