• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统评估古 DNA 读段映射。

Systematic benchmark of ancient DNA read mapping.

机构信息

Australian Centre for Ancient DNA, School of Biological Sciences, The University of Adelaide, South Australia, 5005, Australia.

South Australian Museum, Adelaide, SA 5005, Australia.

出版信息

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab076.

DOI:10.1093/bib/bbab076
PMID:33834210
Abstract

The current standard practice for assembling individual genomes involves mapping millions of short DNA sequences (also known as DNA 'reads') against a pre-constructed reference genome. Mapping vast amounts of short reads in a timely manner is a computationally challenging task that inevitably produces artefacts, including biases against alleles not found in the reference genome. This reference bias and other mapping artefacts are expected to be exacerbated in ancient DNA (aDNA) studies, which rely on the analysis of low quantities of damaged and very short DNA fragments (~30-80 bp). Nevertheless, the current gold-standard mapping strategies for aDNA studies have effectively remained unchanged for nearly a decade, during which time new software has emerged. In this study, we used simulated aDNA reads from three different human populations to benchmark the performance of 30 distinct mapping strategies implemented across four different read mapping software-BWA-aln, BWA-mem, NovoAlign and Bowtie2-and quantified the impact of reference bias in downstream population genetic analyses. We show that specific NovoAlign, BWA-aln and BWA-mem parameterizations achieve high mapping precision with low levels of reference bias, particularly after filtering out reads with low mapping qualities. However, unbiased NovoAlign results required the use of an IUPAC reference genome. While relevant only to aDNA projects where reference population data are available, the benefit of using an IUPAC reference demonstrates the value of incorporating population genetic information into the aDNA mapping process, echoing recent results based on graph genome representations.

摘要

目前,组装个体基因组的标准做法是将数百万个短 DNA 序列(也称为 DNA“reads”)与预先构建的参考基因组进行比对。及时对大量短读取进行映射是一项具有挑战性的计算任务,不可避免地会产生伪影,包括对参考基因组中未发现的等位基因的偏见。这种参考偏差和其他映射伪影预计在古 DNA(aDNA)研究中会更加严重,这些研究依赖于对受损和非常短的 DNA 片段(~30-80 bp)的低数量的分析。尽管如此,近十年来,aDNA 研究的当前黄金标准映射策略几乎没有变化,在此期间,新的软件已经出现。在这项研究中,我们使用来自三个不同人类群体的模拟 aDNA 读取来对四种不同读取映射软件(BWA-aln、BWA-mem、NovoAlign 和 Bowtie2)中的 30 种不同映射策略的性能进行基准测试,并量化了参考偏差在下游群体遗传分析中的影响。我们表明,特定的 NovoAlign、BWA-aln 和 BWA-mem 参数化方法在具有低水平参考偏差的情况下实现了高精度的映射,特别是在过滤掉具有低映射质量的读取之后。然而,无偏差的 NovoAlign 结果需要使用 IUPAC 参考基因组。虽然仅与具有参考人群数据的 aDNA 项目相关,但使用 IUPAC 参考的好处证明了将群体遗传信息纳入 aDNA 映射过程的价值,这与最近基于图基因组表示的结果相呼应。

相似文献

1
Systematic benchmark of ancient DNA read mapping.系统评估古 DNA 读段映射。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab076.
2
Removing reference bias and improving indel calling in ancient DNA data analysis by mapping to a sequence variation graph.通过映射到序列变异图来去除古 DNA 数据分析中的参考偏倚并提高插入缺失调用。
Genome Biol. 2020 Sep 17;21(1):250. doi: 10.1186/s13059-020-02160-7.
3
Evaluation and assessment of read-mapping by multiple next-generation sequencing aligners based on genome-wide characteristics.基于全基因组特征,对多种新一代测序比对器的读段比对进行评估。
Genomics. 2017 Jul;109(3-4):186-191. doi: 10.1016/j.ygeno.2017.03.001. Epub 2017 Mar 9.
4
Improving ancient DNA read mapping against modern reference genomes.提高古代 DNA 读取与现代参考基因组的比对。
BMC Genomics. 2012 May 10;13:178. doi: 10.1186/1471-2164-13-178.
5
Benchmarking short sequence mapping tools.短序列比对工具的基准测试。
BMC Bioinformatics. 2013 Jun 7;14:184. doi: 10.1186/1471-2105-14-184.
6
Unravelling reference bias in ancient DNA datasets.揭示古代DNA数据集中的参考偏差
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae436.
7
The presence and impact of reference bias on population genomic studies of prehistoric human populations.史前人类群体的种群基因组研究中参考偏倚的存在和影响。
PLoS Genet. 2019 Jul 26;15(7):e1008302. doi: 10.1371/journal.pgen.1008302. eCollection 2019 Jul.
8
Re-alignment of the unmapped reads with base quality score.将未映射的 reads 与碱基质量得分重新比对。
BMC Bioinformatics. 2015;16 Suppl 5(Suppl 5):S8. doi: 10.1186/1471-2105-16-S5-S8. Epub 2015 Mar 18.
9
Assessing the impact of exact reads on reducing the error rate of read mapping.评估精确读取对降低读取映射错误率的影响。
BMC Bioinformatics. 2018 Nov 6;19(1):406. doi: 10.1186/s12859-018-2432-7.
10
SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees.SRPRISM(单读配对读插入缺失替换最小化器):具有明确保证的组装的高效对齐器。
Gigascience. 2020 Apr 1;9(4). doi: 10.1093/gigascience/giaa023.

引用本文的文献

1
The genetic history of Portugal over the past 5,000 years.葡萄牙过去5000年的基因历史。
Genome Biol. 2025 Aug 18;26(1):248. doi: 10.1186/s13059-025-03707-2.
2
Genetic transitions in the Neolithic and Bronze Age at Mas d'en Boixos (Catalonia, Spain).西班牙加泰罗尼亚地区马斯德恩博伊克斯新石器时代和青铜时代的基因转变
iScience. 2025 Jun 11;28(7):112871. doi: 10.1016/j.isci.2025.112871. eCollection 2025 Jul 18.
3
Optimized in-solution enrichment of over a million ancient human SNPs.超过一百万个古代人类单核苷酸多态性的优化溶液内富集。
Genome Biol. 2025 Jul 3;26(1):190. doi: 10.1186/s13059-025-03622-6.
4
Integrative transcriptomic and metabolomic analysis reveals the flavor formation mechanism of green bamboo () shoots.综合转录组学和代谢组学分析揭示了绿竹笋的风味形成机制。
Food Chem (Oxf). 2025 Apr 28;10:100258. doi: 10.1016/j.fochms.2025.100258. eCollection 2025 Jun.
5
Sedimentary DNA insights into Holocene Adélie penguin (Pygoscelis adeliae) populations and ecology in the Ross Sea, Antarctica.沉积DNA揭示了全新世南极罗斯海阿德利企鹅(阿德利企鹅属阿德利企鹅种)种群及生态情况。
Nat Commun. 2025 Mar 5;16(1):1798. doi: 10.1038/s41467-025-56925-4.
6
Exploring the Potential of Genome-Wide Hybridization Capture Enrichment for Forensic DNA Profiling of Degraded Bones.探索全基因组杂交捕获富集技术在降解骨骼法医DNA分型中的应用潜力。
Genes (Basel). 2024 Dec 26;16(1):23. doi: 10.3390/genes16010023.
7
Pre-processing of paleogenomes: mitigating reference bias and postmortem damage in ancient genome data.古基因组的预处理:减轻古代基因组数据中的参考偏差和死后损伤
Genome Biol. 2025 Jan 9;26(1):6. doi: 10.1186/s13059-024-03462-w.
8
Filtering out the noise: metagenomic classifiers optimize ancient DNA mapping.滤除噪音:宏基因组分类器优化古DNA图谱绘制。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae646.
9
Ancient genomes reveal over two thousand years of dingo population structure.古代基因组揭示了两千多年来野狗种群结构。
Proc Natl Acad Sci U S A. 2024 Jul 23;121(30):e2407584121. doi: 10.1073/pnas.2407584121. Epub 2024 Jul 8.
10
Unravelling reference bias in ancient DNA datasets.揭示古代DNA数据集中的参考偏差
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae436.