• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

短读比对工具在种系变异识别中的性能表现。

Short-read aligner performance in germline variant identification.

机构信息

Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218, United States.

Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, United States.

出版信息

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad480.

DOI:10.1093/bioinformatics/btad480
PMID:37527006
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10421969/
Abstract

MOTIVATION

Read alignment is an essential first step in the characterization of DNA sequence variation. The accuracy of variant-calling results depends not only on the quality of read alignment and variant-calling software but also on the interaction between these complex software tools.

RESULTS

In this review, we evaluate short-read aligner performance with the goal of optimizing germline variant-calling accuracy. We examine the performance of three general-purpose short-read aligners-BWA-MEM, Bowtie 2, and Arioc-in conjunction with three germline variant callers: DeepVariant, FreeBayes, and GATK HaplotypeCaller. We discuss the behavior of the read aligners with regard to the data elements on which the variant callers rely, and illustrate how the runtime configurations of these software tools combine to affect variant-calling performance.

AVAILABILITY AND IMPLEMENTATION

The quick brown fox jumps over the lazy dog.

摘要

动机

读段比对是 DNA 序列变异特征描述的首要基本步骤。变异调用结果的准确性不仅取决于读段比对和变异调用软件的质量,还取决于这些复杂软件工具之间的相互作用。

结果

在本综述中,我们评估了短读段比对器的性能,目的是优化种系变异调用的准确性。我们考察了三种通用短读段比对器(BWA-MEM、Bowtie 2 和 Arioc)与三种种系变异调用器(DeepVariant、FreeBayes 和 GATK HaplotypeCaller)结合的性能。我们讨论了读段比对器的行为,以及这些数据元素与变异调用器所依赖的数据元素之间的关系,并举例说明了这些软件工具的运行时配置如何结合起来影响变异调用性能。

可利用性和实施

快速的棕色狐狸跳过了懒惰的狗。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc9/10421969/f180816fbc0b/btad480f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc9/10421969/11336fcea076/btad480f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc9/10421969/c7ea72181d68/btad480f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc9/10421969/f180816fbc0b/btad480f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc9/10421969/11336fcea076/btad480f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc9/10421969/c7ea72181d68/btad480f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ddc9/10421969/f180816fbc0b/btad480f3.jpg

相似文献

1
Short-read aligner performance in germline variant identification.短读比对工具在种系变异识别中的性能表现。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad480.
2
Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery.系统基准测试最先进的变异调用管道,确定影响编码序列变异发现准确性的主要因素。
BMC Genomics. 2022 Feb 22;23(1):155. doi: 10.1186/s12864-022-08365-3.
3
Performance evaluation of pipelines for mapping, variant calling and interval padding, for the analysis of NGS germline panels.用于分析NGS种系基因检测板的映射、变异位点检测和区间填充流程的性能评估。
BMC Bioinformatics. 2021 Apr 28;22(1):218. doi: 10.1186/s12859-021-04144-1.
4
Performance optimization in DNA short-read alignment.DNA 短读比对中的性能优化。
Bioinformatics. 2022 Apr 12;38(8):2081-2087. doi: 10.1093/bioinformatics/btac066.
5
Arioc: GPU-accelerated alignment of short bisulfite-treated reads.Arioc:用于短亚硫酸氢盐处理读取物的 GPU 加速对齐。
Bioinformatics. 2018 Aug 1;34(15):2673-2675. doi: 10.1093/bioinformatics/bty167.
6
From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing.从湿实验室到变异:全基因组和全外显子组测序的生物信息学流程的一致性和速度
Hum Mutat. 2016 Dec;37(12):1263-1271. doi: 10.1002/humu.23114. Epub 2016 Sep 26.
7
VariantMetaCaller: automated fusion of variant calling pipelines for quantitative, precision-based filtering.变异元调用器:用于基于定量、精确性筛选的变异调用流程的自动融合。
BMC Genomics. 2015 Oct 28;16:875. doi: 10.1186/s12864-015-2050-y.
8
VC@Scale: Scalable and high-performance variant calling on cluster environments.VC@Scale:在集群环境中进行可扩展且高性能的变体调用。
Gigascience. 2021 Sep 7;10(9). doi: 10.1093/gigascience/giab057.
9
Evaluation of variant calling tools for large plant genome re-sequencing.评价用于大型植物基因组重测序的变异调用工具。
BMC Bioinformatics. 2020 Aug 17;21(1):360. doi: 10.1186/s12859-020-03704-1.
10
Performance analysis of conventional and AI-based variant callers using short and long reads.使用短读长读对常规和基于人工智能的变异调用程序进行性能分析。
BMC Bioinformatics. 2023 Dec 14;24(1):472. doi: 10.1186/s12859-023-05596-3.

引用本文的文献

1
Development and validation of a minimal SNP genotyping panel for the differentiation of Cannabis sativa cultivars.用于区分大麻品种的最小SNP基因分型面板的开发与验证。
BMC Genomics. 2025 Jan 28;26(1):83. doi: 10.1186/s12864-025-11263-z.
2
Bioinformatics of germline variant discovery for rare disease diagnostics: current approaches and remaining challenges.用于罕见病诊断的种系变异发现的生物信息学:当前方法与尚存挑战
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbad508.

本文引用的文献

1
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
2
Variant calling and benchmarking in an era of complete human genome sequences.全基因组序列时代的变异调用和基准测试。
Nat Rev Genet. 2023 Jul;24(7):464-483. doi: 10.1038/s41576-023-00590-0. Epub 2023 Apr 14.
3
Comparison of calling pipelines for whole genome sequencing: an empirical study demonstrating the importance of mapping and alignment.比较全基因组测序的调用管道:一项实证研究表明映射和比对的重要性。
Sci Rep. 2022 Dec 13;12(1):21502. doi: 10.1038/s41598-022-26181-3.
4
Benchmarking challenging small variants with linked and long reads.使用连锁读段和长读段对具有挑战性的小变异进行基准测试。
Cell Genom. 2022 May;2(5). doi: 10.1016/j.xgen.2022.100128.
5
PrecisionFDA Truth Challenge V2: Calling variants from short and long reads in difficult-to-map regions.精准FDA真相挑战V2:在难以映射的区域中从短读长和长读长中识别变异体。
Cell Genom. 2022 May 11;2(5). doi: 10.1016/j.xgen.2022.100129. Epub 2022 Apr 27.
6
Robust and accurate estimation of paralog-specific copy number for duplicated genes using whole-genome sequencing.利用全基因组测序对重复基因进行稳健、准确的直系同源基因拷贝数估计。
Nat Commun. 2022 Jun 9;13(1):3221. doi: 10.1038/s41467-022-30930-3.
7
A complete reference genome improves analysis of human genetic variation.完整的参考基因组提高了人类遗传变异分析的能力。
Science. 2022 Apr;376(6588):eabl3533. doi: 10.1126/science.abl3533. Epub 2022 Apr 1.
8
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
9
Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery.系统基准测试最先进的变异调用管道,确定影响编码序列变异发现准确性的主要因素。
BMC Genomics. 2022 Feb 22;23(1):155. doi: 10.1186/s12864-022-08365-3.
10
Performance optimization in DNA short-read alignment.DNA 短读比对中的性能优化。
Bioinformatics. 2022 Apr 12;38(8):2081-2087. doi: 10.1093/bioinformatics/btac066.