• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类基因组数据种系变异调用管道的准确性和效率。

Accuracy and efficiency of germline variant calling pipelines for human genome data.

机构信息

Department of Tumor Biology, Institute of Cancer Research, The Norwegian Radium Hospital, Oslo University Hospital, 0310, Oslo, Norway.

DNV GL, 1363, Høvik, Norway.

出版信息

Sci Rep. 2020 Nov 19;10(1):20222. doi: 10.1038/s41598-020-77218-4.

DOI:10.1038/s41598-020-77218-4
PMID:33214604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7678823/
Abstract

Advances in next-generation sequencing technology have enabled whole genome sequencing (WGS) to be widely used for identification of causal variants in a spectrum of genetic-related disorders, and provided new insight into how genetic polymorphisms affect disease phenotypes. The development of different bioinformatics pipelines has continuously improved the variant analysis of WGS data. However, there is a necessity for a systematic performance comparison of these pipelines to provide guidance on the application of WGS-based scientific and clinical genomics. In this study, we evaluated the performance of three variant calling pipelines (GATK, DRAGEN and DeepVariant) using the Genome in a Bottle Consortium, "synthetic-diploid" and simulated WGS datasets. DRAGEN and DeepVariant show better accuracy in SNP and indel calling, with no significant differences in their F1-score. DRAGEN platform offers accuracy, flexibility and a highly-efficient execution speed, and therefore superior performance in the analysis of WGS data on a large scale. The combination of DRAGEN and DeepVariant also suggests a good balance of accuracy and efficiency as an alternative solution for germline variant detection in further applications. Our results facilitate the standardization of benchmarking analysis of bioinformatics pipelines for reliable variant detection, which is critical in genetics-based medical research and clinical applications.

摘要

下一代测序技术的进步使得全基因组测序(WGS)能够广泛用于鉴定一系列遗传相关疾病的因果变异,并深入了解遗传多态性如何影响疾病表型。不同的生物信息学管道的发展不断改进了 WGS 数据的变异分析。然而,有必要对这些管道进行系统的性能比较,为基于 WGS 的科学和临床基因组学的应用提供指导。在这项研究中,我们使用基因组瓶联合会、“合成二倍体”和模拟 WGS 数据集评估了三种变异调用管道(GATK、DRAGEN 和 DeepVariant)的性能。DRAGEN 和 DeepVariant 在 SNP 和 indel 调用方面具有更好的准确性,其 F1 分数没有显著差异。DRAGEN 平台在准确性、灵活性和高效的执行速度方面表现出色,因此在大规模的 WGS 数据分析方面具有优越的性能。DRAGEN 和 DeepVariant 的结合也为种系变异检测提供了一种准确性和效率的良好平衡,是进一步应用中替代解决方案。我们的结果促进了生物信息学管道基准分析的标准化,以实现可靠的变异检测,这在基于遗传学的医学研究和临床应用中至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/832394a71aa5/41598_2020_77218_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/f740094f640c/41598_2020_77218_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/f55308983d24/41598_2020_77218_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/a86c44fd7508/41598_2020_77218_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/832394a71aa5/41598_2020_77218_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/f740094f640c/41598_2020_77218_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/f55308983d24/41598_2020_77218_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/a86c44fd7508/41598_2020_77218_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd5e/7678823/832394a71aa5/41598_2020_77218_Fig4_HTML.jpg

相似文献

1
Accuracy and efficiency of germline variant calling pipelines for human genome data.人类基因组数据种系变异调用管道的准确性和效率。
Sci Rep. 2020 Nov 19;10(1):20222. doi: 10.1038/s41598-020-77218-4.
2
Comparison of calling pipelines for whole genome sequencing: an empirical study demonstrating the importance of mapping and alignment.比较全基因组测序的调用管道:一项实证研究表明映射和比对的重要性。
Sci Rep. 2022 Dec 13;12(1):21502. doi: 10.1038/s41598-022-26181-3.
3
Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers.跨多种下一代测序仪的种系变异调用管道的系统比较。
Sci Rep. 2019 Jun 27;9(1):9345. doi: 10.1038/s41598-019-45835-3.
4
Comparison of GATK and DeepVariant by trio sequencing.基于 trio 测序的 GATK 和 DeepVariant 比较。
Sci Rep. 2022 Feb 2;12(1):1809. doi: 10.1038/s41598-022-05833-4.
5
Benchmarking variant callers in next-generation and third-generation sequencing analysis.在新一代和第三代测序分析中对变异调用程序进行基准测试。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa148.
6
Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery.系统基准测试最先进的变异调用管道,确定影响编码序列变异发现准确性的主要因素。
BMC Genomics. 2022 Feb 22;23(1):155. doi: 10.1186/s12864-022-08365-3.
7
Comparison of three variant callers for human whole genome sequencing.三种人类全基因组测序变异 caller 的比较。
Sci Rep. 2018 Dec 14;8(1):17851. doi: 10.1038/s41598-018-36177-7.
8
A comparative investigation of single nucleotide variant calling for a personal non-Caucasian sequencing sample.对一个非高加索个体测序样本的单核苷酸变异调用进行比较研究。
Genes Genomics. 2023 Dec;45(12):1527-1536. doi: 10.1007/s13258-023-01439-w. Epub 2023 Aug 31.
9
Performance assessment of variant calling pipelines using human whole exome sequencing and simulated data.使用人类全外显子组测序和模拟数据评估变异调用管道的性能。
BMC Bioinformatics. 2019 Jun 17;20(1):342. doi: 10.1186/s12859-019-2928-9.
10
Variant callers for next-generation sequencing data: a comparison study.下一代测序数据的变异调用者:一项比较研究。
PLoS One. 2013 Sep 27;8(9):e75619. doi: 10.1371/journal.pone.0075619. eCollection 2013.

引用本文的文献

1
Learning a refinement model for variant analysis in non-human primate genomes.学习用于非人灵长类动物基因组变异分析的优化模型。
BMC Genomics. 2025 Aug 25;26(1):775. doi: 10.1186/s12864-025-11921-2.
2
Overcoming limitations to customize DeepVariant for domesticated animals with TrioTrain.利用TrioTrain克服限制以定制适用于家养动物的DeepVariant。
Genome Res. 2025 Aug 1;35(8):1859-1874. doi: 10.1101/gr.279542.124.
3
Benchmarking accelerated next-generation sequencing analysis pipelines.对加速下一代测序分析流程进行基准测试。

本文引用的文献

1
Benchmarking workflows to assess performance and suitability of germline variant calling pipelines in clinical diagnostic assays.评估用于临床诊断检测的种系变异calling 管道性能和适用性的基准测试工作流程。
BMC Bioinformatics. 2021 Feb 24;22(1):85. doi: 10.1186/s12859-020-03934-3.
2
Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers.跨多种下一代测序仪的种系变异调用管道的系统比较。
Sci Rep. 2019 Jun 27;9(1):9345. doi: 10.1038/s41598-019-45835-3.
3
An open resource for accurately benchmarking small variant and reference calls.
Bioinform Adv. 2025 May 15;5(1):vbaf085. doi: 10.1093/bioadv/vbaf085. eCollection 2025.
4
: A cloud-based genomics infrastructure with variant-calling pipeline suited for population-scale sequencing projects.一个基于云的基因组学基础设施,带有适用于群体规模测序项目的变异检测流程。
medRxiv. 2025 Apr 30:2025.04.29.25326690. doi: 10.1101/2025.04.29.25326690.
5
A population-based analysis of the molecular landscape of glioma in adolescents and young adults reveals insights into gliomagenesis.一项基于人群的青少年和年轻成人胶质瘤分子图谱分析揭示了胶质瘤发生的相关见解。
Nat Cancer. 2025 May 7. doi: 10.1038/s43018-025-00962-x.
6
UNISOM: Unified Somatic Calling and Machine Learning-based Classification Enhance the Discovery of CHIP.UNISOM:统一体细胞变异检测与基于机器学习的分类提升了克隆性造血的发现
Genomics Proteomics Bioinformatics. 2025 May 30;23(2). doi: 10.1093/gpbjnl/qzaf040.
7
Benchmarking of variant calling software for whole-exome sequencing using gold standard datasets.使用金标准数据集对全外显子组测序的变异检测软件进行基准测试。
Sci Rep. 2025 Apr 21;15(1):13697. doi: 10.1038/s41598-025-97047-7.
8
Prospective, multicenter validation of a platform for rapid molecular profiling of central nervous system tumors.中枢神经系统肿瘤快速分子谱分析平台的前瞻性多中心验证
Nat Med. 2025 May;31(5):1567-1577. doi: 10.1038/s41591-025-03562-5. Epub 2025 Mar 25.
9
Neoadjuvant triplet immune checkpoint blockade in newly diagnosed glioblastoma.新诊断胶质母细胞瘤的新辅助三联免疫检查点阻断疗法
Nat Med. 2025 May;31(5):1557-1566. doi: 10.1038/s41591-025-03512-1. Epub 2025 Feb 27.
10
Whole exome sequencing reveals ABCD1 variant as a potential contributor to male infertility.全外显子组测序揭示ABCD1基因变异可能是男性不育的一个因素。
Mol Biol Rep. 2025 Jan 22;52(1):148. doi: 10.1007/s11033-025-10234-7.
用于准确基准测试小型变体和参考调用的开放资源。
Nat Biotechnol. 2019 May;37(5):561-566. doi: 10.1038/s41587-019-0074-6. Epub 2019 Apr 1.
4
Systematic benchmarking of omics computational tools.系统生物学计算工具的基准测试。
Nat Commun. 2019 Mar 27;10(1):1393. doi: 10.1038/s41467-019-09406-4.
5
Best practices for benchmarking germline small-variant calls in human genomes.人类基因组中小变异calls 的基准测试最佳实践。
Nat Biotechnol. 2019 May;37(5):555-560. doi: 10.1038/s41587-019-0054-x. Epub 2019 Mar 11.
6
Comparative analysis of whole-genome sequencing pipelines to minimize false negative findings.全基因组测序流程的比较分析,以尽量减少假阴性发现。
Sci Rep. 2019 Mar 1;9(1):3219. doi: 10.1038/s41598-019-39108-2.
7
Empirical evaluation of variant calling accuracy using ultra-deep whole-genome sequencing data.使用超深度全基因组测序数据进行变异调用准确性的实证评估。
Sci Rep. 2019 Feb 11;9(1):1784. doi: 10.1038/s41598-018-38346-0.
8
Comparison of three variant callers for human whole genome sequencing.三种人类全基因组测序变异 caller 的比较。
Sci Rep. 2018 Dec 14;8(1):17851. doi: 10.1038/s41598-018-36177-7.
9
A universal SNP and small-indel variant caller using deep neural networks.使用深度神经网络的通用 SNP 和小插入缺失变体调用器。
Nat Biotechnol. 2018 Nov;36(10):983-987. doi: 10.1038/nbt.4235. Epub 2018 Sep 24.
10
A synthetic-diploid benchmark for accurate variant-calling evaluation.用于准确变异呼叫评估的合成二倍体基准。
Nat Methods. 2018 Aug;15(8):595-597. doi: 10.1038/s41592-018-0054-7. Epub 2018 Jul 16.