• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从湿实验室到变异:全基因组和全外显子组测序的生物信息学流程的一致性和速度

From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing.

作者信息

Laurie Steve, Fernandez-Callejo Marcos, Marco-Sola Santiago, Trotta Jean-Remi, Camps Jordi, Chacón Alejandro, Espinosa Antonio, Gut Marta, Gut Ivo, Heath Simon, Beltran Sergi

机构信息

CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.

Universitat Pompeu Fabra (UPF), Barcelona, Spain.

出版信息

Hum Mutat. 2016 Dec;37(12):1263-1271. doi: 10.1002/humu.23114. Epub 2016 Sep 26.

DOI:10.1002/humu.23114
PMID:27604516
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5129537/
Abstract

As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next-generation sequencing as standard practice in research and diagnostics. However, computing cost-performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state-of-the-art read aligners (BWA-MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available.

摘要

随着全基因组测序变得更加便宜和快速,它将逐渐取代靶向新一代测序,成为研究和诊断中的标准做法。然而,计算成本效益比并没有以相同的速度提升。因此,考虑到所需的计算资源,评估变异检测过程的稳健性至关重要。我们在来自NA12878人类样本的全基因组和全外显子组测序数据上,对六种最先进的读段比对器(BWA-MEM和GEM3)和变异调用器(FreeBayes、GATK HaplotypeCaller、SAMtools)的组合进行了基准测试。将它们之间的结果以及与美国国家标准与技术研究院(NIST)基因组瓶中基因组(GIAB)变异参考数据集的结果进行了比较。我们报告了该过程某些步骤中高达20倍的速度差异,并观察到在70%的基因组中,单核苷酸变异(SNV)以及程度较轻的插入缺失(InDel)检测高度一致。在20%的基因组中,SNV尤其是InDel检测不太可靠,而在其余10%的基因组中几乎不可行。这些发现将有助于根据目标、工作量和可用的计算基础设施选择合适的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73f2/5129537/2215db3500d3/HUMU-37-1263-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73f2/5129537/c799c8aeef93/HUMU-37-1263-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73f2/5129537/8ccea4299517/HUMU-37-1263-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73f2/5129537/2215db3500d3/HUMU-37-1263-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73f2/5129537/c799c8aeef93/HUMU-37-1263-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73f2/5129537/8ccea4299517/HUMU-37-1263-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/73f2/5129537/2215db3500d3/HUMU-37-1263-g003.jpg

相似文献

1
From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing.从湿实验室到变异:全基因组和全外显子组测序的生物信息学流程的一致性和速度
Hum Mutat. 2016 Dec;37(12):1263-1271. doi: 10.1002/humu.23114. Epub 2016 Sep 26.
2
Impact of post-alignment processing in variant discovery from whole exome data.全外显子数据变异发现中比对后处理的影响
BMC Bioinformatics. 2016 Oct 3;17(1):403. doi: 10.1186/s12859-016-1279-z.
3
Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery.系统基准测试最先进的变异调用管道,确定影响编码序列变异发现准确性的主要因素。
BMC Genomics. 2022 Feb 22;23(1):155. doi: 10.1186/s12864-022-08365-3.
4
Systematic comparison of variant calling pipelines using gold standard personal exome variants.使用金标准个人外显子变体对变异检测流程进行系统比较。
Sci Rep. 2015 Dec 7;5:17875. doi: 10.1038/srep17875.
5
Variant callers for next-generation sequencing data: a comparison study.下一代测序数据的变异调用者:一项比较研究。
PLoS One. 2013 Sep 27;8(9):e75619. doi: 10.1371/journal.pone.0075619. eCollection 2013.
6
VariantMetaCaller: automated fusion of variant calling pipelines for quantitative, precision-based filtering.变异元调用器:用于基于定量、精确性筛选的变异调用流程的自动融合。
BMC Genomics. 2015 Oct 28;16:875. doi: 10.1186/s12864-015-2050-y.
7
Optimized detection of insertions/deletions (INDELs) in whole-exome sequencing data.全外显子组测序数据中插入/缺失(INDELs)的优化检测
PLoS One. 2017 Aug 9;12(8):e0182272. doi: 10.1371/journal.pone.0182272. eCollection 2017.
8
Performance assessment of variant calling pipelines using human whole exome sequencing and simulated data.使用人类全外显子组测序和模拟数据评估变异调用管道的性能。
BMC Bioinformatics. 2019 Jun 17;20(1):342. doi: 10.1186/s12859-019-2928-9.
9
An analytical workflow for accurate variant discovery in highly divergent regions.一种用于在高度分化区域进行准确变异发现的分析流程。
BMC Genomics. 2016 Sep 2;17(1):703. doi: 10.1186/s12864-016-3045-z.
10
Medical implications of technical accuracy in genome sequencing.基因组测序技术准确性的医学意义。
Genome Med. 2016 Mar 2;8(1):24. doi: 10.1186/s13073-016-0269-0.

引用本文的文献

1
Genomic reanalysis of a pan-European rare-disease resource yields new diagnoses.对泛欧洲罕见病资源进行基因组重新分析得出了新的诊断结果。
Nat Med. 2025 Feb;31(2):478-489. doi: 10.1038/s41591-024-03420-w. Epub 2025 Jan 17.
2
Phenotype-driven genomics enhance diagnosis in children with unresolved neuromuscular diseases.表型驱动的基因组学改善了患有未确诊神经肌肉疾病儿童的诊断。
Eur J Hum Genet. 2025 Mar;33(2):239-247. doi: 10.1038/s41431-024-01699-4. Epub 2024 Sep 27.
3
An interconnected data infrastructure to support large-scale rare disease research.

本文引用的文献

1
Medical implications of technical accuracy in genome sequencing.基因组测序技术准确性的医学意义。
Genome Med. 2016 Mar 2;8(1):24. doi: 10.1186/s13073-016-0269-0.
2
A comprehensive assessment of somatic mutation detection in cancer using whole-genome sequencing.利用全基因组测序对癌症中体细胞突变检测进行的全面评估。
Nat Commun. 2015 Dec 9;6:10001. doi: 10.1038/ncomms10001.
3
Systematic comparison of variant calling pipelines using gold standard personal exome variants.使用金标准个人外显子变体对变异检测流程进行系统比较。
一个互联的数据基础设施,以支持大规模罕见病研究。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae058.
4
Genomic technologies and the diagnosis of 46, XY differences of sex development.基因组技术与46, XY性发育差异的诊断
Andrology. 2025 Jul;13(5):1025-1043. doi: 10.1111/andr.13708. Epub 2024 Jul 31.
5
Identification and characterization of a new pathologic mutation in a large Leber hereditary optic neuropathy pedigree.在一个大型 Leber 遗传性视神经病变家系中鉴定和特征分析一种新的病理突变。
Orphanet J Rare Dis. 2024 Apr 6;19(1):148. doi: 10.1186/s13023-024-03165-2.
6
Variants in mitochondrial disease genes are common causes of inherited peripheral neuropathies.线粒体疾病基因的变异是遗传性周围神经病的常见原因。
J Neurol. 2024 Jun;271(6):3546-3553. doi: 10.1007/s00415-024-12319-y. Epub 2024 Mar 28.
7
Rare disease research workflow using multilayer networks elucidates the molecular determinants of severity in Congenital Myasthenic Syndromes.利用多层网络研究罕见病工作流程阐明了先天性肌营养不良综合征严重程度的分子决定因素。
Nat Commun. 2024 Feb 28;15(1):1227. doi: 10.1038/s41467-024-45099-0.
8
Phenotypic similarity-based approach for variant prioritization for unsolved rare disease: a preliminary methodological report.基于表型相似性的未解决罕见病变异优先级方法:初步方法学报告。
Eur J Hum Genet. 2024 Feb;32(2):182-189. doi: 10.1038/s41431-023-01486-7. Epub 2023 Nov 6.
9
Functional Evidence of as a New Disease-Associated Gene with Endocrine and Central Nervous System Alterations.作为一个具有内分泌和中枢神经系统改变的新的疾病相关基因的功能证据。
Int J Mol Sci. 2023 Aug 1;24(15):12319. doi: 10.3390/ijms241512319.
10
Remote visualization of large-scale genomic alignments for collaborative clinical research and diagnosis of rare diseases.用于罕见病协作临床研究与诊断的大规模基因组比对的远程可视化。
Cell Genom. 2023 Jan 11;3(2):100246. doi: 10.1016/j.xgen.2022.100246. eCollection 2023 Feb 8.
Sci Rep. 2015 Dec 7;5:17875. doi: 10.1038/srep17875.
4
A Comparison of Variant Calling Pipelines Using Genome in a Bottle as a Reference.使用“瓶中基因组”作为参考的变异检测流程比较
Biomed Res Int. 2015;2015:456479. doi: 10.1155/2015/456479. Epub 2015 Oct 11.
5
Performance evaluation of indel calling tools using real short-read data.使用真实短读长数据对插入缺失(Indel)检测工具进行性能评估。
Hum Genomics. 2015 Aug 19;9(1):20. doi: 10.1186/s40246-015-0042-2.
6
Utility of whole-exome sequencing for those near the end of the diagnostic odyssey: time to address gaps in care.全外显子测序对那些诊断之旅接近尾声的患者的效用:是时候解决护理中的差距了。
Clin Genet. 2016 Mar;89(3):275-84. doi: 10.1111/cge.12654. Epub 2015 Sep 22.
7
Achieving high-sensitivity for clinical applications using augmented exome sequencing.使用增强外显子组测序实现临床应用的高灵敏度。
Genome Med. 2015 Jul 16;7(1):71. doi: 10.1186/s13073-015-0197-4. eCollection 2015.
8
Leveraging the power of high performance computing for next generation sequencing data analysis: tricks and twists from a high throughput exome workflow.利用高性能计算的力量进行下一代测序数据分析:来自高通量外显子组工作流程的技巧与窍门
PLoS One. 2015 May 5;10(5):e0126321. doi: 10.1371/journal.pone.0126321. eCollection 2015.
9
An analytical framework for optimizing variant discovery from personal genomes.用于优化从个人基因组中发现变异的分析框架。
Nat Commun. 2015 Feb 25;6:6275. doi: 10.1038/ncomms7275.
10
Churchill: an ultra-fast, deterministic, highly scalable and balanced parallelization strategy for the discovery of human genetic variation in clinical and population-scale genomics.丘吉尔:一种超快速、确定性、高度可扩展且平衡的并行化策略,用于在临床和群体规模基因组学中发现人类遗传变异。
Genome Biol. 2015 Jan 20;16(1):6. doi: 10.1186/s13059-014-0577-x.