• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迈向用于体细胞突变检测的准确且稳健的分析流程。

Towards an accurate and robust analysis pipeline for somatic mutation calling.

作者信息

Jin Jingjie, Chen Zixi, Liu Jinchao, Du Hongli, Zhang Gong

机构信息

Key Laboratory of Functional Protein Research, Guangdong Higher Education Institutes, Jinan University, Guangzhou, China.

MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou, China.

出版信息

Front Genet. 2022 Nov 15;13:979928. doi: 10.3389/fgene.2022.979928. eCollection 2022.

DOI:10.3389/fgene.2022.979928
PMID:36457740
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9705725/
Abstract

Accurate and robust somatic mutation detection is essential for cancer treatment, diagnostics and research. Various analysis pipelines give different results and thus should be systematically evaluated. In this study, we benchmarked 5 commonly-used somatic mutation calling pipelines (VarScan, VarDictJava, Mutect2, Strelka2 and FANSe) for their precision, recall and speed, using standard benchmarking datasets based on a series of real-world whole-exome sequencing datasets. All the 5 pipelines showed very high precision in all cases, and high recall rate in mutation rates higher than 10%. However, for the low frequency mutations, these pipelines showed large difference. FANSe showed the highest accuracy (especially the sensitivity) in all cases, and VarScan and VarDictJava outperformed Mutect2 and Strelka2 in low frequency mutations at all sequencing depths. The flaws in filter was the major cause of the low sensitivity of the four pipelines other than FANSe. Concerning the speed, FANSe pipeline was 8.8∼19x faster than the other pipelines. Our benchmarking results demonstrated performance of the somatic calling pipelines and provided a reference for a proper choice of such pipelines in cancer applications.

摘要

准确且稳健的体细胞突变检测对于癌症治疗、诊断和研究至关重要。各种分析流程会给出不同的结果,因此应该进行系统评估。在本研究中,我们使用基于一系列真实世界全外显子组测序数据集的标准基准数据集,对5种常用的体细胞突变检测流程(VarScan、VarDictJava、Mutect2、Strelka2和FANSe)的精度、召回率和速度进行了基准测试。所有这5种流程在所有情况下都显示出非常高的精度,并且在突变率高于10%时具有高召回率。然而,对于低频突变,这些流程表现出很大差异。FANSe在所有情况下都显示出最高的准确性(尤其是灵敏度),并且在所有测序深度下,VarScan和VarDictJava在低频突变方面优于Mutect2和Strelka2。除FANSe外,其他四种流程灵敏度低的主要原因是过滤存在缺陷。在速度方面,FANSe流程比其他流程快8.8至19倍。我们的基准测试结果展示了体细胞检测流程的性能,并为在癌症应用中正确选择此类流程提供了参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/e9ccc5143668/fgene-13-979928-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/6f40cf491159/fgene-13-979928-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/2721697287e8/fgene-13-979928-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/b650b8702890/fgene-13-979928-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/1381f56c427b/fgene-13-979928-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/e9ccc5143668/fgene-13-979928-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/6f40cf491159/fgene-13-979928-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/2721697287e8/fgene-13-979928-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/b650b8702890/fgene-13-979928-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/1381f56c427b/fgene-13-979928-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/469a/9705725/e9ccc5143668/fgene-13-979928-g005.jpg

相似文献

1
Towards an accurate and robust analysis pipeline for somatic mutation calling.迈向用于体细胞突变检测的准确且稳健的分析流程。
Front Genet. 2022 Nov 15;13:979928. doi: 10.3389/fgene.2022.979928. eCollection 2022.
2
Systematic comparison of variant calling pipelines of target genome sequencing cross multiple next-generation sequencers.跨多个下一代测序仪对目标基因组测序变异检测流程的系统比较。
Front Genet. 2024 Jan 4;14:1293974. doi: 10.3389/fgene.2023.1293974. eCollection 2023.
3
Systematic comparison of somatic variant calling performance among different sequencing depth and mutation frequency.不同测序深度和突变频率下体细胞变异调用性能的系统比较
Sci Rep. 2020 Feb 26;10(1):3501. doi: 10.1038/s41598-020-60559-5.
4
Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery.系统基准测试最先进的变异调用管道,确定影响编码序列变异发现准确性的主要因素。
BMC Genomics. 2022 Feb 22;23(1):155. doi: 10.1186/s12864-022-08365-3.
5
Benchmarking variant callers in next-generation and third-generation sequencing analysis.在新一代和第三代测序分析中对变异调用程序进行基准测试。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa148.
6
Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers.跨多种下一代测序仪的种系变异调用管道的系统比较。
Sci Rep. 2019 Jun 27;9(1):9345. doi: 10.1038/s41598-019-45835-3.
7
A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data.用于下一代测序数据的体细胞单核苷酸变异检测算法综述。
Comput Struct Biotechnol J. 2018 Feb 6;16:15-24. doi: 10.1016/j.csbj.2018.01.003. eCollection 2018.
8
Set-theory based benchmarking of three different variant callers for targeted sequencing.基于集合论的三种不同靶向测序变异 caller 的基准测试。
BMC Bioinformatics. 2021 Jan 7;22(1):20. doi: 10.1186/s12859-020-03926-3.
9
Dual Deep Sequencing Improves the Accuracy of Low-Frequency Somatic Mutation Detection in Cancer Gene Panel Testing.双重深度测序提高癌症基因panel 检测中低频种系突变检测的准确性。
Int J Mol Sci. 2020 May 16;21(10):3530. doi: 10.3390/ijms21103530.
10
Halvade somatic: Somatic variant calling with Apache Spark.半体变异体调用:基于 Apache Spark 的半体变异体调用。
Gigascience. 2022 Jan 12;11(1). doi: 10.1093/gigascience/giab094.

引用本文的文献

1
DeepSomatic: Accurate somatic small variant discovery for multiple sequencing technologies.DeepSomatic:适用于多种测序技术的精确体细胞小变异发现方法。
bioRxiv. 2024 Aug 19:2024.08.16.608331. doi: 10.1101/2024.08.16.608331.
2
Genomic Insights into Idiopathic Granulomatous Mastitis through Whole-Exome Sequencing: A Case Report of Eight Patients.通过全外显子组测序对特发性肉芽肿性乳腺炎的基因组学研究:八例患者的病例报告。
Int J Mol Sci. 2024 Aug 21;25(16):9058. doi: 10.3390/ijms25169058.
3
Integrated approach to generate artificial samples with low tumor fraction for somatic variant calling benchmarking.

本文引用的文献

1
The Ultrafast and Accurate Mapping Algorithm FANSe3: Mapping a Human Whole-Genome Sequencing Dataset Within 30 Minutes.超快速且精确的映射算法FANSe3:在30分钟内完成人类全基因组测序数据集的映射
Phenomics. 2021 Feb 22;1(1):22-30. doi: 10.1007/s43657-020-00008-5. eCollection 2021 Feb.
2
Branching clonal evolution patterns predominate mutational landscape in multiple myeloma.分支克隆进化模式在多发性骨髓瘤的突变图谱中占主导地位。
Am J Cancer Res. 2021 Nov 15;11(11):5659-5679. eCollection 2021.
3
Liquid Biopsy for Advanced NSCLC: A Consensus Statement From the International Association for the Study of Lung Cancer.
综合方法生成低肿瘤分数的人工样本用于体细胞变异calling 基准测试。
BMC Bioinformatics. 2024 May 8;25(1):180. doi: 10.1186/s12859-024-05793-8.
4
Neoantigen cancer vaccines: a new star on the horizon.新抗原癌症疫苗:崭露头角的新星。
Cancer Biol Med. 2023 Dec 29;21(4):274-311. doi: 10.20892/j.issn.2095-3941.2023.0395.
5
Personalized Cancer Vaccines Go Viral: Viral Vectors in the Era of Personalized Immunotherapy of Cancer.个性化癌症疫苗走红:个性化癌症免疫疗法时代的病毒载体。
Int J Mol Sci. 2023 Nov 22;24(23):16591. doi: 10.3390/ijms242316591.
6
An effective prognostic model for assessing prognosis of non-small cell lung cancer with brain metastases.一种用于评估非小细胞肺癌脑转移预后的有效预后模型。
Front Genet. 2023 Apr 13;14:1156322. doi: 10.3389/fgene.2023.1156322. eCollection 2023.
液体活检在晚期 NSCLC 中的应用:国际肺癌研究协会的共识声明。
J Thorac Oncol. 2021 Oct;16(10):1647-1662. doi: 10.1016/j.jtho.2021.06.017. Epub 2021 Jul 8.
4
Cancer neoantigens as potential targets for immunotherapy.癌症新生抗原作为免疫治疗的潜在靶点。
Clin Exp Metastasis. 2022 Feb;39(1):51-60. doi: 10.1007/s10585-021-10091-1. Epub 2021 May 5.
5
Germline variation networks in the PI3K/AKT pathway corresponding to familial high-incidence lung cancer pedigrees.与家族性肺癌高发家系相对应的 PI3K/AKT 通路中的种系变异网络。
BMC Cancer. 2020 Dec 9;20(1):1209. doi: 10.1186/s12885-020-07528-3.
6
Targeted sequencing reveals the somatic mutation landscape in a Swedish breast cancer cohort.靶向测序揭示了瑞典乳腺癌队列的体细胞突变景观。
Sci Rep. 2020 Nov 9;10(1):19304. doi: 10.1038/s41598-020-74580-1.
7
Systematic comparison of somatic variant calling performance among different sequencing depth and mutation frequency.不同测序深度和突变频率下体细胞变异调用性能的系统比较
Sci Rep. 2020 Feb 26;10(1):3501. doi: 10.1038/s41598-020-60559-5.
8
Strelka2: fast and accurate calling of germline and somatic variants.Strelka2:快速准确地调用种系和体细胞变异。
Nat Methods. 2018 Aug;15(8):591-594. doi: 10.1038/s41592-018-0051-x. Epub 2018 Jul 16.
9
Somatic mutations precede acute myeloid leukemia years before diagnosis.体细胞突变在急性髓系白血病诊断前数年就已存在。
Nat Med. 2018 Jul;24(7):1015-1023. doi: 10.1038/s41591-018-0081-z. Epub 2018 Jul 9.
10
Patient-Paired Sample Congruence Between 2 Commercial Liquid Biopsy Tests.患者配对样本在 2 种商业液体活检检测中的一致性。
JAMA Oncol. 2018 Jun 1;4(6):868-870. doi: 10.1001/jamaoncol.2017.4027.