• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对不同微环境和异质性水平下癌症测序流程的详细评估。

Detailed evaluation of cancer sequencing pipelines in different microenvironments and heterogeneity levels.

作者信息

Kısakol Batuhan, Sarıhan Şahin, Ergün Mehmet Arif, Baysan Mehmet

机构信息

Department of Physiology and Medical Physics, Centre for Systems Medicine, Royal College of Surgeons in Ireland, Dublin Ireland.

Computer Engineering Department, Faculty of Engineering, Marmara University, İstanbul, Turkey Turkey.

出版信息

Turk J Biol. 2021 Apr 20;45(2):114-126. doi: 10.3906/biy-2008-8. eCollection 2021.

DOI:10.3906/biy-2008-8
PMID:33907494
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8068765/
Abstract

The importance of next generation sequencing (NGS) rises in cancer research as accessing this key technology becomes easier for researchers. The sequence data created by NGS technologies must be processed by various bioinformatics algorithms within a pipeline in order to convert raw data to meaningful information. Mapping and variant calling are the two main steps of these analysis pipelines, and many algorithms are available for these steps. Therefore, detailed benchmarking of these algorithms in different scenarios is crucial for the efficient utilization of sequencing technologies. In this study, we compared the performance of twelve pipelines (three mapping and four variant discovery algorithms) with recommended settings to capture single nucleotide variants. We observed significant discrepancy in variant calls among tested pipelines for different heterogeneity levels in real and simulated samples with overall high specificity and low sensitivity. Additional to the individual evaluation of pipelines, we also constructed and tested the performance of pipeline combinations. In these analyses, we observed that certain pipelines complement each other much better than others and display superior performance than individual pipelines. This suggests that adhering to a single pipeline is not optimal for cancer sequencing analysis and sample heterogeneity should be considered in algorithm optimization.

摘要

随着研究人员更容易获得下一代测序(NGS)这项关键技术,它在癌症研究中的重要性日益凸显。由NGS技术生成的序列数据必须在一个流程中通过各种生物信息学算法进行处理,以便将原始数据转化为有意义的信息。比对和变异检测是这些分析流程的两个主要步骤,并且有许多算法可用于这些步骤。因此,在不同场景下对这些算法进行详细的基准测试对于测序技术的高效利用至关重要。在本研究中,我们将十二个流程(三种比对算法和四种变异发现算法)在推荐设置下的性能进行了比较,以捕获单核苷酸变异。我们观察到,在真实和模拟样本中,对于不同的异质性水平,测试流程之间的变异检测结果存在显著差异,总体上特异性高而敏感性低。除了对各个流程进行单独评估外,我们还构建并测试了流程组合的性能。在这些分析中,我们观察到某些流程之间的互补性比其他流程更好,并且表现出比单个流程更优的性能。这表明,在癌症测序分析中坚持使用单一流程并非最佳选择,并且在算法优化中应考虑样本异质性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/bfea6f6fdc9d/turkjbio-45-114-fig006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/2b7cadeedc9d/turkjbio-45-114-fig001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/b4f4c3a56342/turkjbio-45-114-fig002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/6431cc765088/turkjbio-45-114-fig003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/e9bf5fbb2891/turkjbio-45-114-fig004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/cf4954e7bec0/turkjbio-45-114-fig005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/bfea6f6fdc9d/turkjbio-45-114-fig006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/2b7cadeedc9d/turkjbio-45-114-fig001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/b4f4c3a56342/turkjbio-45-114-fig002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/6431cc765088/turkjbio-45-114-fig003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/e9bf5fbb2891/turkjbio-45-114-fig004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/cf4954e7bec0/turkjbio-45-114-fig005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b8d/8068765/bfea6f6fdc9d/turkjbio-45-114-fig006.jpg

相似文献

1
Detailed evaluation of cancer sequencing pipelines in different microenvironments and heterogeneity levels.对不同微环境和异质性水平下癌症测序流程的详细评估。
Turk J Biol. 2021 Apr 20;45(2):114-126. doi: 10.3906/biy-2008-8. eCollection 2021.
2
Performance evaluation of pipelines for mapping, variant calling and interval padding, for the analysis of NGS germline panels.用于分析NGS种系基因检测板的映射、变异位点检测和区间填充流程的性能评估。
BMC Bioinformatics. 2021 Apr 28;22(1):218. doi: 10.1186/s12859-021-04144-1.
3
Systematic comparison of variant calling pipelines of target genome sequencing cross multiple next-generation sequencers.跨多个下一代测序仪对目标基因组测序变异检测流程的系统比较。
Front Genet. 2024 Jan 4;14:1293974. doi: 10.3389/fgene.2023.1293974. eCollection 2023.
4
Challenges in exome analysis by LifeScope and its alternative computational pipelines.LifeScope及其替代计算流程在全外显子组分析中的挑战。
BMC Res Notes. 2015 Sep 7;8:421. doi: 10.1186/s13104-015-1385-4.
5
Set-theory based benchmarking of three different variant callers for targeted sequencing.基于集合论的三种不同靶向测序变异 caller 的基准测试。
BMC Bioinformatics. 2021 Jan 7;22(1):20. doi: 10.1186/s12859-020-03926-3.
6
Detailed simulation of cancer exome sequencing data reveals differences and common limitations of variant callers.癌症外显子组测序数据的详细模拟揭示了变异检测工具的差异和常见局限性。
BMC Bioinformatics. 2017 Jan 3;18(1):8. doi: 10.1186/s12859-016-1417-7.
7
Unipro UGENE NGS pipelines and components for variant calling, RNA-seq and ChIP-seq data analyses.Unipro UGENE NGS 流水线和组件,用于变体调用、RNA-seq 和 ChIP-seq 数据分析。
PeerJ. 2014 Nov 4;2:e644. doi: 10.7717/peerj.644. eCollection 2014.
8
VariantMetaCaller: automated fusion of variant calling pipelines for quantitative, precision-based filtering.变异元调用器:用于基于定量、精确性筛选的变异调用流程的自动融合。
BMC Genomics. 2015 Oct 28;16:875. doi: 10.1186/s12864-015-2050-y.
9
ICR142 Benchmarker: evaluating, optimising and benchmarking variant calling performance using the ICR142 NGS validation series.ICR142基准测试工具:使用ICR142二代测序验证系列评估、优化和基准测试变异检测性能
Wellcome Open Res. 2018 Oct 31;3:108. doi: 10.12688/wellcomeopenres.14754.2. eCollection 2018.
10
Comparison of Read Mapping and Variant Calling Tools for the Analysis of Plant NGS Data.用于植物NGS数据分析的读段比对和变异检测工具比较
Plants (Basel). 2020 Apr 2;9(4):439. doi: 10.3390/plants9040439.

引用本文的文献

1
COSAP: Comparative Sequencing Analysis Platform.COSAP:比较测序分析平台。
BMC Bioinformatics. 2024 Mar 26;25(1):130. doi: 10.1186/s12859-024-05756-z.

本文引用的文献

1
Best practices for variant calling in clinical sequencing.临床测序中变异调用的最佳实践。
Genome Med. 2020 Oct 26;12(1):91. doi: 10.1186/s13073-020-00791-w.
2
Systematic comparison of somatic variant calling performance among different sequencing depth and mutation frequency.不同测序深度和突变频率下体细胞变异调用性能的系统比较
Sci Rep. 2020 Feb 26;10(1):3501. doi: 10.1038/s41598-020-60559-5.
3
Comparison of somatic variant detection algorithms using Ion Torrent targeted deep sequencing data.利用 Ion Torrent 靶向深度测序数据比较体细胞变异检测算法。
BMC Med Genomics. 2019 Dec 24;12(Suppl 9):181. doi: 10.1186/s12920-019-0636-y.
4
Comparative analysis of whole-genome sequencing pipelines to minimize false negative findings.全基因组测序流程的比较分析,以尽量减少假阴性发现。
Sci Rep. 2019 Mar 1;9(1):3219. doi: 10.1038/s41598-019-39108-2.
5
Cancer statistics, 2019.癌症统计数据,2019 年。
CA Cancer J Clin. 2019 Jan;69(1):7-34. doi: 10.3322/caac.21551. Epub 2019 Jan 8.
6
fastp: an ultra-fast all-in-one FASTQ preprocessor.fastp:一个超快速的一体化 FASTQ 预处理程序。
Bioinformatics. 2018 Sep 1;34(17):i884-i890. doi: 10.1093/bioinformatics/bty560.
7
Strelka2: fast and accurate calling of germline and somatic variants.Strelka2:快速准确地调用种系和体细胞变异。
Nat Methods. 2018 Aug;15(8):591-594. doi: 10.1038/s41592-018-0051-x. Epub 2018 Jul 16.
8
Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines.采用多种基因组分析流水线的肿瘤外显子组突变调用的可扩展开放科学方法。
Cell Syst. 2018 Mar 28;6(3):271-281.e7. doi: 10.1016/j.cels.2018.03.002.
9
A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data.用于下一代测序数据的体细胞单核苷酸变异检测算法综述。
Comput Struct Biotechnol J. 2018 Feb 6;16:15-24. doi: 10.1016/j.csbj.2018.01.003. eCollection 2018.
10
Comprehensive benchmarking of SNV callers for highly admixed tumor data.针对高度混合肿瘤数据的单核苷酸变异(SNV)检测工具的综合基准测试。
PLoS One. 2017 Oct 11;12(10):e0186175. doi: 10.1371/journal.pone.0186175. eCollection 2017.