• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RFcaller:一种结合读取水平特征以检测体细胞突变的机器学习方法。

RFcaller: a machine learning approach combined with read-level features to detect somatic mutations.

作者信息

Díaz-Navarro Ander, Bousquets-Muñoz Pablo, Nadeu Ferran, López-Tamargo Sara, Beà Silvia, Campo Elias, Puente Xose S

机构信息

Departamento de Bioquímica y Biología Molecular, Instituto Universitario de Oncología (IUOPA), Universidad de Oviedo, 33006 Oviedo, Spain.

Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), 08036 Barcelona, Spain.

出版信息

NAR Genom Bioinform. 2023 May 30;5(2):lqad056. doi: 10.1093/nargab/lqad056. eCollection 2023 Jun.

DOI:10.1093/nargab/lqad056
PMID:37260508
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10227442/
Abstract

The cost reduction in sequencing and the extensive genomic characterization of a wide variety of cancers are expanding tumor sequencing to a wide number of research groups and the clinical practice. Although specific pipelines have been generated for the identification of somatic mutations, their results usually differ considerably, and a common approach is to use several callers to achieve a more reliable set of mutations. This procedure is computationally expensive and time-consuming, and it suffers from the same limitations in sensitivity and specificity as other approaches. Expert revision of mutant calls is therefore required to verify calls that might be used for clinical diagnosis. This step could take advantage of machine learning techniques, as they provide a useful approach to incorporate expert-reviewed information for the identification of somatic mutations. Here we present RFcaller, a pipeline based on machine learning algorithms, for the detection of somatic mutations in tumor-normal paired samples that does not require large computing resources. RFcaller shows high accuracy for the detection of substitutions and insertions/deletions from whole genome or exome data. It allows the detection of mutations in driver genes missed by other approaches, and has been validated by comparison to deep and Sanger sequencing.

摘要

测序成本的降低以及对多种癌症进行广泛的基因组特征分析,正将肿瘤测序扩展到众多研究团队和临床实践中。尽管已经生成了用于识别体细胞突变的特定流程,但其结果通常差异很大,一种常见的方法是使用多种调用程序来获得一组更可靠的突变。这个过程在计算上既昂贵又耗时,并且与其他方法一样存在敏感性和特异性方面的局限性。因此,需要专家对突变调用进行修订,以验证可能用于临床诊断的调用。这一步骤可以利用机器学习技术,因为它们提供了一种有用的方法来整合经过专家审核的信息,用于识别体细胞突变。在这里,我们展示了RFcaller,这是一种基于机器学习算法的流程,用于检测肿瘤-正常配对样本中的体细胞突变,不需要大量计算资源。RFcaller在从全基因组或外显子组数据中检测替换和插入/缺失方面显示出高精度。它能够检测出其他方法遗漏的驱动基因突变,并且已经通过与深度测序和桑格测序的比较得到了验证。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/12e391174e55/lqad056fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/dd941b01eab3/lqad056fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/8fd6fbf0eb1d/lqad056fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/2aa0e1601d7c/lqad056fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/12e391174e55/lqad056fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/dd941b01eab3/lqad056fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/8fd6fbf0eb1d/lqad056fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/2aa0e1601d7c/lqad056fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c026/10227442/12e391174e55/lqad056fig4.jpg

相似文献

1
RFcaller: a machine learning approach combined with read-level features to detect somatic mutations.RFcaller:一种结合读取水平特征以检测体细胞突变的机器学习方法。
NAR Genom Bioinform. 2023 May 30;5(2):lqad056. doi: 10.1093/nargab/lqad056. eCollection 2023 Jun.
2
Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.评估九种体细胞变异检测工具在全外显子组测序和靶向深度测序数据中检测体细胞突变的性能
PLoS One. 2016 Mar 22;11(3):e0151664. doi: 10.1371/journal.pone.0151664. eCollection 2016.
3
DNN-Boost: Somatic mutation identification of tumor-only whole-exome sequencing data using deep neural network and XGBoost.DNN-Boost:使用深度神经网络和 XGBoost 对仅肿瘤全外显子测序数据进行体细胞突变识别。
J Bioinform Comput Biol. 2021 Dec;19(6):2140017. doi: 10.1142/S0219720021400175. Epub 2021 Dec 13.
4
ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.ExScalibur:一款用于全外显子组种系和体细胞突变识别的高性能云支持套件。
PLoS One. 2015 Aug 13;10(8):e0135800. doi: 10.1371/journal.pone.0135800. eCollection 2015.
5
Construction of a combinatorial pipeline using two somatic variant  calling  methods  for whole exome sequence data of gastric cancer.利用两种体细胞变异检测方法构建针对胃癌全外显子序列数据的组合流程。
J Med Invest. 2017;64(3.4):233-240. doi: 10.2152/jmi.64.233.
6
Exome-wide Sequencing Shows Low Mutation Rates and Identifies Novel Mutated Genes in Seminomas.外显子组测序显示精原细胞瘤的突变率较低,并鉴定出新的突变基因。
Eur Urol. 2015 Jul;68(1):77-83. doi: 10.1016/j.eururo.2014.12.040. Epub 2015 Jan 14.
7
SNooPer: a machine learning-based method for somatic variant identification from low-pass next-generation sequencing.SNooPer:一种基于机器学习从低深度下一代测序中识别体细胞变异的方法。
BMC Genomics. 2016 Nov 14;17(1):912. doi: 10.1186/s12864-016-3281-2.
8
Comparison of somatic variant detection algorithms using Ion Torrent targeted deep sequencing data.利用 Ion Torrent 靶向深度测序数据比较体细胞变异检测算法。
BMC Med Genomics. 2019 Dec 24;12(Suppl 9):181. doi: 10.1186/s12920-019-0636-y.
9
DeepSSV: detecting somatic small variants in paired tumor and normal sequencing data with convolutional neural network.DeepSSV:使用卷积神经网络检测配对肿瘤和正常测序数据中的体细胞小变异。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa272.
10
A machine learning approach for somatic mutation discovery.机器学习在体细胞突变发现中的应用。
Sci Transl Med. 2018 Sep 5;10(457). doi: 10.1126/scitranslmed.aar7939.

引用本文的文献

1
Learning a refinement model for variant analysis in non-human primate genomes.学习用于非人灵长类动物基因组变异分析的优化模型。
BMC Genomics. 2025 Aug 25;26(1):775. doi: 10.1186/s12864-025-11921-2.
2
Predicting high confidence ctDNA somatic variants with ensemble machine learning models.使用集成机器学习模型预测高置信度的ctDNA体细胞变异
Sci Rep. 2025 May 26;15(1):18384. doi: 10.1038/s41598-025-01326-2.
3
Large B-cell lymphomas with CCND1 rearrangement have different immunoglobulin gene breakpoints and genomic profile than mantle cell lymphoma.

本文引用的文献

1
Molecular map of chronic lymphocytic leukemia and its impact on outcome.慢性淋巴细胞白血病的分子图谱及其对预后的影响。
Nat Genet. 2022 Nov;54(11):1664-1674. doi: 10.1038/s41588-022-01140-w. Epub 2022 Aug 4.
2
Detection of oncogenic and clinically actionable mutations in cancer genomes critically depends on variant calling tools.在癌症基因组中检测致癌和具有临床可操作性的突变,关键取决于变异调用工具。
Bioinformatics. 2022 Jun 13;38(12):3181-3191. doi: 10.1093/bioinformatics/btac306.
3
Use to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers.
具有 CCND1 重排的大 B 细胞淋巴瘤与套细胞淋巴瘤的免疫球蛋白基因断裂点和基因组特征不同。
Blood Cancer J. 2024 Sep 23;14(1):166. doi: 10.1038/s41408-024-01146-z.
使用以有效利用绘图空间来处理大型数据集和异常值。
Front Genet. 2021 Nov 2;12:774846. doi: 10.3389/fgene.2021.774846. eCollection 2021.
4
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
5
Machine Learning Methods in Drug Discovery.药物发现中的机器学习方法。
Molecules. 2020 Nov 12;25(22):5277. doi: 10.3390/molecules25225277.
6
Genomic and epigenomic insights into the origin, pathogenesis, and clinical behavior of mantle cell lymphoma subtypes.基因组和表观基因组对套细胞淋巴瘤亚型的起源、发病机制和临床行为的深入了解。
Blood. 2020 Sep 17;136(12):1419-1432. doi: 10.1182/blood.2020005289.
7
The repertoire of mutational signatures in human cancer.人类癌症中的突变特征谱。
Nature. 2020 Feb;578(7793):94-101. doi: 10.1038/s41586-020-1943-3. Epub 2020 Feb 5.
8
Analyses of non-coding somatic drivers in 2,658 cancer whole genomes.分析 2658 个癌症全基因组中的非编码体细胞驱动因子。
Nature. 2020 Feb;578(7793):102-111. doi: 10.1038/s41586-020-1965-x. Epub 2020 Feb 5.
9
Patterns of somatic structural variation in human cancer genomes.人类癌症基因组中体结构变异的模式。
Nature. 2020 Feb;578(7793):112-121. doi: 10.1038/s41586-019-1913-9. Epub 2020 Feb 5.
10
Pan-cancer analysis of whole genomes.泛癌症全基因组分析。
Nature. 2020 Feb;578(7793):82-93. doi: 10.1038/s41586-020-1969-6. Epub 2020 Feb 5.