• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DNN-Boost:使用深度神经网络和 XGBoost 对仅肿瘤全外显子测序数据进行体细胞突变识别。

DNN-Boost: Somatic mutation identification of tumor-only whole-exome sequencing data using deep neural network and XGBoost.

机构信息

School of Computer Science and Engineering, Pusan National University, 63 Busandaehak-Ro, Busan 46241, Republic of Korea.

出版信息

J Bioinform Comput Biol. 2021 Dec;19(6):2140017. doi: 10.1142/S0219720021400175. Epub 2021 Dec 13.

DOI:10.1142/S0219720021400175
PMID:34895111
Abstract

Detection of somatic mutation in whole-exome sequencing data can help elucidate the mechanism of tumor progression. Most computational approaches require exome sequencing for both tumor and normal samples. However, it is more common to sequence exomes for tumor samples only without the paired normal samples. To include these types of data for extensive studies on the process of tumorigenesis, it is necessary to develop an approach for identifying somatic mutations using tumor exome sequencing data only. In this study, we designed a machine learning approach using Deep Neural Network (DNN) and XGBoost to identify somatic mutations in tumor-only exome sequencing data and we integrated this into a pipeline called DNN-Boost. The XGBoost algorithm is used to extract the features from the results of variant callers and these features are then fed into the DNN model as input. The XGBoost algorithm resolves issues of missing values and overfitting. We evaluated our proposed model and compared its performance with other existing benchmark methods. We noted that the DNN-Boost classification model outperformed the benchmark method in classifying somatic mutations from paired tumor-normal exome data and tumor-only exome data.

摘要

在全外显子组测序数据中检测体细胞突变有助于阐明肿瘤进展的机制。大多数计算方法都需要肿瘤和正常样本的外显子组测序。然而,更常见的情况是仅对肿瘤样本进行外显子组测序,而没有配对的正常样本。为了将这些类型的数据纳入对肿瘤发生过程的广泛研究中,有必要开发一种仅使用肿瘤外显子组测序数据识别体细胞突变的方法。在这项研究中,我们设计了一种使用深度神经网络(DNN)和 XGBoost 的机器学习方法来识别仅肿瘤外显子组测序数据中的体细胞突变,并将其集成到称为 DNN-Boost 的管道中。XGBoost 算法用于从变异调用器的结果中提取特征,然后将这些特征作为输入提供给 DNN 模型。XGBoost 算法解决了缺失值和过拟合的问题。我们评估了我们提出的模型,并将其性能与其他现有的基准方法进行了比较。我们注意到,在对来自配对的肿瘤-正常外显子组数据和仅肿瘤外显子组数据的体细胞突变进行分类时,DNN-Boost 分类模型的性能优于基准方法。

相似文献

1
DNN-Boost: Somatic mutation identification of tumor-only whole-exome sequencing data using deep neural network and XGBoost.DNN-Boost:使用深度神经网络和 XGBoost 对仅肿瘤全外显子测序数据进行体细胞突变识别。
J Bioinform Comput Biol. 2021 Dec;19(6):2140017. doi: 10.1142/S0219720021400175. Epub 2021 Dec 13.
2
DeepSSV: detecting somatic small variants in paired tumor and normal sequencing data with convolutional neural network.DeepSSV:使用卷积神经网络检测配对肿瘤和正常测序数据中的体细胞小变异。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa272.
3
Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.评估九种体细胞变异检测工具在全外显子组测序和靶向深度测序数据中检测体细胞突变的性能
PLoS One. 2016 Mar 22;11(3):e0151664. doi: 10.1371/journal.pone.0151664. eCollection 2016.
4
Optimized pipeline of MuTect and GATK tools to improve the detection of somatic single nucleotide polymorphisms in whole-exome sequencing data.用于改进全外显子组测序数据中体细胞单核苷酸多态性检测的MuTect和GATK工具的优化流程。
BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):341. doi: 10.1186/s12859-016-1190-7.
5
Whole-exome sequencing reveals recurrent somatic mutation networks in cancer.全外显子组测序揭示癌症中反复出现的体细胞突变网络。
Cancer Lett. 2013 Nov 1;340(2):270-6. doi: 10.1016/j.canlet.2012.11.002. Epub 2012 Nov 12.
6
Comparing somatic mutation-callers: beyond Venn diagrams.比较体细胞突变 caller:超越 Venn 图。
BMC Bioinformatics. 2013 Jun 10;14:189. doi: 10.1186/1471-2105-14-189.
7
Deep convolutional neural networks for accurate somatic mutation detection.深度卷积神经网络用于准确的体细胞突变检测。
Nat Commun. 2019 Mar 4;10(1):1041. doi: 10.1038/s41467-019-09027-x.
8
A machine learning approach for somatic mutation discovery.机器学习在体细胞突变发现中的应用。
Sci Transl Med. 2018 Sep 5;10(457). doi: 10.1126/scitranslmed.aar7939.
9
ISOWN: accurate somatic mutation identification in the absence of normal tissue controls.ISOWN:在无正常组织对照的情况下准确识别体细胞突变
Genome Med. 2017 Jun 29;9(1):59. doi: 10.1186/s13073-017-0446-9.
10
SomaticCombiner: improving the performance of somatic variant calling based on evaluation tests and a consensus approach.SomaticCombiner:基于评估测试和共识方法提高体细胞变异calling 的性能。
Sci Rep. 2020 Jul 30;10(1):12898. doi: 10.1038/s41598-020-69772-8.

引用本文的文献

1
Validation of genetic variants from NGS data using deep convolutional neural networks.使用深度卷积神经网络验证 NGS 数据中的遗传变异。
BMC Bioinformatics. 2023 Apr 20;24(1):158. doi: 10.1186/s12859-023-05255-7.