• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepSSV:使用卷积神经网络检测配对肿瘤和正常测序数据中的体细胞小变异。

DeepSSV: detecting somatic small variants in paired tumor and normal sequencing data with convolutional neural network.

机构信息

Suzhou Institute of Systems Medicine, Center for Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Suzhou, Jiangsu, China.

La Trobe University, Melbourne, Victoria, Australia.

出版信息

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa272.

DOI:10.1093/bib/bbaa272
PMID:33164053
Abstract

It is of considerable interest to detect somatic mutations in paired tumor and normal sequencing data. A number of callers that are based on statistical or machine learning approaches have been developed to detect somatic small variants. However, they take into consideration only limited information about the reference and potential variant allele in both tumor and normal samples at a candidate somatic site. Also, they differ in how biological and technological noises are addressed. Hence, they are expected to produce divergent outputs. To overcome the drawbacks of existing somatic callers, we develop a deep learning-based tool called DeepSSV, which employs a convolutional neural network (CNN) model to learn increasingly abstract feature representations from the raw data in higher feature layers. DeepSSV creates a spatially oriented representation of read alignments around the candidate somatic sites adapted for the convolutional architecture, which enables it to expand to effectively gather scattered evidence. Moreover, DeepSSV incorporates the mapping information of both reference allele-supporting and variant allele-supporting reads in the tumor and normal samples at a genomic site that are readily available in the pileup format file. Together, the CNN model can process the whole alignment information. Such representational richness allows the model to capture the dependencies in the sequence and identify context-based sequencing artifacts. We fitted the model on ground truth somatic mutations and did benchmarking experiments on simulated and real tumors. The benchmarking results demonstrate that DeepSSV outperforms its state-of-the-art competitors in overall F1 score.

摘要

检测配对肿瘤和正常测序数据中的体细胞突变具有重要意义。已经开发了许多基于统计或机器学习方法的调用者来检测体细胞小变体。然而,它们仅考虑了候选体细胞位点处肿瘤和正常样本中参考和潜在变异等位基因的有限信息。此外,它们在如何处理生物和技术噪声方面也存在差异。因此,预计它们会产生不同的输出。为了克服现有体细胞调用者的缺点,我们开发了一种基于深度学习的工具,称为 DeepSSV,它使用卷积神经网络 (CNN) 模型从原始数据中学习越来越抽象的特征表示,这些特征表示在更高的特征层中。DeepSSV 为候选体细胞位点周围的读取对齐创建了一个面向空间的表示形式,该表示形式适应卷积架构,使其能够有效地扩展以收集分散的证据。此外,DeepSSV 结合了肿瘤和正常样本中在基因组位点上易于获得的参考等位基因支持和变异等位基因支持读取的映射信息,这些信息以堆积格式文件的形式提供。CNN 模型可以处理整个对齐信息。这种表示丰富性允许模型捕获序列中的依赖关系并识别基于上下文的测序伪影。我们在真实的体细胞突变上拟合模型,并在模拟和真实肿瘤上进行基准实验。基准实验结果表明,DeepSSV 在总体 F1 评分方面优于其最先进的竞争对手。

相似文献

1
DeepSSV: detecting somatic small variants in paired tumor and normal sequencing data with convolutional neural network.DeepSSV:使用卷积神经网络检测配对肿瘤和正常测序数据中的体细胞小变异。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa272.
2
DNN-Boost: Somatic mutation identification of tumor-only whole-exome sequencing data using deep neural network and XGBoost.DNN-Boost:使用深度神经网络和 XGBoost 对仅肿瘤全外显子测序数据进行体细胞突变识别。
J Bioinform Comput Biol. 2021 Dec;19(6):2140017. doi: 10.1142/S0219720021400175. Epub 2021 Dec 13.
3
Deep convolutional neural networks for accurate somatic mutation detection.深度卷积神经网络用于准确的体细胞突变检测。
Nat Commun. 2019 Mar 4;10(1):1041. doi: 10.1038/s41467-019-09027-x.
4
Integrated approach to generate artificial samples with low tumor fraction for somatic variant calling benchmarking.综合方法生成低肿瘤分数的人工样本用于体细胞变异calling 基准测试。
BMC Bioinformatics. 2024 May 8;25(1):180. doi: 10.1186/s12859-024-05793-8.
5
A universal SNP and small-indel variant caller using deep neural networks.使用深度神经网络的通用 SNP 和小插入缺失变体调用器。
Nat Biotechnol. 2018 Nov;36(10):983-987. doi: 10.1038/nbt.4235. Epub 2018 Sep 24.
6
SNooPer: a machine learning-based method for somatic variant identification from low-pass next-generation sequencing.SNooPer:一种基于机器学习从低深度下一代测序中识别体细胞变异的方法。
BMC Genomics. 2016 Nov 14;17(1):912. doi: 10.1186/s12864-016-3281-2.
7
DeepSom: a CNN-based approach to somatic variant calling in WGS samples without a matched normal.DeepSom:一种基于 CNN 的无配对正常样本 WGS 样本体细胞变异calling 方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac828.
8
cnnLSV: detecting structural variants by encoding long-read alignment information and convolutional neural network.cnnLSV:通过编码长读对准信息和卷积神经网络检测结构变体。
BMC Bioinformatics. 2023 Mar 28;24(1):119. doi: 10.1186/s12859-023-05243-x.
9
A database of simulated tumor genomes towards accurate detection of somatic small variants in cancer.模拟肿瘤基因组数据库,用于准确检测癌症中的体细胞小变异。
PLoS One. 2018 Aug 30;13(8):e0202982. doi: 10.1371/journal.pone.0202982. eCollection 2018.
10
Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.评估九种体细胞变异检测工具在全外显子组测序和靶向深度测序数据中检测体细胞突变的性能
PLoS One. 2016 Mar 22;11(3):e0151664. doi: 10.1371/journal.pone.0151664. eCollection 2016.

引用本文的文献

1
PREDAC-CNN: predicting antigenic clusters of seasonal influenza A viruses with convolutional neural network.PREDAC-CNN:基于卷积神经网络预测季节性甲型流感病毒的抗原簇。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae033.
2
DeepSom: a CNN-based approach to somatic variant calling in WGS samples without a matched normal.DeepSom:一种基于 CNN 的无配对正常样本 WGS 样本体细胞变异calling 方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac828.
3
mitoSomatic: a tool for accurate identification of mitochondrial DNA somatic mutations without paired controls.
线粒体体细胞突变检测工具(mitoSomatic):无需配对对照即可准确识别线粒体 DNA 体细胞突变。
Mol Oncol. 2023 May;17(5):857-871. doi: 10.1002/1878-0261.13335. Epub 2022 Dec 15.
4
Nanopore Technology and Its Applications in Gene Sequencing.纳米孔技术及其在基因测序中的应用。
Biosensors (Basel). 2021 Jun 30;11(7):214. doi: 10.3390/bios11070214.