• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepSom:一种基于 CNN 的无配对正常样本 WGS 样本体细胞变异calling 方法。

DeepSom: a CNN-based approach to somatic variant calling in WGS samples without a matched normal.

机构信息

Institute of Computational Biology, Computational Health Center, Helmholtz Zentrum München Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH), 85764 Neuherberg, Germany.

Department of Computer Science, TUM School of Computation, Information and Technology, Technical University Munich, 85748 Garching, Germany.

出版信息

Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac828.

DOI:10.1093/bioinformatics/btac828
PMID:36637201
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9843587/
Abstract

MOTIVATION

Somatic mutations are usually called by analyzing the DNA sequence of a tumor sample in conjunction with a matched normal. However, a matched normal is not always available, for instance, in retrospective analysis or diagnostic settings. For such cases, tumor-only somatic variant calling tools need to be designed. Previously proposed approaches demonstrate inferior performance on whole-genome sequencing (WGS) samples.

RESULTS

We present the convolutional neural network-based approach called DeepSom for detecting somatic single nucleotide polymorphism and short insertion and deletion variants in tumor WGS samples without a matched normal. We validate DeepSom by reporting its performance on five different cancer datasets. We also demonstrate that on WGS samples DeepSom outperforms previously proposed methods for tumor-only somatic variant calling.

AVAILABILITY AND IMPLEMENTATION

DeepSom is available as a GitHub repository at https://github.com/heiniglab/DeepSom.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

体细胞突变通常通过分析肿瘤样本的 DNA 序列并结合匹配的正常样本进行调用。然而,并非总是可以获得匹配的正常样本,例如在回顾性分析或诊断环境中。对于这种情况,需要设计仅针对肿瘤的体细胞变异调用工具。以前提出的方法在全基因组测序 (WGS) 样本上表现不佳。

结果

我们提出了一种基于卷积神经网络的方法,称为 DeepSom,用于在没有匹配正常样本的情况下检测肿瘤 WGS 样本中的体细胞单核苷酸多态性和短插入缺失变异。我们通过在五个不同的癌症数据集上报告其性能来验证 DeepSom。我们还证明,在 WGS 样本上,DeepSom 优于以前提出的仅针对肿瘤的体细胞变异调用方法。

可用性和实现

DeepSom 可在 GitHub 存储库 https://github.com/heiniglab/DeepSom 上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/94f3a6dcbc41/btac828f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/7dfa12bf824f/btac828f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/3e10aa7ef595/btac828f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/cd08b333e730/btac828f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/94f3a6dcbc41/btac828f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/7dfa12bf824f/btac828f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/3e10aa7ef595/btac828f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/cd08b333e730/btac828f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f68/9843587/94f3a6dcbc41/btac828f4.jpg

相似文献

1
DeepSom: a CNN-based approach to somatic variant calling in WGS samples without a matched normal.DeepSom:一种基于 CNN 的无配对正常样本 WGS 样本体细胞变异calling 方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac828.
2
Detection of oncogenic and clinically actionable mutations in cancer genomes critically depends on variant calling tools.在癌症基因组中检测致癌和具有临床可操作性的突变,关键取决于变异调用工具。
Bioinformatics. 2022 Jun 13;38(12):3181-3191. doi: 10.1093/bioinformatics/btac306.
3
SECEDO: SNV-based subclone detection using ultra-low coverage single-cell DNA sequencing.SECEDO:基于 SNV 的亚克隆检测,使用超低覆盖度单细胞 DNA 测序。
Bioinformatics. 2022 Sep 15;38(18):4293-4300. doi: 10.1093/bioinformatics/btac510.
4
Benchmarking the empirical accuracy of short-read sequencing across the M. tuberculosis genome.评估短读测序技术在结核分枝杆菌全基因组范围内的实际准确性。
Bioinformatics. 2022 Mar 28;38(7):1781-1787. doi: 10.1093/bioinformatics/btac023.
5
Conpair: concordance and contamination estimator for matched tumor-normal pairs.Conpair:匹配的肿瘤-正常样本对的一致性和污染估计器。
Bioinformatics. 2016 Oct 15;32(20):3196-3198. doi: 10.1093/bioinformatics/btw389. Epub 2016 Jun 26.
6
Very low-depth whole-genome sequencing in complex trait association studies.复杂性状关联研究中的极低深度全基因组测序。
Bioinformatics. 2019 Aug 1;35(15):2555-2561. doi: 10.1093/bioinformatics/bty1032.
7
Accurity: accurate tumor purity and ploidy inference from tumor-normal WGS data by jointly modelling somatic copy number alterations and heterozygous germline single-nucleotide-variants.Accurity:通过联合建模体细胞拷贝数改变和杂合性胚系单核苷酸变异,从肿瘤-正常 WGS 数据中准确推断肿瘤纯度和倍性。
Bioinformatics. 2018 Jun 15;34(12):2004-2011. doi: 10.1093/bioinformatics/bty043.
8
Comparative analysis of somatic variant calling on matched FF and FFPE WGS samples.配对 FF 和 FFPE WGS 样本的体细胞变异调用的比较分析。
BMC Med Genomics. 2020 Jul 6;13(1):94. doi: 10.1186/s12920-020-00746-5.
9
TOSCA: an automated Tumor Only Somatic CAlling workflow for somatic mutation detection without matched normal samples.TOSCA:一种用于在无匹配正常样本情况下进行体细胞突变检测的仅肿瘤体细胞调用自动化工作流程。
Bioinform Adv. 2022 Sep 26;2(1):vbac070. doi: 10.1093/bioadv/vbac070. eCollection 2022.
10
Halvade somatic: Somatic variant calling with Apache Spark.半体变异体调用:基于 Apache Spark 的半体变异体调用。
Gigascience. 2022 Jan 12;11(1). doi: 10.1093/gigascience/giab094.

引用本文的文献

1
Refined variant calling pipeline on RNA-seq data of breast cancer cell lines without matched-normal samples.针对无匹配正常样本的乳腺癌细胞系RNA测序数据的精细化变异检测流程
BMC Res Notes. 2025 Feb 15;18(1):67. doi: 10.1186/s13104-025-07140-3.
2
A benchmarking study of individual somatic variant callers and voting-based ensembles for whole-exome sequencing.全外显子组测序中个体体细胞变异检测工具及基于投票的集成方法的基准研究
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae697.

本文引用的文献

1
The 5th edition of the World Health Organization Classification of Haematolymphoid Tumours: Myeloid and Histiocytic/Dendritic Neoplasms.世界卫生组织血液淋巴肿瘤分类第五版:髓系和组织细胞/树突状肿瘤。
Leukemia. 2022 Jul;36(7):1703-1719. doi: 10.1038/s41375-022-01613-1. Epub 2022 Jun 22.
2
UNMASC: tumor-only variant calling with unmatched normal controls.UNMASC:使用不匹配的正常对照进行仅肿瘤变异检测。
NAR Cancer. 2021 Oct 6;3(4):zcab040. doi: 10.1093/narcan/zcab040. eCollection 2021 Dec.
3
Strand Orientation Bias Detector to determine the probability of FFPE sequencing artifacts.
链取向偏差探测器,用于确定 FFPE 测序伪影的概率。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab186.
4
Genetic-variant hotspots and hotspot clusters in the human genome facilitating adaptation while increasing instability.人类基因组中促进适应但增加不稳定性的遗传变异热点和热点簇。
Hum Genomics. 2021 Mar 19;15(1):19. doi: 10.1186/s40246-021-00318-3.
5
DeepSSV: detecting somatic small variants in paired tumor and normal sequencing data with convolutional neural network.DeepSSV:使用卷积神经网络检测配对肿瘤和正常测序数据中的体细胞小变异。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa272.
6
The mutational constraint spectrum quantified from variation in 141,456 humans.从 141456 名人类个体的变异中量化的突变约束谱。
Nature. 2020 May;581(7809):434-443. doi: 10.1038/s41586-020-2308-7. Epub 2020 May 27.
7
Systematic comparison of somatic variant calling performance among different sequencing depth and mutation frequency.不同测序深度和突变频率下体细胞变异调用性能的系统比较
Sci Rep. 2020 Feb 26;10(1):3501. doi: 10.1038/s41598-020-60559-5.
8
Lean and deep models for more accurate filtering of SNP and INDEL variant calls.用于更准确筛选 SNP 和 INDEL 变异体调用的精简且深入的模型。
Bioinformatics. 2020 Apr 1;36(7):2060-2067. doi: 10.1093/bioinformatics/btz901.
9
The International Cancer Genome Consortium Data Portal.国际癌症基因组联盟数据门户
Nat Biotechnol. 2019 Apr;37(4):367-369. doi: 10.1038/s41587-019-0055-9.
10
Deep convolutional neural networks for accurate somatic mutation detection.深度卷积神经网络用于准确的体细胞突变检测。
Nat Commun. 2019 Mar 4;10(1):1041. doi: 10.1038/s41467-019-09027-x.