• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用弱监督深度学习进行准确的体细胞变异检测。

Accurate somatic variant detection using weakly supervised deep learning.

机构信息

Department of Computational and Systems Biology, Agency for Science Technology and Research, Genome Institute of Singapore, Singapore, Singapore.

School of Computing, National University of Singapore, Singapore, Singapore.

出版信息

Nat Commun. 2022 Jul 22;13(1):4248. doi: 10.1038/s41467-022-31765-8.

DOI:10.1038/s41467-022-31765-8
PMID:35869060
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9307817/
Abstract

Identification of somatic mutations in tumor samples is commonly based on statistical methods in combination with heuristic filters. Here we develop VarNet, an end-to-end deep learning approach for identification of somatic variants from aligned tumor and matched normal DNA reads. VarNet is trained using image representations of 4.6 million high-confidence somatic variants annotated in 356 tumor whole genomes. We benchmark VarNet across a range of publicly available datasets, demonstrating performance often exceeding current state-of-the-art methods. Overall, our results demonstrate how a scalable deep learning approach could augment and potentially supplant human engineered features and heuristic filters in somatic variant calling.

摘要

肿瘤样本中的体细胞突变的识别通常基于统计方法,并结合启发式过滤器。在这里,我们开发了 VarNet,这是一种从对齐的肿瘤和匹配的正常 DNA 读取中识别体细胞变体的端到端深度学习方法。VarNet 使用在 356 个肿瘤全基因组中注释的 460 万个高可信度体细胞变体的图像表示进行训练。我们在一系列公开可用的数据集上对 VarNet 进行了基准测试,结果表明其性能通常超过当前最先进的方法。总的来说,我们的结果表明,一种可扩展的深度学习方法可以增强并可能替代体细胞变异调用中的人为设计的特征和启发式过滤器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/2756be0f0e92/41467_2022_31765_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/c22bfceb02a0/41467_2022_31765_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/430590125910/41467_2022_31765_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/d954480b51c3/41467_2022_31765_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/8bed7262bac1/41467_2022_31765_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/5092b797d8eb/41467_2022_31765_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/2756be0f0e92/41467_2022_31765_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/c22bfceb02a0/41467_2022_31765_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/430590125910/41467_2022_31765_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/d954480b51c3/41467_2022_31765_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/8bed7262bac1/41467_2022_31765_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/5092b797d8eb/41467_2022_31765_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5627/9307817/2756be0f0e92/41467_2022_31765_Fig6_HTML.jpg

相似文献

1
Accurate somatic variant detection using weakly supervised deep learning.利用弱监督深度学习进行准确的体细胞变异检测。
Nat Commun. 2022 Jul 22;13(1):4248. doi: 10.1038/s41467-022-31765-8.
2
Integrated approach to generate artificial samples with low tumor fraction for somatic variant calling benchmarking.综合方法生成低肿瘤分数的人工样本用于体细胞变异calling 基准测试。
BMC Bioinformatics. 2024 May 8;25(1):180. doi: 10.1186/s12859-024-05793-8.
3
Improving somatic exome sequencing performance by biological replicates.通过生物学重复提高体细胞外显子组测序性能。
BMC Bioinformatics. 2024 Mar 22;25(1):124. doi: 10.1186/s12859-024-05742-5.
4
Combining accurate tumor genome simulation with crowdsourcing to benchmark somatic structural variant detection.结合精确的肿瘤基因组模拟和众包基准测试体细胞结构变异检测。
Genome Biol. 2018 Nov 6;19(1):188. doi: 10.1186/s13059-018-1539-5.
5
DeepSSV: detecting somatic small variants in paired tumor and normal sequencing data with convolutional neural network.DeepSSV:使用卷积神经网络检测配对肿瘤和正常测序数据中的体细胞小变异。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa272.
6
A deep learning approach to automate refinement of somatic variant calling from cancer sequencing data.深度学习方法自动优化癌症测序数据中体细胞变异的调用。
Nat Genet. 2018 Dec;50(12):1735-1743. doi: 10.1038/s41588-018-0257-y. Epub 2018 Nov 5.
7
Deep whole-genome sequencing of 3 cancer cell lines on 2 sequencing platforms.对 2 个测序平台上的 3 种癌细胞系进行深度全基因组测序。
Sci Rep. 2019 Dec 13;9(1):19123. doi: 10.1038/s41598-019-55636-3.
8
ISOWN: accurate somatic mutation identification in the absence of normal tissue controls.ISOWN:在无正常组织对照的情况下准确识别体细胞突变
Genome Med. 2017 Jun 29;9(1):59. doi: 10.1186/s13073-017-0446-9.
9
A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data.用于下一代测序数据的体细胞单核苷酸变异检测算法综述。
Comput Struct Biotechnol J. 2018 Feb 6;16:15-24. doi: 10.1016/j.csbj.2018.01.003. eCollection 2018.
10
Somatic and Germline Variant Calling from Next-Generation Sequencing Data.从下一代测序数据中进行体细胞和种系变异调用。
Adv Exp Med Biol. 2022;1361:37-54. doi: 10.1007/978-3-030-91836-1_3.

引用本文的文献

1
araCNA: somatic copy number profiling using long-range sequence models.araCNA:使用长程序列模型进行体细胞拷贝数分析
NAR Genom Bioinform. 2025 Sep 9;7(3):lqaf124. doi: 10.1093/nargab/lqaf124. eCollection 2025 Sep.
2
Identification of Somatic Variants in Cancer Genomes from Tissue and Liquid Biopsy Samples.从组织和液体活检样本中鉴定癌症基因组中的体细胞变异
Methods Mol Biol. 2025;2932:291-301. doi: 10.1007/978-1-0716-4566-6_16.
3
Integrating Artificial Intelligence in Next-Generation Sequencing: Advances, Challenges, and Future Directions.

本文引用的文献

1
Establishing community reference samples, data and call sets for benchmarking cancer mutation detection using whole-genome sequencing.建立社区参考样本、数据和调用集,用于使用全基因组测序进行癌症突变检测的基准测试。
Nat Biotechnol. 2021 Sep;39(9):1151-1160. doi: 10.1038/s41587-021-00993-6. Epub 2021 Sep 9.
2
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
3
Array programming with NumPy.使用 NumPy 进行数组编程。
将人工智能整合到下一代测序中:进展、挑战与未来方向。
Curr Issues Mol Biol. 2025 Jun 19;47(6):470. doi: 10.3390/cimb47060470.
4
A Comprehensive Review of Deep Learning Applications with Multi-Omics Data in Cancer Research.癌症研究中多组学数据深度学习应用的综合综述
Genes (Basel). 2025 May 28;16(6):648. doi: 10.3390/genes16060648.
5
Identification of minimal residual disease using the clonesight test for ultrasensitive ctDNA detection to anticipate late relapse in early breast cancer.使用Clonesight检测超灵敏循环肿瘤DNA以识别微小残留病,预测早期乳腺癌的晚期复发。
Breast Cancer Res. 2025 May 1;27(1):65. doi: 10.1186/s13058-025-02016-7.
6
A benchmarking study of individual somatic variant callers and voting-based ensembles for whole-exome sequencing.全外显子组测序中个体体细胞变异检测工具及基于投票的集成方法的基准研究
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae697.
7
Resolving the chromatin impact of mosaic variants with targeted Fiber-seq.利用靶向纤维测序解析嵌合变异对染色质的影响。
Genome Res. 2024 Dec 23;34(12):2269-2278. doi: 10.1101/gr.279747.124.
8
Role of artificial intelligence in haematolymphoid diagnostics.人工智能在血液淋巴系统诊断中的作用。
Histopathology. 2025 Jan;86(1):58-68. doi: 10.1111/his.15327. Epub 2024 Oct 22.
9
Resolving the chromatin impact of mosaic variants with targeted Fiber-seq.利用靶向纤维测序解析嵌合变异对染色质的影响。
bioRxiv. 2024 Jul 13:2024.07.09.602608. doi: 10.1101/2024.07.09.602608.
10
Deep learning in cancer genomics and histopathology.深度学习在癌症基因组学和组织病理学中的应用。
Genome Med. 2024 Mar 27;16(1):44. doi: 10.1186/s13073-024-01315-6.
Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.
4
Deep convolutional neural networks for accurate somatic mutation detection.深度卷积神经网络用于准确的体细胞突变检测。
Nat Commun. 2019 Mar 4;10(1):1041. doi: 10.1038/s41467-019-09027-x.
5
SMuRF: portable and accurate ensemble prediction of somatic mutations.SMuRF:体细胞突变的便携式精确集成预测
Bioinformatics. 2019 Sep 1;35(17):3157-3159. doi: 10.1093/bioinformatics/btz018.
6
A deep learning approach to automate refinement of somatic variant calling from cancer sequencing data.深度学习方法自动优化癌症测序数据中体细胞变异的调用。
Nat Genet. 2018 Dec;50(12):1735-1743. doi: 10.1038/s41588-018-0257-y. Epub 2018 Nov 5.
7
A universal SNP and small-indel variant caller using deep neural networks.使用深度神经网络的通用 SNP 和小插入缺失变体调用器。
Nat Biotechnol. 2018 Nov;36(10):983-987. doi: 10.1038/nbt.4235. Epub 2018 Sep 24.
8
Strelka2: fast and accurate calling of germline and somatic variants.Strelka2:快速准确地调用种系和体细胞变异。
Nat Methods. 2018 Aug;15(8):591-594. doi: 10.1038/s41592-018-0051-x. Epub 2018 Jul 16.
9
Mutation hotspots at CTCF binding sites coupled to chromosomal instability in gastrointestinal cancers.CTCF 结合位点的突变热点与胃肠道癌症中的染色体不稳定性相关联。
Nat Commun. 2018 Apr 18;9(1):1520. doi: 10.1038/s41467-018-03828-2.
10
Comprehensive and Integrative Genomic Characterization of Hepatocellular Carcinoma.肝细胞癌的综合与整合基因组特征分析
Cell. 2017 Jun 15;169(7):1327-1341.e23. doi: 10.1016/j.cell.2017.05.046.