• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepSomatic:适用于多种测序技术的精确体细胞小变异发现方法。

DeepSomatic: Accurate somatic small variant discovery for multiple sequencing technologies.

作者信息

Park Jimin, Cook Daniel E, Chang Pi-Chuan, Kolesnikov Alexey, Brambrink Lucas, Mier Juan Carlos, Gardner Joshua, McNulty Brandy, Sacco Samuel, Keskus Ayse, Bryant Asher, Ahmad Tanveer, Shetty Jyoti, Zhao Yongmei, Tran Bao, Narzisi Giuseppe, Helland Adrienne, Yoo Byunggil, Pushel Irina, Lansdon Lisa A, Bi Chengpeng, Walter Adam, Gibson Margaret, Pastinen Tomi, Farooqi Midhat S, Robine Nicolas, Miga Karen H, Carroll Andrew, Kolmogorov Mikhail, Paten Benedict, Shafin Kishwar

机构信息

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA.

Google Inc, Mountain View, CA, USA.

出版信息

bioRxiv. 2024 Aug 19:2024.08.16.608331. doi: 10.1101/2024.08.16.608331.

DOI:10.1101/2024.08.16.608331
PMID:39229187
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11370364/
Abstract

Somatic variant detection is an integral part of cancer genomics analysis. While most methods have focused on short-read sequencing, long-read technologies now offer potential advantages in terms of repeat mapping and variant phasing. We present DeepSomatic, a deep learning method for detecting somatic SNVs and insertions and deletions (indels) from both short-read and long-read data, with modes for whole-genome and exome sequencing, and able to run on tumor-normal, tumor-only, and with FFPE-prepared samples. To help address the dearth of publicly available training and benchmarking data for somatic variant detection, we generated and make openly available a dataset of five matched tumor-normal cell line pairs sequenced with Illumina, PacBio HiFi, and Oxford Nanopore Technologies, along with benchmark variant sets. Across samples and technologies (short-read and long-read), DeepSomatic consistently outperforms existing callers, particularly for indels.

摘要

体细胞变异检测是癌症基因组学分析的一个重要组成部分。虽然大多数方法都集中在短读长测序上,但长读长技术现在在重复序列映射和变异定相方面具有潜在优势。我们提出了DeepSomatic,这是一种深度学习方法,用于从短读长和长读长数据中检测体细胞单核苷酸变异(SNV)以及插入和缺失(indel),具有全基因组和外显子组测序模式,并且能够在肿瘤-正常样本、仅肿瘤样本以及福尔马林固定石蜡包埋(FFPE)制备的样本上运行。为了帮助解决体细胞变异检测方面公开可用的训练和基准测试数据匮乏的问题,我们生成并公开了一个数据集,该数据集包含五对匹配的肿瘤-正常细胞系,使用Illumina、PacBio HiFi和Oxford Nanopore Technologies进行测序,同时还提供了基准变异集。在各种样本和技术(短读长和长读长)中,DeepSomatic始终优于现有的变异检测工具,尤其是在检测indel方面。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/008157667f70/nihpp-2024.08.16.608331v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/2ebb02ecebd3/nihpp-2024.08.16.608331v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/e112a6588aba/nihpp-2024.08.16.608331v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/13aa67147c79/nihpp-2024.08.16.608331v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/008157667f70/nihpp-2024.08.16.608331v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/2ebb02ecebd3/nihpp-2024.08.16.608331v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/e112a6588aba/nihpp-2024.08.16.608331v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/13aa67147c79/nihpp-2024.08.16.608331v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f31/11370364/008157667f70/nihpp-2024.08.16.608331v1-f0004.jpg

相似文献

1
DeepSomatic: Accurate somatic small variant discovery for multiple sequencing technologies.DeepSomatic:适用于多种测序技术的精确体细胞小变异发现方法。
bioRxiv. 2024 Aug 19:2024.08.16.608331. doi: 10.1101/2024.08.16.608331.
2
Benchmarking reveals superiority of deep learning variant callers on bacterial nanopore sequence data.基准测试显示深度学习变异调用程序在细菌纳米孔测序数据上的优越性。
Elife. 2024 Oct 10;13:RP98300. doi: 10.7554/eLife.98300.
3
A Comparison of Structural Variant Calling from Short-Read and Nanopore-Based Whole-Genome Sequencing Using Optical Genome Mapping as a Benchmark.基于光学基因组图谱作为基准的短读长和纳米孔全基因组测序的结构变异调用比较。
Genes (Basel). 2024 Jul 16;15(7):925. doi: 10.3390/genes15070925.
4
Performance of somatic structural variant calling in lung cancer using Oxford Nanopore sequencing technology.利用牛津纳米孔测序技术在肺癌中进行体细胞结构变异检测的性能。
BMC Genomics. 2024 Sep 30;25(1):898. doi: 10.1186/s12864-024-10792-3.
5
Performance analysis of conventional and AI-based variant callers using short and long reads.使用短读长读对常规和基于人工智能的变异调用程序进行性能分析。
BMC Bioinformatics. 2023 Dec 14;24(1):472. doi: 10.1186/s12859-023-05596-3.
6
Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology.联合使用牛津纳米孔和 Illumina 测序技术揭示了大豆结构变异生物学的见解。
BMC Biol. 2022 Feb 23;20(1):53. doi: 10.1186/s12915-022-01255-w.
7
Comprehensive de novo mutation discovery with HiFi long-read sequencing.利用 HiFi 长读测序进行全面的从头突变发现。
Genome Med. 2023 May 8;15(1):34. doi: 10.1186/s13073-023-01183-6.
8
The long and short of it: benchmarking viromics using Illumina, Nanopore and PacBio sequencing technologies.简而言之:使用Illumina、Nanopore和PacBio测序技术对病毒组进行基准测试。
Microb Genom. 2024 Feb;10(2). doi: 10.1099/mgen.0.001198.
9
Whole-genome long-read sequencing downsampling and its effect on variant-calling precision and recall.全基因组长读长测序下采样及其对变异calling 精度和召回率的影响。
Genome Res. 2023 Dec 27;33(12):2029-2040. doi: 10.1101/gr.278070.123.
10
Highly accurate long reads are crucial for realizing the potential of biodiversity genomics.高质量的长读长序列对于实现生物多样性基因组学的潜力至关重要。
BMC Genomics. 2023 Mar 16;24(1):117. doi: 10.1186/s12864-023-09193-9.

本文引用的文献

1
Local read haplotagging enables accurate long-read small variant calling.局部读取标签化可实现长读长小型变异calling 的准确性。
Nat Commun. 2024 Jul 13;15(1):5907. doi: 10.1038/s41467-024-50079-5.
2
Symphonizing pileup and full-alignment for deep learning-based long-read variant calling.基于深度学习的长读变异调用的交响乐堆积和全对齐。
Nat Comput Sci. 2022 Dec;2(12):797-803. doi: 10.1038/s43588-022-00387-x. Epub 2022 Dec 19.
3
Long-Read DNA and RNA Sequencing to Streamline Clinical Genetic Testing and Reduce Barriers to Comprehensive Genetic Testing.
长读 DNA 和 RNA 测序简化临床基因检测并减少全面基因检测的障碍。
J Appl Lab Med. 2024 Jan 3;9(1):138-150. doi: 10.1093/jalm/jfad107.
4
Assigning mutational signatures to individual samples and individual somatic mutations with SigProfilerAssignment.使用 SigProfilerAssignment 将突变特征分配给个体样本和个体体细胞突变。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad756.
5
Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation.可扩展的纳米孔测序技术对人类基因组进行测序,提供了全面的单倍型分辨率变异和甲基化视图。
Nat Methods. 2023 Oct;20(10):1483-1492. doi: 10.1038/s41592-023-01993-x. Epub 2023 Sep 14.
6
A critical spotlight on the paradigms of FFPE-DNA sequencing.对 FFPE-DNA 测序范式的批判性关注。
Nucleic Acids Res. 2023 Aug 11;51(14):7143-7162. doi: 10.1093/nar/gkad519.
7
Improving variant calling using population data and deep learning.利用群体数据和深度学习提高变异calling 的准确性。
BMC Bioinformatics. 2023 May 12;24(1):197. doi: 10.1186/s12859-023-05294-0.
8
Towards an accurate and robust analysis pipeline for somatic mutation calling.迈向用于体细胞突变检测的准确且稳健的分析流程。
Front Genet. 2022 Nov 15;13:979928. doi: 10.3389/fgene.2022.979928. eCollection 2022.
9
Detection of oncogenic and clinically actionable mutations in cancer genomes critically depends on variant calling tools.在癌症基因组中检测致癌和具有临床可操作性的突变,关键取决于变异调用工具。
Bioinformatics. 2022 Jun 13;38(12):3181-3191. doi: 10.1093/bioinformatics/btac306.
10
Branching clonal evolution patterns predominate mutational landscape in multiple myeloma.分支克隆进化模式在多发性骨髓瘤的突变图谱中占主导地位。
Am J Cancer Res. 2021 Nov 15;11(11):5659-5679. eCollection 2021.