• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组范围内体细胞突变率的绘制揭示了癌症的驱动因素。

Genome-wide mapping of somatic mutation rates uncovers drivers of cancer.

机构信息

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.

Harvard-MIT Health Sciences and Technology Program, Cambridge, MA, USA.

出版信息

Nat Biotechnol. 2022 Nov;40(11):1634-1643. doi: 10.1038/s41587-022-01353-8. Epub 2022 Jun 20.

DOI:10.1038/s41587-022-01353-8
PMID:35726091
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9646522/
Abstract

Identification of cancer driver mutations that confer a proliferative advantage is central to understanding cancer; however, searches have often been limited to protein-coding sequences and specific non-coding elements (for example, promoters) because of the challenge of modeling the highly variable somatic mutation rates observed across tumor genomes. Here we present Dig, a method to search for driver elements and mutations anywhere in the genome. We use deep neural networks to map cancer-specific mutation rates genome-wide at kilobase-scale resolution. These estimates are then refined to search for evidence of driver mutations under positive selection throughout the genome by comparing observed to expected mutation counts. We mapped mutation rates for 37 cancer types and applied these maps to identify putative drivers within intronic cryptic splice regions, 5' untranslated regions and infrequently mutated genes. Our high-resolution mutation rate maps, available for web-based exploration, are a resource to enable driver discovery genome-wide.

摘要

鉴定赋予增殖优势的癌症驱动突变是理解癌症的核心;然而,由于难以对肿瘤基因组中观察到的高度可变的体细胞突变率进行建模,搜索通常仅限于蛋白质编码序列和特定的非编码元件(例如启动子)。在这里,我们提出了 Dig,这是一种在基因组的任何位置搜索驱动元件和突变的方法。我们使用深度神经网络在千碱基分辨率的全基因组范围内绘制癌症特异性突变率图。然后,通过将观察到的突变计数与预期的突变计数进行比较,对这些估计值进行细化,以在整个基因组中搜索正选择驱动突变的证据。我们绘制了 37 种癌症类型的突变率图,并将这些图谱应用于识别内含子隐匿剪接区、5'非翻译区和突变频率较低的基因中的潜在驱动因子。我们的高分辨率突变率图谱可在网上进行探索,是一个有助于在全基因组范围内发现驱动因子的资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/b37003729ada/41587_2022_1353_Fig8_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/0baa20b755ba/41587_2022_1353_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/3f8264309794/41587_2022_1353_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/c583af8c36d3/41587_2022_1353_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/33fdad2f7f5f/41587_2022_1353_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/aac9a2ca80f2/41587_2022_1353_Fig5_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/21e90d85ce28/41587_2022_1353_Fig6_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/5c8d7a15d2ca/41587_2022_1353_Fig7_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/b37003729ada/41587_2022_1353_Fig8_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/0baa20b755ba/41587_2022_1353_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/3f8264309794/41587_2022_1353_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/c583af8c36d3/41587_2022_1353_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/33fdad2f7f5f/41587_2022_1353_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/aac9a2ca80f2/41587_2022_1353_Fig5_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/21e90d85ce28/41587_2022_1353_Fig6_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/5c8d7a15d2ca/41587_2022_1353_Fig7_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa04/9646522/b37003729ada/41587_2022_1353_Fig8_ESM.jpg

相似文献

1
Genome-wide mapping of somatic mutation rates uncovers drivers of cancer.全基因组范围内体细胞突变率的绘制揭示了癌症的驱动因素。
Nat Biotechnol. 2022 Nov;40(11):1634-1643. doi: 10.1038/s41587-022-01353-8. Epub 2022 Jun 20.
2
OncodriveFML: a general framework to identify coding and non-coding regions with cancer driver mutations.OncodriveFML:一种识别具有癌症驱动突变的编码和非编码区域的通用框架。
Genome Biol. 2016 Jun 16;17(1):128. doi: 10.1186/s13059-016-0994-0.
3
CNCDatabase: a database of non-coding cancer drivers.CNCDatabase:一个非编码癌症驱动因子数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1094-D1101. doi: 10.1093/nar/gkaa915.
4
Identification of coding and non-coding mutational hotspots in cancer genomes.癌症基因组中编码和非编码突变热点的鉴定。
BMC Genomics. 2017 Jan 5;18(1):17. doi: 10.1186/s12864-016-3420-9.
5
NIMBus: a negative binomial regression based Integrative Method for mutation Burden Analysis.NIMBus:一种基于负二项回归的突变负担分析综合方法。
BMC Bioinformatics. 2020 Oct 22;21(1):474. doi: 10.1186/s12859-020-03758-1.
6
Association of mutation signature effectuating processes with mutation hotspots in driver genes and non-coding regions.关联突变特征执行过程与驱动基因和非编码区域突变热点。
Nat Commun. 2022 Jan 10;13(1):178. doi: 10.1038/s41467-021-27792-6.
7
Analyses of non-coding somatic drivers in 2,658 cancer whole genomes.分析 2658 个癌症全基因组中的非编码体细胞驱动因子。
Nature. 2020 Feb;578(7793):102-111. doi: 10.1038/s41586-020-1965-x. Epub 2020 Feb 5.
8
Differential DNA repair underlies mutation hotspots at active promoters in cancer genomes.差异 DNA 修复是癌症基因组中活跃启动子突变热点的基础。
Nature. 2016 Apr 14;532(7598):259-63. doi: 10.1038/nature17437.
9
Cancer regulatory variation.癌症调控变异。
Curr Opin Genet Dev. 2021 Feb;66:41-49. doi: 10.1016/j.gde.2020.11.010. Epub 2021 Jan 7.
10
Candidate Cancer Driver Mutations in Distal Regulatory Elements and Long-Range Chromatin Interaction Networks.远端调控元件和长程染色质互作网络中的候选癌症驱动突变。
Mol Cell. 2020 Mar 19;77(6):1307-1321.e10. doi: 10.1016/j.molcel.2019.12.027. Epub 2020 Jan 17.

引用本文的文献

1
Single-Cell RNA Sequencing Integrated with Bulk-RNA Sequencing Analysis Reveals Prognostic Signatures Based on PANoptosis in Hepatocellular Carcinoma.单细胞RNA测序与批量RNA测序分析相结合揭示基于肝细胞癌PAN凋亡的预后特征
J Hepatocell Carcinoma. 2025 Jul 29;12:1661-1676. doi: 10.2147/JHC.S533777. eCollection 2025.
2
A review of the use of tumour DNA methylation for breast cancer subtyping and prediction of outcomes.肿瘤DNA甲基化在乳腺癌亚型分类及预后预测中的应用综述。
Clin Epigenetics. 2025 Jul 2;17(1):109. doi: 10.1186/s13148-025-01922-z.
3
Progress and challenges of artificial intelligence in lung cancer clinical translation.

本文引用的文献

1
Illuminating the noncoding genome in cancer.揭示癌症中的非编码基因组。
Nat Cancer. 2020 Sep;1(9):864-872. doi: 10.1038/s43018-020-00114-3. Epub 2020 Sep 14.
2
Synonymous mutations reveal genome-wide levels of positive selection in healthy tissues.同义突变揭示了健康组织中全基因组水平的正选择。
Nat Genet. 2021 Nov;53(11):1597-1605. doi: 10.1038/s41588-021-00957-1. Epub 2021 Nov 4.
3
Effective gene expression prediction from sequence by integrating long-range interactions.通过整合长程相互作用,从序列中有效预测基因表达。
人工智能在肺癌临床转化中的进展与挑战
NPJ Precis Oncol. 2025 Jul 1;9(1):210. doi: 10.1038/s41698-025-00986-7.
4
Cancer gene identification from RNA variant allelic frequencies using RVdriver.使用RVdriver从RNA变异等位基因频率中鉴定癌症基因。
Genome Biol. 2025 Jun 13;26(1):165. doi: 10.1186/s13059-025-03557-y.
5
Genomic landscape of multiple myeloma and its precursor conditions.多发性骨髓瘤及其前驱疾病的基因组格局。
Nat Genet. 2025 May 21. doi: 10.1038/s41588-025-02196-0.
6
Element-specific estimation of background mutation rates in whole cancer genomes through transfer learning.通过迁移学习对全癌基因组中背景突变率进行元素特异性估计。
NPJ Precis Oncol. 2025 Mar 29;9(1):92. doi: 10.1038/s41698-025-00871-3.
7
Decoding the functional impact of the cancer genome through protein-protein interactions.通过蛋白质-蛋白质相互作用解码癌症基因组的功能影响。
Nat Rev Cancer. 2025 Mar;25(3):189-208. doi: 10.1038/s41568-024-00784-6. Epub 2025 Jan 14.
8
Translational Approach to Social Isolation During a Global Pandemic: Hippocampal Somatic Mutation and Stress.全球大流行期间社会隔离的转化研究方法:海马体细胞突变与压力
Psychiatry Investig. 2024 Dec;21(12):1360-1371. doi: 10.30773/pi.2024.0178. Epub 2024 Dec 23.
9
The theory of massively repeated evolution and full identifications of cancer-driving nucleotides (CDNs).大规模重复进化理论与癌症驱动核苷酸(CDN)的完全鉴定。
Elife. 2024 Dec 17;13:RP99340. doi: 10.7554/eLife.99340.
10
Characterization of cancer-driving nucleotides (CDNs) across genes, cancer types, and patients.跨基因、癌症类型和患者对癌症驱动核苷酸(CDN)的特征描述。
Elife. 2024 Dec 17;13:RP99341. doi: 10.7554/eLife.99341.
Nat Methods. 2021 Oct;18(10):1196-1203. doi: 10.1038/s41592-021-01252-x. Epub 2021 Oct 4.
4
A body map of somatic mutagenesis in morphologically normal human tissues.形态正常人体组织中的躯体突变体图谱。
Nature. 2021 Sep;597(7876):398-403. doi: 10.1038/s41586-021-03836-1. Epub 2021 Aug 25.
5
The mutational landscape of human somatic and germline cells.人类体细胞和生殖细胞的突变景观。
Nature. 2021 Sep;597(7876):381-386. doi: 10.1038/s41586-021-03822-7. Epub 2021 Aug 25.
6
In silico saturation mutagenesis of cancer genes.癌症基因的计算机饱和诱变。
Nature. 2021 Aug;596(7872):428-432. doi: 10.1038/s41586-021-03771-1. Epub 2021 Jul 28.
7
Few-shot learning creates predictive models of drug response that translate from high-throughput screens to individual patients.少样本学习创建了药物反应的预测模型,这些模型可以从高通量筛选转化到个体患者身上。
Nat Cancer. 2021 Feb;2(2):233-244. doi: 10.1038/s43018-020-00169-2. Epub 2021 Jan 25.
8
The polarity protein PARD3 and cancer.极性蛋白PARD3与癌症。
Oncogene. 2021 Jun;40(25):4245-4262. doi: 10.1038/s41388-021-01813-6. Epub 2021 Jun 7.
9
Non-coding region variants upstream of MEF2C cause severe developmental disorder through three distinct loss-of-function mechanisms.非编码区 MEF2C 上游的变异通过三种不同的功能丧失机制导致严重的发育障碍。
Am J Hum Genet. 2021 Jun 3;108(6):1083-1094. doi: 10.1016/j.ajhg.2021.04.025. Epub 2021 May 21.
10
Base-resolution models of transcription-factor binding reveal soft motif syntax.基于分辨率的转录因子结合模型揭示了软基序语法。
Nat Genet. 2021 Mar;53(3):354-366. doi: 10.1038/s41588-021-00782-6. Epub 2021 Feb 18.