• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过蛋白质功能改变偏向量化癌症中的基因选择。

Quantifying gene selection in cancer through protein functional alteration bias.

机构信息

School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel.

Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Israel.

出版信息

Nucleic Acids Res. 2019 Jul 26;47(13):6642-6655. doi: 10.1093/nar/gkz546.

DOI:10.1093/nar/gkz546
PMID:31334812
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6649814/
Abstract

Compiling the catalogue of genes actively involved in cancer is an ongoing endeavor, with profound implications to the understanding and treatment of the disease. An abundance of computational methods have been developed to screening the genome for candidate driver genes based on genomic data of somatic mutations in tumors. Existing methods make many implicit and explicit assumptions about the distribution of random mutations. We present FABRIC, a new framework for quantifying the selection of genes in cancer by assessing the effects of de-novo somatic mutations on protein-coding genes. Using a machine-learning model, we quantified the functional effects of ∼3M somatic mutations extracted from over 10 000 human cancerous samples, and compared them against the effects of all possible single-nucleotide mutations in the coding human genome. We detected 593 protein-coding genes showing statistically significant bias towards harmful mutations. These genes, discovered without any prior knowledge, show an overwhelming overlap with known cancer genes, but also include many overlooked genes. FABRIC is designed to avoid false discoveries by comparing each gene to its own background model using rigorous statistics, making minimal assumptions about the distribution of random somatic mutations. The framework is an open-source project with a simple command-line interface.

摘要

编制积极参与癌症的基因目录是一项持续的努力,对理解和治疗这种疾病具有深远的意义。已经开发了大量的计算方法,基于肿瘤体细胞突变的基因组数据筛选候选驱动基因。现有的方法对随机突变的分布做出了许多隐含和显式的假设。我们提出了 FABRIC,这是一种通过评估新生体突变对蛋白质编码基因的影响来量化癌症中基因选择的新框架。我们使用机器学习模型,量化了从超过 10000 个人类癌症样本中提取的约 300 万个体细胞突变的功能效应,并将其与编码人类基因组中所有可能的单核苷酸突变的效应进行了比较。我们检测到 593 个蛋白质编码基因显示出对有害突变的显著偏向。这些基因是在没有任何先验知识的情况下发现的,与已知的癌症基因有压倒性的重叠,但也包括许多被忽视的基因。 FABRIC 通过使用严格的统计学将每个基因与其自身的背景模型进行比较,从而避免了假发现,对随机体细胞突变的分布做出了最小的假设。该框架是一个开源项目,具有简单的命令行界面。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/baeb6a3e0ec5/gkz546fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/33ac45135e4f/gkz546fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/1a3c643d75ee/gkz546fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/c7ed44e9cc22/gkz546fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/eea6a8f79b18/gkz546fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/baeb6a3e0ec5/gkz546fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/33ac45135e4f/gkz546fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/1a3c643d75ee/gkz546fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/c7ed44e9cc22/gkz546fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/eea6a8f79b18/gkz546fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c67/6649814/baeb6a3e0ec5/gkz546fig5.jpg

相似文献

1
Quantifying gene selection in cancer through protein functional alteration bias.通过蛋白质功能改变偏向量化癌症中的基因选择。
Nucleic Acids Res. 2019 Jul 26;47(13):6642-6655. doi: 10.1093/nar/gkz546.
2
Identifying Driver Interfaces Enriched for Somatic Missense Mutations in Tumors.识别在肿瘤中因体细胞错义突变而富集的驱动接口。
Methods Mol Biol. 2019;1907:51-72. doi: 10.1007/978-1-4939-8967-6_4.
3
Predicting the functional consequences of somatic missense mutations found in tumors.预测肿瘤中发现的体细胞错义突变的功能后果。
Methods Mol Biol. 2014;1101:135-59. doi: 10.1007/978-1-62703-721-1_8.
4
Frequent mutations in acetylation and ubiquitination sites suggest novel driver mechanisms of cancer.乙酰化和泛素化位点的频繁突变提示了癌症新的驱动机制。
Genome Med. 2016 May 12;8(1):55. doi: 10.1186/s13073-016-0311-2.
5
Protein dynamics analysis identifies candidate cancer driver genes and mutations in TCGA data.蛋白质动力学分析在TCGA数据中识别出候选癌症驱动基因和突变。
Proteins. 2021 Jun;89(6):721-730. doi: 10.1002/prot.26054. Epub 2021 Feb 16.
6
The FABRIC Cancer Portal: A Ranked Catalogue of Gene Selection in Tumors Over the Human Coding Genome.FABRIC 癌症门户:人类编码基因组中肿瘤基因选择的排名目录。
Cancer Res. 2021 Feb 15;81(4):1178-1185. doi: 10.1158/0008-5472.CAN-20-3147. Epub 2020 Dec 4.
7
Gene characteristics predicting missense, nonsense and frameshift mutations in tumor samples.预测肿瘤样本中错义、无义和移码突变的基因特征。
BMC Bioinformatics. 2018 Nov 19;19(1):430. doi: 10.1186/s12859-018-2455-0.
8
Comparison of different functional prediction scores using a gene-based permutation model for identifying cancer driver genes.基于基因排列模型的不同功能预测评分比较,用于鉴定癌症驱动基因。
BMC Med Genomics. 2019 Jan 31;12(Suppl 1):22. doi: 10.1186/s12920-018-0452-9.
9
DriverML: a machine learning algorithm for identifying driver genes in cancer sequencing studies.DriverML:一种用于鉴定癌症测序研究中驱动基因的机器学习算法。
Nucleic Acids Res. 2019 May 7;47(8):e45. doi: 10.1093/nar/gkz096.
10
Cancer Gene Discovery by Network Analysis of Somatic Mutations Using the MUFFINN Server.使用MUFFINN服务器通过体细胞突变的网络分析发现癌症基因
Methods Mol Biol. 2019;1907:37-50. doi: 10.1007/978-1-4939-8967-6_3.

引用本文的文献

1
Integrative machine learning approach to risk prediction for dementia and Alzheimer's disease.用于痴呆症和阿尔茨海默病风险预测的综合机器学习方法。
Geroscience. 2025 Aug 27. doi: 10.1007/s11357-025-01828-x.
2
PWAS Hub: exploring gene-based associations of complex diseases with sex dependency.全基因组关联研究(PWAS)中心:探索复杂疾病基于基因的关联及其性别依赖性。
Nucleic Acids Res. 2025 Jan 6;53(D1):D1132-D1143. doi: 10.1093/nar/gkae1125.
3
Multiome-wide Association Studies: Novel Approaches for Understanding Diseases.全基因组关联研究:理解疾病的新方法

本文引用的文献

1
Comprehensive Characterization of Cancer Driver Genes and Mutations.全面描绘癌症驱动基因和突变。
Cell. 2018 Apr 5;173(2):371-385.e18. doi: 10.1016/j.cell.2018.02.060.
2
ClinVar: improving access to variant interpretations and supporting evidence.ClinVar:改善变异解读和支持证据的获取。
Nucleic Acids Res. 2018 Jan 4;46(D1):D1062-D1067. doi: 10.1093/nar/gkx1153.
3
Universal Patterns of Selection in Cancer and Somatic Tissues.癌症和体细胞组织中的普遍选择模式。
Genomics Proteomics Bioinformatics. 2024 Dec 3;22(5). doi: 10.1093/gpbjnl/qzae077.
4
PWAS Hub for exploring gene-based associations of common complex diseases.PWAS 枢纽,用于探索常见复杂疾病的基于基因的关联。
Genome Res. 2024 Oct 29;34(10):1674-1686. doi: 10.1101/gr.278916.123.
5
Discovering predisposing genes for hereditary breast cancer using deep learning.利用深度学习发现遗传性乳腺癌的易患基因。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae346.
6
Post-implantation analysis of genomic variations in the progeny from developing fetus to birth.从发育中的胎儿到出生后的后代中基因组变异的植入后分析。
Hum Genomics. 2024 Jul 15;18(1):79. doi: 10.1186/s40246-024-00634-4.
7
Revealing the genetic complexity of hypothyroidism: integrating complementary association methods.揭示甲状腺功能减退症的遗传复杂性:整合互补关联方法。
Front Genet. 2024 Jun 11;15:1409226. doi: 10.3389/fgene.2024.1409226. eCollection 2024.
8
Gene-based association study reveals a distinct female genetic signal in primary hypertension.基于基因的关联研究揭示了原发性高血压中一种独特的女性遗传信号。
Hum Genet. 2023 Jul;142(7):863-878. doi: 10.1007/s00439-023-02567-9. Epub 2023 May 3.
9
Evolutionary and functional lessons from human-specific amino acid substitution matrices.来自人类特异性氨基酸替换矩阵的进化与功能启示。
NAR Genom Bioinform. 2021 Sep 16;3(3):lqab079. doi: 10.1093/nargab/lqab079. eCollection 2021 Sep.
10
PWAS: proteome-wide association study-linking genes and phenotypes by functional variation in proteins.PWAS:通过蛋白质功能变异将基因与表型联系起来的蛋白质组范围关联研究。
Genome Biol. 2020 Jul 14;21(1):173. doi: 10.1186/s13059-020-02089-x.
Cell. 2017 Nov 16;171(5):1029-1041.e21. doi: 10.1016/j.cell.2017.09.042. Epub 2017 Oct 19.
4
Differential analysis between somatic mutation and germline variation profiles reveals cancer-related genes.体细胞突变和种系变异图谱之间的差异分析揭示了癌症相关基因。
Genome Med. 2017 Aug 25;9(1):79. doi: 10.1186/s13073-017-0465-6.
5
Comparison of algorithms for the detection of cancer drivers at subgene resolution.亚基因分辨率下癌症驱动基因检测算法的比较
Nat Methods. 2017 Aug;14(8):782-788. doi: 10.1038/nmeth.4364. Epub 2017 Jul 17.
6
A variant by any name: quantifying annotation discordance across tools and clinical databases.无论名称如何的变体:量化不同工具和临床数据库之间的注释不一致性。
Genome Med. 2017 Jan 26;9(1):7. doi: 10.1186/s13073-016-0396-7.
7
Mutation effects predicted from sequence co-variation.根据序列共变预测的突变效应。
Nat Biotechnol. 2017 Feb;35(2):128-135. doi: 10.1038/nbt.3769. Epub 2017 Jan 16.
8
DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants.DisGeNET:一个整合人类疾病相关基因和变异信息的综合平台。
Nucleic Acids Res. 2017 Jan 4;45(D1):D833-D839. doi: 10.1093/nar/gkw943. Epub 2016 Oct 19.
9
Evaluating the evaluation of cancer driver genes.评估癌症驱动基因的评估。
Proc Natl Acad Sci U S A. 2016 Dec 13;113(50):14330-14335. doi: 10.1073/pnas.1616440113. Epub 2016 Nov 22.
10
COSMIC: High-Resolution Cancer Genetics Using the Catalogue of Somatic Mutations in Cancer.COSMIC:利用癌症体细胞突变目录进行高分辨率癌症遗传学研究
Curr Protoc Hum Genet. 2016 Oct 11;91:10.11.1-10.11.37. doi: 10.1002/cphg.21.