• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习预测突变功能。

Predicting mutational function using machine learning.

机构信息

Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA.

Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA; Bioinformatics and Computational Biology Program, University of Minnesota, Minneapolis, MN 55455, USA.

出版信息

Mutat Res Rev Mutat Res. 2023 Jan-Jun;791:108457. doi: 10.1016/j.mrrev.2023.108457. Epub 2023 Mar 23.

DOI:10.1016/j.mrrev.2023.108457
PMID:36965820
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10239318/
Abstract

Genetic variations are one of the major causes of phenotypic variations between human individuals. Although beneficial as being the substrate of evolution, germline mutations may cause diseases, including Mendelian diseases and complex diseases such as diabetes and heart diseases. Mutations occurring in somatic cells are a main cause of cancer and likely cause age-related phenotypes and other age-related diseases. Because of the high abundance of genetic variations in the human genome, i.e., millions of germline variations per human subject and thousands of additional somatic mutations per cell, it is technically challenging to experimentally verify the function of every possible mutation and their interactions. Significant progress has been made to solve this problem using computational approaches, especially machine learning (ML). Here, we review the progress and achievements made in recent years in this field of research. We classify the computational models in two ways: one according to their prediction goals including protein structural alterations, gene expression changes, and disease risks, and the other according to their methodologies, including non-machine learning methods, classical machine learning methods, and deep neural network methods. For models in each category, we discuss their architecture, prediction accuracy, and potential limitations. This review provides new insights into the applications and future directions of computational approaches in understanding the role of mutations in aging and disease.

摘要

遗传变异是人类个体之间表型变异的主要原因之一。虽然作为进化的基础是有益的,但种系突变可能导致疾病,包括孟德尔疾病和糖尿病、心脏病等复杂疾病。体细胞中的突变是癌症的主要原因,并可能导致与年龄相关的表型和其他与年龄相关的疾病。由于人类基因组中遗传变异的高度丰富性,即每个人类个体有上百万种种系变异和每个细胞有数千种额外的体细胞突变,因此实验验证每个可能的突变及其相互作用的功能具有技术挑战性。利用计算方法,特别是机器学习 (ML),在解决这个问题方面取得了重大进展。在这里,我们回顾了近年来在这一研究领域取得的进展和成就。我们根据预测目标将计算模型分为两类:一类是根据其预测目标进行分类,包括蛋白质结构改变、基因表达变化和疾病风险,另一类是根据其方法学进行分类,包括非机器学习方法、经典机器学习方法和深度神经网络方法。对于每个类别的模型,我们讨论了它们的架构、预测准确性和潜在的局限性。本综述为理解突变在衰老和疾病中的作用提供了计算方法应用和未来方向的新见解。

相似文献

1
Predicting mutational function using machine learning.利用机器学习预测突变功能。
Mutat Res Rev Mutat Res. 2023 Jan-Jun;791:108457. doi: 10.1016/j.mrrev.2023.108457. Epub 2023 Mar 23.
2
Machine learning methods for prediction of cancer driver genes: a survey paper.机器学习方法在癌症驱动基因预测中的应用:综述论文。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac062.
3
Predicting cancer-associated germline variations in proteins.预测蛋白质中的与癌症相关的种系变异。
BMC Genomics. 2012 Jun 18;13 Suppl 4(Suppl 4):S8. doi: 10.1186/1471-2164-13-S4-S8.
4
Predicting mutant outcome by combining deep mutational scanning and machine learning.通过结合深度突变扫描和机器学习预测突变结果。
Proteins. 2022 Jan;90(1):45-57. doi: 10.1002/prot.26184. Epub 2021 Jul 31.
5
Direct mutation analysis by high-throughput sequencing: from germline to low-abundant, somatic variants.高通量测序的直接突变分析:从种系到低丰度、体细胞变异。
Mutat Res. 2012 Jan 3;729(1-2):1-15. doi: 10.1016/j.mrfmmm.2011.10.001. Epub 2011 Oct 12.
6
A computational approach to distinguish somatic vs. germline origin of genomic alterations from deep sequencing of cancer specimens without a matched normal.一种计算方法,用于从无匹配正常样本的癌症标本深度测序中区分基因组改变的体细胞起源与种系起源。
PLoS Comput Biol. 2018 Feb 7;14(2):e1005965. doi: 10.1371/journal.pcbi.1005965. eCollection 2018 Feb.
7
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
8
Whole-genome sequencing with long reads reveals complex structure and origin of structural variation in human genetic variations and somatic mutations in cancer.全基因组测序与长读长揭示了人类遗传变异和癌症体细胞突变中结构变异的复杂结构和起源。
Genome Med. 2021 Apr 29;13(1):65. doi: 10.1186/s13073-021-00883-1.
9
Machine Learning Classification and Structure-Functional Analysis of Cancer Mutations Reveal Unique Dynamic and Network Signatures of Driver Sites in Oncogenes and Tumor Suppressor Genes.机器学习分类和癌症突变的结构-功能分析揭示了癌基因和肿瘤抑制基因中驱动位点的独特动态和网络特征。
J Chem Inf Model. 2018 Oct 22;58(10):2131-2150. doi: 10.1021/acs.jcim.8b00414. Epub 2018 Oct 3.
10
Machine learning random forest for predicting oncosomatic variant NGS analysis.机器学习随机森林预测肿瘤体细胞变异 NGS 分析。
Sci Rep. 2021 Nov 8;11(1):21820. doi: 10.1038/s41598-021-01253-y.

引用本文的文献

1
Prediction of cancer driver genes and mutations: the potential of integrative computational frameworks.预测癌症驱动基因和突变:集成计算框架的潜力。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbad519.

本文引用的文献

1
Classification of non-coding variants with high pathogenic impact.高致病性非编码变异分类。
PLoS Genet. 2022 Apr 29;18(4):e1010191. doi: 10.1371/journal.pgen.1010191. eCollection 2022 Apr.
2
Single-cell analysis of somatic mutations in human bronchial epithelial cells in relation to aging and smoking.单细胞分析人类支气管上皮细胞中的体细胞突变与衰老和吸烟的关系。
Nat Genet. 2022 Apr;54(4):492-498. doi: 10.1038/s41588-022-01035-w. Epub 2022 Apr 11.
3
Researchers turn to deep learning to decode protein structures.研究人员借助深度学习来解析蛋白质结构。
Proc Natl Acad Sci U S A. 2022 Mar 8;119(10):e2202107119. doi: 10.1073/pnas.2202107119. Epub 2022 Mar 2.
4
Disease variant prediction with deep generative models of evolutionary data.利用进化数据的深度生成模型进行疾病变异预测。
Nature. 2021 Nov;599(7883):91-95. doi: 10.1038/s41586-021-04043-8. Epub 2021 Oct 27.
5
Effective gene expression prediction from sequence by integrating long-range interactions.通过整合长程相互作用,从序列中有效预测基因表达。
Nat Methods. 2021 Oct;18(10):1196-1203. doi: 10.1038/s41592-021-01252-x. Epub 2021 Oct 4.
6
A wider field of view to predict expression.更广阔的视野预测表达。
Nat Methods. 2021 Oct;18(10):1155-1156. doi: 10.1038/s41592-021-01259-4.
7
Decoding disease: from genomes to networks to phenotypes.解码疾病:从基因组到网络再到表型。
Nat Rev Genet. 2021 Dec;22(12):774-790. doi: 10.1038/s41576-021-00389-x. Epub 2021 Aug 2.
8
Assessment of the CASP14 assembly predictions.CASP14 组装预测评估。
Proteins. 2021 Dec;89(12):1787-1799. doi: 10.1002/prot.26199. Epub 2021 Aug 31.
9
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
10
Machine learning in protein structure prediction.机器学习在蛋白质结构预测中的应用。
Curr Opin Chem Biol. 2021 Dec;65:1-8. doi: 10.1016/j.cbpa.2021.04.005. Epub 2021 May 18.