对具有……的致病性错义变体进行准确鉴定和机制评估。（原文结尾不完整，翻译可能不太准确，需结合完整原文理解）

Accurate identification and mechanistic evaluation of pathogenic missense variants with .

作者信息

Banerjee Anupam, Bogetti Anthony T, Bahar Ivet

机构信息

Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794.

Department of Biochemistry and Cell Biology, Renaissance School of Medicine, Stony Brook University, Stony Brook, NY 11794.

出版信息

Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2418100122. doi: 10.1073/pnas.2418100122. Epub 2025 May 2.

DOI:10.1073/pnas.2418100122

PMID:40314982

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12067267/

Abstract

Understanding the effects of missense mutations or single amino acid variants (SAVs) on protein function is crucial for elucidating the molecular basis of diseases/disorders and designing rational therapies. We introduce here , a machine learning tool for discriminating pathogenic and neutral SAVs, significantly expanding on a precursor limited by the availability of structural data. With the advent of AlphaFold2 as a powerful tool for structure prediction, is trained on a significantly expanded dataset of 117,525 SAVs corresponding to 12,094 human proteins reported in the ClinVar database. Adopting a broad set of descriptors composed of sequence evolutionary, structural, dynamic, and energetics features in the training algorithm, achieved an AUROC of 0.94 in 10-fold cross-validation when all SAVs of a particular test protein (mutant) were excluded from the training set. Benchmarking against a variety of testing datasets demonstrated the high performance of . While sequence evolutionary descriptors play a dominant role in pathogenicity prediction, those based on structural dynamics provide a mechanistic interpretation. Notably, residues involved in allosteric communication and those distinguished by pronounced fluctuations in the high-frequency modes of motion or subject to spatial constraints in soft modes usually give rise to pathogenicity when mutated. Overall, provides an efficient and transparent tool for accurately predicting the pathogenicity of SAVs and unraveling the mechanistic basis of the observed behavior, thus advancing our understanding of genotype-to-phenotype relations.

摘要

了解错义突变或单氨基酸变体（SAVs）对蛋白质功能的影响对于阐明疾病/病症的分子基础和设计合理的治疗方法至关重要。我们在此介绍一种用于区分致病性和中性SAVs的机器学习工具，它在很大程度上扩展了受结构数据可用性限制的前身工具。随着AlphaFold2作为一种强大的结构预测工具的出现，该工具在ClinVar数据库中报告的对应于12,094种人类蛋白质的117,525个SAVs的显著扩展数据集上进行了训练。在训练算法中采用由序列进化、结构、动力学和能量学特征组成的广泛描述符集，当将特定测试蛋白质（突变体）的所有SAVs从训练集中排除时，该工具在10折交叉验证中实现了0.94的曲线下面积（AUROC）。针对各种测试数据集的基准测试证明了该工具的高性能。虽然序列进化描述符在致病性预测中起主导作用，但基于结构动力学的描述符提供了一种机理解释。值得注意的是，参与变构通讯的残基以及那些在高频运动模式中表现出明显波动或在软模式中受到空间限制的残基在发生突变时通常会导致致病性。总体而言，该工具为准确预测SAVs的致病性和揭示观察到的行为的机理解释提供了一种高效且透明的工具，从而推进了我们对基因型与表型关系的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2eb1/12067267/5c3ed6901929/pnas.2418100122fig01.jpg

相似文献

Accurate identification and mechanistic evaluation of pathogenic missense variants with .对具有……的致病性错义变体进行准确鉴定和机制评估。（原文结尾不完整，翻译可能不太准确，需结合完整原文理解）

Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2418100122. doi: 10.1073/pnas.2418100122. Epub 2025 May 2.

Accurate Identification and Mechanistic Evaluation of Pathogenic Missense Variants with .使用……对致病性错义变异进行准确鉴定和机制评估

bioRxiv. 2025 Mar 6:2025.02.17.638727. doi: 10.1101/2025.02.17.638727.

Rhapsody: predicting the pathogenicity of human missense variants.Rhapsody：预测人类错义变异的致病性。

Bioinformatics. 2020 May 1;36(10):3084-3092. doi: 10.1093/bioinformatics/btaa127.

LYRUS: a machine learning model for predicting the pathogenicity of missense variants.LYRUS：一种用于预测错义变异致病性的机器学习模型。

Bioinform Adv. 2021 Dec 25;2(1):vbab045. doi: 10.1093/bioadv/vbab045. eCollection 2022.

Pathogenicity Prediction of Single Amino Acid Variants With Machine Learning Model Based on Protein Structural Energies.基于蛋白质结构能量的机器学习模型对单氨基酸变体的致病性预测

IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):606-615. doi: 10.1109/TCBB.2021.3139048. Epub 2023 Feb 3.

AFFIPred: AlphaFold2 structure-based Functional Impact Prediction of missense variations.AFFIPred：基于AlphaFold2结构的错义变异功能影响预测

Protein Sci. 2025 Feb;34(2):e70030. doi: 10.1002/pro.70030.

Predicting the pathogenicity of missense variants using features derived from AlphaFold2.利用源自 AlphaFold2 的特征预测错义变异的致病性。

Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad280.

Comprehensive characterization of amino acid positions in protein structures reveals molecular effect of missense variants.全面描述蛋白质结构中氨基酸位置的特征，揭示错义变异的分子效应。

Proc Natl Acad Sci U S A. 2020 Nov 10;117(45):28201-28211. doi: 10.1073/pnas.2002660117. Epub 2020 Oct 26.

Novel gene-specific Bayesian Gaussian mixture model to predict the missense variants pathogenicity of Sanfilippo syndrome.新型基因特异性贝叶斯高斯混合模型预测黏多糖贮积症 III 型错义变异的致病性。

Sci Rep. 2024 May 27;14(1):12148. doi: 10.1038/s41598-024-62352-0.

Accuracy of a machine learning method based on structural and locational information from AlphaFold2 for predicting the pathogenicity of TARDBP and FUS gene variants in ALS.基于 AlphaFold2 的结构和位置信息的机器学习方法预测 ALS 中 TARDBP 和 FUS 基因突变致病性的准确性。

BMC Bioinformatics. 2023 May 19;24(1):206. doi: 10.1186/s12859-023-05338-5.

本文引用的文献

SIGMA leverages protein structural information to predict the pathogenicity of missense variants.SIGMA 利用蛋白质结构信息来预测错义变异的致病性。

Cell Rep Methods. 2024 Jan 22;4(1):100687. doi: 10.1016/j.crmeth.2023.100687. Epub 2024 Jan 10.

Missense3D-TM: Predicting the Effect of Missense Variants in Helical Transmembrane Protein Regions Using 3D Protein Structures.错义突变 3D-TM：利用 3D 蛋白质结构预测螺旋跨膜蛋白区域中错义变异的影响。

J Mol Biol. 2024 Jan 15;436(2):168374. doi: 10.1016/j.jmb.2023.168374. Epub 2023 Dec 7.

The molecular basis for cellular function of intrinsically disordered protein regions.无定形蛋白质区域的细胞功能的分子基础。

Nat Rev Mol Cell Biol. 2024 Mar;25(3):187-211. doi: 10.1038/s41580-023-00673-0. Epub 2023 Nov 13.

Accurate proteome-wide missense variant effect prediction with AlphaMissense.使用 AlphaMissense 进行精确的全蛋白质错义变异效应预测。

Science. 2023 Sep 22;381(6664):eadg7492. doi: 10.1126/science.adg7492.

Zero-shot mutation effect prediction on protein stability and function using RoseTTAFold.使用 RoseTTAFold 对蛋白质稳定性和功能的零-shot 突变效应预测。

Protein Sci. 2023 Nov;32(11):e4780. doi: 10.1002/pro.4780.

Genome-wide prediction of disease variant effects with a deep protein language model.利用深度蛋白质语言模型进行全基因组疾病变异效应预测。

Nat Genet. 2023 Sep;55(9):1512-1522. doi: 10.1038/s41588-023-01465-0. Epub 2023 Aug 10.

Predicting functional effect of missense variants using graph attention neural networks.使用图注意力神经网络预测错义变异的功能效应。

Nat Mach Intell. 2022 Nov;4(11):1017-1028. doi: 10.1038/s42256-022-00561-w. Epub 2022 Nov 15.

Missense3D-PPI: A Web Resource to Predict the Impact of Missense Variants at Protein Interfaces Using 3D Structural Data.错义突变 3D-PPI：一个利用 3D 结构数据预测蛋白质界面错义变异影响的网络资源。

J Mol Biol. 2023 Jul 15;435(14):168060. doi: 10.1016/j.jmb.2023.168060. Epub 2023 Mar 24.

Structure-based pathogenicity relationship identifier for predicting effects of single missense variants and discovery of higher-order cancer susceptibility clusters of mutations.基于结构的致病性关系识别器，用于预测单错义变异的影响，并发现更高阶的癌症易感性突变簇。

Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad206.

Structural Dynamics Predominantly Determine the Adaptability of Proteins to Amino Acid Deletions.结构动力学主要决定蛋白质对氨基酸缺失的适应能力。

Int J Mol Sci. 2023 May 8;24(9):8450. doi: 10.3390/ijms24098450.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

对具有……的致病性错义变体进行准确鉴定和机制评估。 （原文结尾不完整，翻译可能不太准确，需结合完整原文理解）

Accurate identification and mechanistic evaluation of pathogenic missense variants with .

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献

对具有……的致病性错义变体进行准确鉴定和机制评估。（原文结尾不完整，翻译可能不太准确，需结合完整原文理解）