Fast and effective protein model refinement using deep graph neural networks.

Author information

Jing Xiaoyang, Xu Jinbo

Affiliation

Toyota Technological Institute at Chicago, Chicago, IL 60637, USA.

Publication information

Nat Comput Sci. 2021 Jul;1(7):462-469. doi: 10.1038/s43588-021-00098-9. Epub 2021 Jul 15.

DOI:10.1038/s43588-021-00098-9

PMID:35321360

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8939834/

Abstract

Protein model refinement is the last step applied to improve the quality of a predicted protein model. Currently, the most successful refinement methods rely on extensive conformational sampling and thus take hours or days to refine even a single protein model. Here we propose a fast and effective model refinement method that applies graph neural networks (GNNs) to predict a refined inter-atom distance probability distribution from an initial model and then rebuilds 3D models from the predicted distance distribution. Tested on the CASP (Critical Assessment of Structure Prediction) refinement targets, our method achieves accuracy comparable to that of two leading human groups, Feig and Baker, but runs substantially faster. Our method can refine one protein model within ~11 minutes on 1 CPU, whereas Baker needs ~30 hours on 60 CPUs and Feig needs ~16 hours on 1 GPU. Finally, our study shows that GNNs outperform ResNets (convolutional residual neural networks) for model refinement when only very limited conformational sampling is allowed.
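As a rough illustration of the idea in the abstract, the sketch below shows how a predicted distance probability distribution for one atom pair could be turned into a restraint that a model-building step might minimize. The bin layout (2-20 Å in 1 Å bins), the function names, and the negative-log-probability energy are assumptions chosen for illustration; the abstract does not state how the authors discretize distances or rebuild 3D models.

```python
import math

# Hypothetical discretization: 2-20 Angstrom in 1 Angstrom bins.
# This is an illustrative assumption, not a setting taken from the paper.
BIN_EDGES = [2.0 + i for i in range(19)]  # 19 edges define 18 bins
BIN_CENTERS = [(lo + hi) / 2 for lo, hi in zip(BIN_EDGES[:-1], BIN_EDGES[1:])]


def normalize(scores):
    """Softmax: turn raw per-bin scores into a probability distribution."""
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def expected_distance(probs):
    """Mean distance implied by the binned distribution."""
    return sum(p * c for p, c in zip(probs, BIN_CENTERS))


def restraint_energy(probs, distance, eps=1e-6):
    """Negative log-probability of the bin that contains `distance`.

    Distances outside the covered range are clamped to the nearest bin.
    """
    idx = int(distance - BIN_EDGES[0])  # bin width is 1 Angstrom
    idx = min(max(idx, 0), len(probs) - 1)
    return -math.log(probs[idx] + eps)


# Stand-in for a network output: scores peaked around 8.5 Angstrom.
scores = [-((c - 8.5) ** 2) / 2 for c in BIN_CENTERS]
probs = normalize(scores)

print(f"expected distance: {expected_distance(probs):.2f}")
print(f"energy at 8.5: {restraint_energy(probs, 8.5):.2f}")
print(f"energy at 15.0: {restraint_energy(probs, 15.0):.2f}")
```

In a sketch like this, a candidate 3D model whose atom-pair distances fall in high-probability bins receives low total energy, which is one plausible way a predicted distance distribution could guide model rebuilding without extensive conformational sampling.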
