Suppr超能文献

使用深度图神经网络进行快速有效的蛋白质模型优化。

Fast and effective protein model refinement using deep graph neural networks.

作者信息

Jing Xiaoyang, Xu Jinbo

机构信息

Toyota Technological Institute at Chicago, Chicago, IL 60637, USA.

出版信息

Nat Comput Sci. 2021 Jul;1(7):462-469. doi: 10.1038/s43588-021-00098-9. Epub 2021 Jul 15.

Abstract

Protein model refinement is the last step applied to improve the quality of a predicted protein model. Currently the most successful refinement methods rely on extensive conformational sampling and thus, take hours or days to refine even a single protein model. Here we propose a fast and effective model refinement method that applies GNN (graph neural networks) to predict refined inter-atom distance probability distribution from an initial model and then rebuilds 3D models from the predicted distance distribution. Tested on the CASP (Critical Assessment of Structure Prediction) refinement targets, our method has comparable accuracy as two leading human groups Feig and Baker, but runs substantially faster. Our method may refine one protein model within ~11 minutes on 1 CPU while Baker needs ~30 hours on 60 CPUs and Feig needs ~16 hours on 1 GPU. Finally, our study shows that GNN outperforms ResNet (convolutional residual neural networks) for model refinement when very limited conformational sampling is allowed.

摘要

蛋白质模型优化是用于提高预测蛋白质模型质量的最后一步。目前,最成功的优化方法依赖于广泛的构象采样,因此,即使是优化单个蛋白质模型也需要数小时或数天时间。在此,我们提出了一种快速有效的模型优化方法,该方法应用图神经网络(GNN)从初始模型预测优化后的原子间距离概率分布,然后根据预测的距离分布重建三维模型。在蛋白质结构预测关键评估(CASP)优化目标上进行测试时,我们的方法与两个领先的人类团队Feig和Baker具有相当的准确性,但运行速度要快得多。我们的方法在1个中央处理器(CPU)上约11分钟内可优化一个蛋白质模型,而Baker团队在60个CPU上需要约30小时,Feig团队在1个图形处理器(GPU)上需要约16小时。最后,我们的研究表明,在允许非常有限的构象采样时,对于模型优化,图神经网络优于残差神经网络(ResNet)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b484/8939834/e089b02d26ad/nihms-1738480-f0004.jpg

相似文献

6
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

引用本文的文献

6
Recent advances and challenges in protein complex model accuracy estimation.蛋白质复合物模型准确性评估的最新进展与挑战
Comput Struct Biotechnol J. 2024 Apr 21;23:1824-1832. doi: 10.1016/j.csbj.2024.04.049. eCollection 2024 Dec.
9
Low-Data Drug Design with Few-Shot Generative Domain Adaptation.基于少样本生成域适应的低数据药物设计
Bioengineering (Basel). 2023 Sep 21;10(9):1104. doi: 10.3390/bioengineering10091104.

本文引用的文献

7
Improved protein structure prediction using predicted interresidue orientations.利用预测的残基间取向改进蛋白质结构预测。
Proc Natl Acad Sci U S A. 2020 Jan 21;117(3):1496-1503. doi: 10.1073/pnas.1914677117. Epub 2020 Jan 2.
9
Distance-based protein folding powered by deep learning.基于深度学习的距离相关蛋白质折叠。
Proc Natl Acad Sci U S A. 2019 Aug 20;116(34):16856-16865. doi: 10.1073/pnas.1821309116. Epub 2019 Aug 9.
10
Evaluation of model refinement in CASP13.CASP13 模型优化评估
Proteins. 2019 Dec;87(12):1249-1262. doi: 10.1002/prot.25794. Epub 2019 Aug 20.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验