Suppr超能文献

基于深度学习方法的蛋白质复合物物理感知模型准确性估计

Physical-aware model accuracy estimation for protein complex using deep learning method.

作者信息

Wang Haodong, Sun Meng, Xie Lei, Liu Dong, Zhang Guijun

机构信息

College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China.

出版信息

Comput Struct Biotechnol J. 2025 Jan 22;27:478-487. doi: 10.1016/j.csbj.2025.01.017. eCollection 2025.

Abstract

With the breakthrough of AlphaFold2 on monomers, the research focus of structure prediction has shifted to protein complexes, driving the continued development of new methods for multimer structure prediction. Therefore, it is crucial to accurately estimate quality scores for the multimer model independent of the used prediction methods. In this work, we propose a physical-aware deep learning method, DeepUMQA-PA, to evaluate the residue-wise quality of protein complex models. Given the input protein complex model, the residue-based contact area and orientation features were first constructed using Voronoi tessellation, representing the potential physical interactions and hydrophobic properties. Then, the relationship between local residues and the overall complex topology as well as the inter-residue evolutionary information are characterized by geometry-based features, protein language model embedding representation, and knowledge-based statistical potential features. Finally, these features are fed into a fused network architecture employing equivalent graph neural network and ResNet network to estimate residue-wise model accuracy. Experimental results on the CASP15 test set demonstrate that our method outperforms the state-of-the-art method DeepUMQA3 by 3.69 % and 3.49 % on Pearson and Spearman, respectively. Notably, our method achieved 16.8 % and 15.5 % improvement in Pearson and Spearman, respectively, for the evaluation of nanobody-antigens. In addition, DeepUMQA-PA achieved better MAE scores than AlphaFold-Multimer and AlphaFold3 self-assessment methods on 43 % and 50 % of the targets, respectively. All these results suggest that physical-aware information based on the area and orientation of atom-atom and atom-solvent contacts has the potential to capture sequence-structure-quality relationships of proteins, especially in the case of flexible proteins. The DeepUMQA-PA server is freely available at http://zhanglab-bioinf.com/DeepUMQA-PA/.

摘要

随着AlphaFold2在单体结构预测上取得突破,结构预测的研究重点已转向蛋白质复合物,推动了多聚体结构预测新方法的持续发展。因此,独立于所使用的预测方法准确估计多聚体模型的质量分数至关重要。在这项工作中,我们提出了一种基于物理感知的深度学习方法DeepUMQA-PA,用于评估蛋白质复合物模型中每个残基的质量。给定输入的蛋白质复合物模型,首先使用Voronoi镶嵌构建基于残基的接触面积和方向特征,以表示潜在的物理相互作用和疏水特性。然后,通过基于几何的特征、蛋白质语言模型嵌入表示和基于知识的统计势特征来表征局部残基与整体复合物拓扑结构之间的关系以及残基间的进化信息。最后,将这些特征输入到一个融合网络架构中,该架构采用等效图神经网络和ResNet网络来估计每个残基的模型准确性。在CASP15测试集上的实验结果表明,我们的方法在Pearson和Spearman指标上分别比当前最优方法DeepUMQA3高出3.69%和3.49%。值得注意的是,在评估纳米抗体-抗原时,我们的方法在Pearson和Spearman指标上分别提高了16.8%和15.5%。此外,在43%和50%的目标上,DeepUMQA-PA分别比AlphaFold-Multimer和AlphaFold3的自我评估方法取得了更好的平均绝对误差(MAE)分数。所有这些结果表明,基于原子-原子和原子-溶剂接触面积和方向的物理感知信息有潜力捕捉蛋白质的序列-结构-质量关系,特别是在柔性蛋白质的情况下。DeepUMQA-PA服务器可在http://zhanglab-bioinf.com/DeepUMQA-PA/免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ac0/11799971/c5b613806889/ga1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验