College of Information Engineering, Zhejiang University of Technology, Hangzhou, China.
Proteins. 2023 Dec;91(12):1861-1870. doi: 10.1002/prot.26564. Epub 2023 Aug 8.
This article reports and analyzes the results of protein complex model accuracy estimation by our methods (DeepUMQA3 and GraphGPSM) in the 15th Critical Assessment of techniques for protein Structure Prediction (CASP15). The new deep learning-based multimeric complex model accuracy estimation methods are proposed based on the ensemble of three-level features coupling with deep residual/graph neural networks. For the input multimeric complex model, we describe it from three levels: overall complex features, intra-monomer features, and inter-monomer features. We designed an overall ultrafast shape recognition (USR) to characterize the relationship between local residues and the overall complex topology, and an inter-monomer USR to characterize the relationship between the residues of one monomer and the topology of other monomers. DeepUMQA3 (Group name: GuijunLab-RocketX) ranked first in the interface residue accuracy estimation of CASP15. The Pearson correlation between the interface residue Local Distance Difference Test (lDDT) predicted by DeepUMQA3 and the real lDDT is 0.570, the only method that exceeds 0.5. Among the top 5 methods, DeepUMQA3 achieved the highest Pearson correlation of lDDT on 25 out of 39 targets. GraphGPSM (Group name: GuijunLab-PAthreader) has TM-score Pearson correlations greater than 0.9 on 14 targets, showing a good ability to estimate the overall fold accuracy. The DeepUMQA3 server is available at http://zhanglab-bioinf.com/DeepUMQA/ and the GraphGPSM server is available at http://zhanglab-bioinf.com/GraphGPSM/.
本文报道并分析了我们的方法(DeepUMQA3 和 GraphGPSM)在第 15 届蛋白质结构预测技术评估(CASP15)中对蛋白质复合物模型精度估计的结果。新的基于深度学习的多聚体复合物模型精度估计方法是基于三级特征与深度残差/图神经网络的集成提出的。对于输入的多聚体复合物模型,我们从三个层面描述它:整体复合物特征、单体内部特征和单体间特征。我们设计了一个整体超快形状识别(USR)来描述局部残基与整体复合物拓扑之间的关系,以及一个单体间 USR 来描述一个单体的残基与其他单体拓扑之间的关系。DeepUMQA3(小组名称:GuijunLab-RocketX)在 CASP15 的界面残基精度估计中排名第一。DeepUMQA3 预测的界面残基局部距离差异测试(lDDT)与真实 lDDT 的 Pearson 相关系数为 0.570,是唯一超过 0.5 的方法。在排名前 5 的方法中,DeepUMQA3 在 39 个目标中的 25 个目标上实现了 lDDT 的最高 Pearson 相关系数。GraphGPSM(小组名称:GuijunLab-PAthreader)在 14 个目标上的 TM-score Pearson 相关系数大于 0.9,显示出很好的整体折叠精度估计能力。DeepUMQA3 服务器可在 http://zhanglab-bioinf.com/DeepUMQA/ 上访问,GraphGPSM 服务器可在 http://zhanglab-bioinf.com/GraphGPSM/ 上访问。