Xia Yuhao, Pu Yilin, Wang Suhui, Zhuang Jianan, Liu Dong, Hou Minghua, Zhang Guijun
College of Information Engineering, Zhejiang University of Technology, HangZhou 310023, China.
College of Information Engineering, Zhejiang University of Technology, HangZhou 310023, China.
J Mol Biol. 2025 Aug 1;437(15):169128. doi: 10.1016/j.jmb.2025.169128. Epub 2025 Apr 4.
Proteins often perform biological functions by forming complexes, thereby accurately predicting the structure of protein complexes is crucial to understanding and mastering their functions, as well as facilitating drug discovery. Protein monomeric structure prediction has made a breakthrough in recent years, but the accurate prediction of complex structure remains a challenge. In this work, we present DeepAssembly2, a web server for automatically assembling protein complex structure based on domain-domain interactions. First, the features are constructed according to the input complex sequence and monomeric structures, then these features are used to predict the inter-chain residue distance through a deep learning model, and finally, the complex structure is assembled under the guidance of inter-chain residue distances. Compared with the previously developed version, DeepAssembly2 is trained on a newly constructed inter-chain domain-domain interaction dataset. Meanwhile, several important features have been added, such as Interface Residue Propensity and Ultrafast Shape Recognition. In addition, we introduced the inter-chain residue distance from the AlphaFold-Multimer model to further improve the accuracy. Finally, we also integrate our recently developed model quality assessment method to select the output models. The performance of DeepAssembly2 is significantly improved compared with the previous version, and it is expected to provide new insights and an effective tool for drug development, vaccine design, etc. The web server of DeepAssembly2 is freely available at https://zhanglab-bioinf.com/DeepAssembly/.
蛋白质通常通过形成复合物来执行生物学功能,因此准确预测蛋白质复合物的结构对于理解和掌握其功能以及促进药物发现至关重要。蛋白质单体结构预测近年来取得了突破,但复合物结构的准确预测仍然是一个挑战。在这项工作中,我们展示了DeepAssembly2,一个基于结构域-结构域相互作用自动组装蛋白质复合物结构的网络服务器。首先,根据输入的复合物序列和单体结构构建特征,然后利用这些特征通过深度学习模型预测链间残基距离,最后在链间残基距离的指导下组装复合物结构。与之前开发的版本相比,DeepAssembly2是在新构建的链间结构域-结构域相互作用数据集上进行训练的。同时,添加了几个重要特征,如界面残基倾向和超快形状识别。此外,我们引入了来自AlphaFold-Multimer模型的链间残基距离以进一步提高准确性。最后,我们还整合了我们最近开发的模型质量评估方法来选择输出模型。与之前的版本相比,DeepAssembly2的性能有了显著提高,有望为药物开发、疫苗设计等提供新的见解和有效工具。DeepAssembly2的网络服务器可在https://zhanglab-bioinf.com/DeepAssembly/免费获取。