School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
Nat Commun. 2023 Aug 15;14(1):4935. doi: 10.1038/s41467-023-40426-3.
Membrane proteins are encoded by approximately a quarter of human genes. Inter-chain residue-residue contact information is important for structure prediction of membrane protein complexes and valuable for understanding their molecular mechanism. Although many deep learning methods have been proposed to predict the intra-protein contacts or helix-helix interactions in membrane proteins, it is still challenging to accurately predict their inter-chain contacts due to the limited number of transmembrane proteins. Addressing the challenge, here we develop a deep transfer learning method for predicting inter-chain contacts of transmembrane protein complexes, named DeepTMP, by taking advantage of the knowledge pre-trained from a large data set of non-transmembrane proteins. DeepTMP utilizes a geometric triangle-aware module to capture the correct inter-chain interaction from the coevolution information generated by protein language models. DeepTMP is extensively evaluated on a test set of 52 self-associated transmembrane protein complexes, and compared with state-of-the-art methods including DeepHomo2.0, CDPred, GLINTER, DeepHomo, and DNCON2_Inter. It is shown that DeepTMP considerably improves the precision of inter-chain contact prediction and outperforms the existing approaches in both accuracy and robustness.
膜蛋白大约由人类基因的四分之一编码。链间残基残基接触信息对于膜蛋白复合物的结构预测很重要,并且对于理解它们的分子机制也很有价值。尽管已经提出了许多深度学习方法来预测膜蛋白中的蛋白质内接触或螺旋-螺旋相互作用,但由于跨膜蛋白的数量有限,准确预测它们的链间接触仍然具有挑战性。为了解决这一挑战,我们开发了一种深度迁移学习方法来预测跨膜蛋白复合物的链间接触,称为 DeepTMP,该方法利用从大量非跨膜蛋白数据集预训练的知识。DeepTMP 利用几何三角形感知模块从蛋白质语言模型生成的共进化信息中捕获正确的链间相互作用。在 52 个自我相关的跨膜蛋白复合物测试集上对 DeepTMP 进行了广泛评估,并与包括 DeepHomo2.0、CDPred、GLINTER、DeepHomo 和 DNCON2_Inter 在内的最先进的方法进行了比较。结果表明,DeepTMP 大大提高了链间接触预测的精度,在准确性和稳健性方面均优于现有方法。