Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, 18 Shilongshan Road, Hangzhou, 310024, Zhejiang, China.
Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, 18 Shilongshan Road, Hangzhou, 310024, Zhejiang, China.
Commun Biol. 2023 Oct 28;6(1):1098. doi: 10.1038/s42003-023-05489-4.
AlphaFold is making great progress in protein structure prediction, not only for single-chain proteins but also for multi-chain protein complexes. When using AlphaFold-Multimer to predict protein‒protein complexes, we observed some unusual structures in which chains are looped around each other to form topologically intertwining links at the interface. Based on physical principles, such topological links should generally not exist in native protein complex structures unless covalent modifications of residues are involved. Although it is well known and has been well studied that protein structures may have topologically complex shapes such as knots and links, existing methods are hampered by the chain closure problem and show poor performance in identifying topologically linked structures in protein‒protein complexes. Therefore, we address the chain closure problem by using sliding windows from a local perspective and propose an algorithm to measure the topological-geometric features that can be used to identify topologically linked structures. An application of the method to AlphaFold-Multimer-predicted protein complex structures finds that approximately 1.72% of the predicted structures contain topological links. The method presented in this work will facilitate the computational study of protein‒protein interactions and help further improve the structural prediction of multi-chain protein complexes.
AlphaFold 在蛋白质结构预测方面取得了重大进展,不仅可以预测单链蛋白质,还可以预测多链蛋白质复合物。当使用 AlphaFold-Multimer 预测蛋白质-蛋白质复合物时,我们观察到一些不寻常的结构,其中链相互缠绕,在界面处形成拓扑交织的连接。基于物理原理,这种拓扑连接通常不应该存在于天然蛋白质复合物结构中,除非涉及残基的共价修饰。尽管众所周知,并且已经进行了很好的研究,蛋白质结构可能具有复杂的拓扑形状,如结和环,但现有的方法受到链闭合问题的限制,在识别蛋白质-蛋白质复合物中的拓扑连接结构方面表现不佳。因此,我们从局部角度使用滑动窗口来解决链闭合问题,并提出了一种算法来测量拓扑-几何特征,这些特征可用于识别拓扑连接结构。该方法应用于 AlphaFold-Multimer 预测的蛋白质复合物结构,发现约 1.72%的预测结构包含拓扑连接。本工作中提出的方法将有助于蛋白质-蛋白质相互作用的计算研究,并有助于进一步提高多链蛋白质复合物的结构预测。