Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom.
Proc Natl Acad Sci U S A. 2024 Aug 20;121(34):e2315005121. doi: 10.1073/pnas.2315005121. Epub 2024 Aug 12.
The process of protein phase separation into liquid condensates has been implicated in the formation of membraneless organelles (MLOs), which selectively concentrate biomolecules to perform essential cellular functions. Although the importance of this process in health and disease is increasingly recognized, the experimental identification of proteins forming MLOs remains a complex challenge. In this study, we addressed this problem by harnessing the power of AlphaFold2 to perform computational predictions of the conformational properties of proteins from their amino acid sequences. We thus developed the CoDropleT (co-condensation into droplet transformer) method of predicting the propensity of co-condensation of protein pairs. The method was trained by combining experimental datasets of co-condensing proteins from the CD-CODE database with curated negative datasets of non-co-condensing proteins. To illustrate the performance of the method, we applied it to estimate the propensity of proteins to co-condense into MLOs. Our results suggest that CoDropleT could facilitate functional and therapeutic studies on protein condensation by predicting the composition of protein condensates.
蛋白质相分离成液相凝聚物的过程与无膜细胞器(MLO)的形成有关,这些细胞器选择性地浓缩生物分子以执行基本的细胞功能。尽管这一过程在健康和疾病中的重要性日益被认识,但实验鉴定形成 MLO 的蛋白质仍然是一个复杂的挑战。在这项研究中,我们利用 AlphaFold2 的强大功能,通过从蛋白质的氨基酸序列预测其构象特性来解决这个问题。我们因此开发了 CoDropleT(共凝聚到滴转型)方法来预测蛋白质对的共凝聚倾向。该方法通过将 CD-CODE 数据库中具有共凝聚特性的蛋白质的实验数据集与经过精心筛选的非共凝聚蛋白质的负数据集相结合进行训练。为了说明该方法的性能,我们将其应用于估计蛋白质共凝聚形成 MLO 的倾向。我们的结果表明,CoDropleT 可以通过预测蛋白质凝聚物的组成,促进蛋白质凝聚的功能和治疗研究。