van Weesep Laura, Özçelik Rıza, Pennings Marloes, Criscuolo Emanuele, Ottmann Christian, Brunsveld Luc, Grisoni Francesca
Institute for Complex Molecular Systems (ICMS), Eindhoven University of Technology Eindhoven The Netherlands
Eindhoven AI Systems Institute (EAISI), Eindhoven University of Technology Eindhoven The Netherlands.
Digit Discov. 2025 Aug 8. doi: 10.1039/d5dd00132c.
Protein-protein interactions are at the heart of biological processes. Understanding how proteins interact is key for deciphering their roles in health and disease, and for therapeutic interventions. However, identifying protein interaction sites, especially for intrinsically disordered proteins, is challenging. Here, we developed a deep learning framework to predict potential protein binding sites to 14-3-3 - a 'central hub' protein holding a key role in cellular signaling networks. After systematically testing multiple deep learning approaches to predict sequence binding to 14-3-3, we developed an ensemble model that achieved a 75% balanced accuracy on external sequences. Our approach was applied prospectively to identify putative binding sites across medically relevant proteins (ranging from highly structured to intrinsically disordered) for a total of approximately 300 sequences. The top eight predicted peptide sequences were experimentally validated in the wet-lab, and binding to 14-3-3 was confirmed for five out of eight sequences ( ranging from 1.6 ± 0.1 μM to 70 ± 5 μM). The relevance of our results was further confirmed by X-ray crystallography and molecular dynamics simulations. These sequences represent potential new binding sites within the 14-3-3 interactome (, relating to Alzheimer's disease as the binding to tau is not the new part), and provide opportunities to investigate their functional relevance. Our results highlight the ability of deep learning to capture intricate patterns underlying protein-protein interactions, even for challenging cases like intrinsically disordered proteins. To further the understanding and targeting of 14-3-3/protein interactions, our model was provided as a freely accessible web resource at the following URL: https://14-3-3-bindsite.streamlit.app/.
蛋白质-蛋白质相互作用是生物过程的核心。了解蛋白质如何相互作用是解读它们在健康和疾病中的作用以及进行治疗干预的关键。然而,识别蛋白质相互作用位点,尤其是对于内在无序蛋白质来说,具有挑战性。在这里,我们开发了一个深度学习框架来预测与14-3-3的潜在蛋白质结合位点,14-3-3是一种在细胞信号网络中起关键作用的“中枢枢纽”蛋白质。在系统测试了多种预测与14-3-3序列结合的深度学习方法后,我们开发了一个集成模型,该模型在外部序列上实现了75%的平衡准确率。我们的方法被前瞻性地应用于识别总共约300个序列的医学相关蛋白质(从高度结构化到内在无序)中的假定结合位点。前八个预测的肽序列在湿实验室中进行了实验验证,八个序列中有五个被证实与14-3-3结合(范围从1.6±0.1μM到70±5μM)。X射线晶体学和分子动力学模拟进一步证实了我们结果的相关性。这些序列代表了14-3-3相互作用组内潜在的新结合位点(与阿尔茨海默病相关,因为与tau的结合不是新的部分),并为研究它们的功能相关性提供了机会。我们的结果突出了深度学习捕捉蛋白质-蛋白质相互作用背后复杂模式的能力,即使对于像内在无序蛋白质这样具有挑战性的情况也是如此。为了进一步理解和靶向14-3-3/蛋白质相互作用,我们的模型可通过以下网址免费获取:https://14-3-3-bindsite.streamlit.app/ 。