Dipalma Alessandro, Fontanesi Michele, Micheli Alessio, Milazzo Paolo, Podda Marco
Department of Computer Science, University of Pisa, Largo Bruno Pontecorvo, 3, 56125, Pisa, PI, Italy.
BMC Bioinformatics. 2025 May 8;26(1):124. doi: 10.1186/s12859-025-06140-1.
Protein-protein interaction networks (PPINs) provide a comprehensive view of the intricate biochemical processes that take place in living organisms. In recent years, the size and information content of PPINs have grown thanks to techniques that allow for the functional association of proteins. However, PPINs are static objects that cannot fully describe the dynamics of the protein interactions; these dynamics are usually studied from external sources and can only be added to the PPIN as annotations. In contrast, the time-dependent characteristics of cellular processes are described in Biochemical Pathways (BP), which frame complex networks of chemical reactions as dynamical systems. Their analysis with numerical simulations allows for the study of different dynamical properties. Unfortunately, available BPs cover only a small portion of the interactome, and simulations are often hampered by the unavailability of kinetic parameters or by their computational cost. In this study, we explore the possibility of enriching PPINs with dynamical properties computed from BPs. We focus on the global dynamical property of sensitivity, which measures how a change in the concentration of an input molecular species influences the concentration of an output molecular species at the steady state of the dynamical system.
We started with the analysis of BPs via ODE simulations, which enabled us to compute the sensitivity associated with multiple pairs of chemical species. The sensitivity information was then injected into a PPIN, using public ontologies (BioGRID, UniPROT) to map entities at the BP level with nodes at the PPIN level. The resulting annotated PPIN, termed the DyPPIN (Dynamics of PPIN) dataset, was used to train a DGN to predict the sensitivity relationships among PPIN proteins. Our experimental results show that this model can predict these relationships effectively under different use case scenarios. Furthermore, we show that the PPIN structure (i.e., the way the PPIN is "wired") is essential to infer the sensitivity, and that further annotating the PPIN nodes with protein sequence embeddings improves the predictive accuracy.
To the best of our knowledge, the model proposed in this study is the first that allows performing sensitivity analysis directly on PPINs. Our findings suggest that, despite the high level of abstraction, the structure of the PPIN holds enough information to infer dynamic properties without needing an exact model of the underlying processes. In addition, the designed pipeline is flexible and can be easily integrated into drug design, repurposing, and personalized medicine processes.
蛋白质-蛋白质相互作用网络(PPINs)全面展示了生物体中发生的复杂生化过程。近年来,由于能够实现蛋白质功能关联的技术,PPINs的规模和信息含量不断增长。然而,PPINs是静态对象,无法完全描述蛋白质相互作用的动态过程;这些动态过程通常从外部来源进行研究,并且只能作为注释添加到PPIN中。相比之下,生化途径(BP)描述了细胞过程的时间依赖性特征,它将复杂的化学反应网络构建为动态系统。通过数值模拟对其进行分析,可以研究不同的动态特性。不幸的是,现有的BP仅涵盖了相互作用组的一小部分,并且模拟常常因动力学参数不可用或计算成本过高而受到阻碍。在本研究中,我们探索了用从BP计算得到的动态特性丰富PPINs的可能性。我们关注敏感性这一全局动态特性,它衡量了在动态系统稳态下输入分子物种浓度的变化如何影响输出分子物种的浓度。
我们首先通过常微分方程(ODE)模拟对BP进行分析,这使我们能够计算与多对化学物种相关的敏感性。然后,利用公共本体(BioGRID、UniPROT)将BP层面的实体与PPIN层面的节点进行映射,将敏感性信息注入到PPIN中。由此产生的带注释的PPIN,即DyPPIN(PPIN的动态特性)数据集,用于训练一个深度生成网络(DGN)来预测PPIN蛋白质之间的敏感性关系。我们的实验结果表明,该模型在不同的用例场景下能够有效地预测这些关系。此外,我们表明PPIN结构(即PPIN的“连接”方式)对于推断敏感性至关重要,并且用蛋白质序列嵌入进一步注释PPIN节点可以提高预测准确性。
据我们所知,本研究中提出的模型是首个能够直接对PPINs进行敏感性分析的模型。我们的研究结果表明,尽管抽象程度较高,但PPIN的结构包含了足够的信息来推断动态特性,而无需底层过程的精确模型。此外,所设计的流程具有灵活性,可以轻松地集成到药物设计、药物再利用和个性化医疗过程中。