Zhang Diya, Meng Qiaozhen, Guo Fei
School of Computer Science and Engineering, Central South University, Changsha 410000, China.
School of Computer Science, Xiangtan University, Xiangtan 411105, China.
Int J Mol Sci. 2024 Nov 26;25(23):12676. doi: 10.3390/ijms252312676.
In the binding process between proteins and ligand molecules, water molecules play a pivotal role by forming hydrogen bonds that enable proteins and ligand molecules to bind more strongly. However, current methodologies for predicting binding affinity overlook the importance of water molecules. Therefore, we developed a model called GraphWater-Net, specifically designed for predicting protein-ligand binding affinity, by incorporating water molecules. GraphWater-Net employs topological structures to represent protein atoms, ligand atoms and water molecules, and their interactions. Leveraging the Graphormer network, the model extracts interaction features between nodes within the topology, alongside the interaction features of edges and nodes. Subsequently, it generates embeddings with attention weights, inputs them into a Softmax function for regression prediction, and ultimately outputs the predicted binding affinity value. Experimental results on the Comparative Assessment of Scoring Functions (CASF) 2016 test set show that the introduction of water molecules into the complex significantly improves the prediction performance of the proposed model for protein and ligand binding affinity. Specifically, the Pearson correlation coefficient () exceeds that of current state-of-the-art methods by a margin of 0.022 to 0.129. By integrating water molecules, GraphWater-Net has the potential to facilitate the rational design of protein-ligand interactions and aid in drug discovery.
在蛋白质与配体分子的结合过程中,水分子通过形成氢键发挥关键作用,使蛋白质和配体分子能更紧密地结合。然而,当前预测结合亲和力的方法忽略了水分子的重要性。因此,我们开发了一种名为GraphWater-Net的模型,通过纳入水分子专门用于预测蛋白质-配体结合亲和力。GraphWater-Net采用拓扑结构来表示蛋白质原子、配体原子和水分子及其相互作用。该模型利用Graphormer网络提取拓扑结构内节点之间的相互作用特征以及边和节点的相互作用特征。随后,它生成带有注意力权重的嵌入,将其输入Softmax函数进行回归预测,最终输出预测的结合亲和力值。在2016年评分函数比较评估(CASF)测试集上的实验结果表明,将水分子引入复合物中显著提高了所提出模型对蛋白质和配体结合亲和力的预测性能。具体而言,皮尔逊相关系数()比当前最先进的方法高出0.022至0.129。通过整合水分子,GraphWater-Net有潜力促进蛋白质-配体相互作用的合理设计并助力药物发现。