School of Medical Information Engineering, Guangdong Pharmaceutical University, Guangzhou, China.
Guangdong Provincial Traditional Chinese Medicine Precision Medicine Big Data Engineering Technology Research Center, Guangzhou, China.
Curr Comput Aided Drug Des. 2024;20(6):1013-1024. doi: 10.2174/1573409919666230713142255.
In this study, we aimed to develop a new end-to-end learning model called Graph-Drug-Target Interaction (DTI), which integrates various types of information in the heterogeneous network data, and to explore automatic learning of the topology-maintaining representations of drugs and targets, thereby effectively contributing to the prediction of DTI. Precise predictions of DTI can guide drug discovery and development. Most machine learning algorithms integrate multiple data sources and combine them with common embedding methods. However, the relationship between the drugs and target proteins is not well reported. Although some existing studies have used heterogeneous network graphs for DTI prediction, there are many limitations in the neighborhood information between the nodes in the heterogeneous network graphs. We studied the drug-drug interaction (DDI) and DTI from DrugBank Version 3.0, protein-protein interaction (PPI) from the human protein reference database Release 9, drug structure similarity from Morgan fingerprints of radius 2 and calculated by RDKit, and protein sequence similarity from Smith-Waterman score.
Our study consists of three major components. First, various drugs and target proteins were integrated, and a heterogeneous network was established based on a series of data sets. Second, the graph neural networks-inspired graph auto-encoding method was used to extract high-order structural information from the heterogeneous networks, thereby revealing the description of nodes (drugs and proteins) and their topological neighbors. Finally, potential DTI prediction was made, and the obtained samples were sent to the classifier for secondary classification.
The performance of Graph-DTI and all baseline methods was evaluated using the sums of the area under the precision-recall curve (AUPR) and the area under the receiver operating characteristic curve (AUC). The results indicated that Graph-DTI outperformed the baseline methods in both performance results.
Compared with other baseline DTI prediction methods, the results showed that Graph-DTI had better prediction performance. Additionally, in this study, we effectively classified drugs corresponding to different targets and vice versa. The above findings showed that Graph-DTI provided a powerful tool for drug research, development, and repositioning. Graph- DTI can serve as a drug development and repositioning tool more effectively than previous studies that did not use heterogeneous network graph embedding.
在这项研究中,我们旨在开发一种新的端到端学习模型,称为图-药物-靶标相互作用(DTI),该模型整合了异构网络数据中的各种类型的信息,并探索药物和靶标的拓扑保持表示的自动学习,从而有效地促进 DTI 的预测。DTI 的准确预测可以指导药物发现和开发。大多数机器学习算法整合了多个数据源,并将它们与常见的嵌入方法相结合。然而,药物和靶标蛋白之间的关系并没有得到很好的报道。尽管一些现有研究已经使用异构网络图进行 DTI 预测,但在异构网络图的节点之间的邻域信息方面存在许多限制。我们研究了来自 DrugBank Version 3.0 的药物-药物相互作用(DDI)和 DTI、来自人类蛋白质参考数据库 Release 9 的蛋白质-蛋白质相互作用(PPI)、来自 Morgan 指纹半径为 2 的药物结构相似性和由 RDKit 计算、以及来自 Smith-Waterman 分数的蛋白质序列相似性。
我们的研究由三个主要部分组成。首先,整合各种药物和靶标蛋白,基于一系列数据集建立异构网络。其次,使用图神经网络启发的图自动编码方法从异构网络中提取高阶结构信息,从而揭示节点(药物和蛋白质)及其拓扑邻居的描述。最后,进行潜在的 DTI 预测,并将获得的样本发送到分类器进行二次分类。
使用精度-召回曲线下面积总和(AUPR)和接收者操作特征曲线下面积总和(AUC)评估 Graph-DTI 和所有基线方法的性能。结果表明,Graph-DTI 在性能结果上均优于基线方法。
与其他基线 DTI 预测方法相比,结果表明 Graph-DTI 具有更好的预测性能。此外,在这项研究中,我们有效地对不同靶标对应的药物和反之亦然进行了分类。上述发现表明,Graph-DTI 为药物研究、开发和重新定位提供了有力工具。Graph-DTI 比以前未使用异构网络图嵌入的研究更有效地作为药物开发和重新定位工具。