School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China.
Department of Computer Science and Information Technology, La Trobe University, Melbourne 3083, Australia.
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab453.
Identifying proteins that interact with drugs plays an important role in the initial period of developing drugs, which helps to reduce the development cost and time. Recent methods for predicting drug-protein interactions mainly focus on exploiting various data about drugs and proteins. These methods failed to completely learn and integrate the attribute information of a pair of drug and protein nodes and their attribute distribution.
We present a new prediction method, GVDTI, to encode multiple pairwise representations, including attention-enhanced topological representation, attribute representation and attribute distribution. First, a framework based on graph convolutional autoencoder is constructed to learn attention-enhanced topological embedding that integrates the topology structure of a drug-protein network for each drug and protein nodes. The topological embeddings of each drug and each protein are then combined and fused by multi-layer convolution neural networks to obtain the pairwise topological representation, which reveals the hidden topological relationships between drug and protein nodes. The proposed attribute-wise attention mechanism learns and adjusts the importance of individual attribute in each topological embedding of drug and protein nodes. Secondly, a tri-layer heterogeneous network composed of drug, protein and disease nodes is created to associate the similarities, interactions and associations across the heterogeneous nodes. The attribute distribution of the drug-protein node pair is encoded by a variational autoencoder. The pairwise attribute representation is learned via a multi-layer convolutional neural network to deeply integrate the attributes of drug and protein nodes. Finally, the three pairwise representations are fused by convolutional and fully connected neural networks for drug-protein interaction prediction. The experimental results show that GVDTI outperformed other seven state-of-the-art methods in comparison. The improved recall rates indicate that GVDTI retrieved more actual drug-protein interactions in the top ranked candidates than conventional methods. Case studies on five drugs further confirm GVDTI's ability in discovering the potential candidate drug-related proteins.
zhang@hlju.edu.cn Supplementary information: Supplementary data are available at Briefings in Bioinformatics online.
鉴定与药物相互作用的蛋白质在药物开发的初始阶段起着重要作用,有助于降低开发成本和时间。最近用于预测药物-蛋白质相互作用的方法主要集中在利用关于药物和蛋白质的各种数据上。这些方法未能完全学习和整合一对药物和蛋白质节点及其属性分布的属性信息。
我们提出了一种新的预测方法 GVDTI,用于编码多种成对表示形式,包括注意力增强拓扑表示、属性表示和属性分布。首先,构建了一个基于图卷积自动编码器的框架,以学习注意力增强拓扑嵌入,该嵌入集成了药物-蛋白质网络中每个药物和蛋白质节点的拓扑结构。然后,通过多层卷积神经网络对每个药物和每个蛋白质的拓扑嵌入进行组合和融合,得到成对的拓扑表示,揭示了药物和蛋白质节点之间隐藏的拓扑关系。所提出的属性感知注意力机制学习并调整了药物和蛋白质节点的每个拓扑嵌入中各个属性的重要性。其次,创建了一个由药物、蛋白质和疾病节点组成的三层异构网络,以关联异构节点之间的相似性、相互作用和关联。药物-蛋白质节点对的属性分布由变分自动编码器编码。通过多层卷积神经网络学习成对属性表示,以深入整合药物和蛋白质节点的属性。最后,通过卷积和全连接神经网络融合这三种成对表示,进行药物-蛋白质相互作用预测。实验结果表明,GVDTI 在比较中优于其他七种最先进的方法。改进的召回率表明,GVDTI 在排名靠前的候选药物中检索到了更多实际的药物-蛋白质相互作用,优于传统方法。对五种药物的案例研究进一步证实了 GVDTI 发现潜在候选药物相关蛋白质的能力。
zhang@hlju.edu.cn 补充信息:补充资料可在Briefings in Bioinformatics 在线获取。