基于图正则化矩阵分解的药物-靶点相互作用预测

Drug-Target Interaction Prediction with Graph Regularized Matrix Factorization.

作者信息

Ezzat Ali, Zhao Peilin, Wu Min, Li Xiao-Li, Kwoh Chee-Keong

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):646-656. doi: 10.1109/TCBB.2016.2530062. Epub 2016 Feb 15.

DOI:10.1109/TCBB.2016.2530062

Abstract

Experimental determination of drug-target interactions is expensive and time-consuming. Therefore, there is a continuous demand for more accurate predictions of interactions using computational techniques. Algorithms have been devised to infer novel interactions on a global scale where the input to these algorithms is a drug-target network (i.e., a bipartite graph where edges connect pairs of drugs and targets that are known to interact). However, these algorithms had difficulty predicting interactions involving new drugs or targets for which there are no known interactions (i.e., "orphan" nodes in the network). Since data usually lie on or near to low-dimensional non-linear manifolds, we propose two matrix factorization methods that use graph regularization in order to learn such manifolds. In addition, considering that many of the non-occurring edges in the network are actually unknown or missing cases, we developed a preprocessing step to enhance predictions in the "new drug" and "new target" cases by adding edges with intermediate interaction likelihood scores. In our cross validation experiments, our methods achieved better results than three other state-of-the-art methods in most cases. Finally, we simulated some "new drug" and "new target" cases and found that GRMF predicted the left-out interactions reasonably well.

摘要

药物 - 靶点相互作用的实验测定既昂贵又耗时。因此，人们一直需要使用计算技术更准确地预测相互作用。已经设计出算法来推断全球范围内的新相互作用，这些算法的输入是药物 - 靶点网络（即一个二分图，其中边连接已知相互作用的药物和靶点对）。然而，这些算法难以预测涉及没有已知相互作用的新药或靶点的相互作用（即网络中的“孤儿”节点）。由于数据通常位于低维非线性流形上或其附近，我们提出了两种使用图正则化的矩阵分解方法来学习此类流形。此外，考虑到网络中许多未出现的边实际上是未知或缺失的情况，我们开发了一个预处理步骤，通过添加具有中间相互作用似然分数的边来增强“新药”和“新靶点”情况下的预测。在我们的交叉验证实验中，在大多数情况下，我们的方法比其他三种最先进的方法取得了更好的结果。最后，我们模拟了一些“新药”和“新靶点”情况，发现GRMF能较好地预测遗漏的相互作用。