Meng Fan-Rong, You Zhu-Hong, Chen Xing, Zhou Yong, An Ji-Yong
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 21116, China.
Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Urumqi 830011, China.
Molecules. 2017 Jul 5;22(7):1119. doi: 10.3390/molecules22071119.
Knowledge of drug-target interaction (DTI) plays an important role in discovering new drug candidates. Unfortunately, there are unavoidable shortcomings; including the time-consuming and expensive nature of the experimental method to predict DTI. Therefore, it motivates us to develop an effective computational method to predict DTI based on protein sequence. In the paper, we proposed a novel computational approach based on protein sequence, namely PDTPS (Predicting Drug Targets with Protein Sequence) to predict DTI. The PDTPS method combines Bi-gram probabilities (BIGP), Position Specific Scoring Matrix (PSSM), and Principal Component Analysis (PCA) with Relevance Vector Machine (RVM). In order to evaluate the prediction capacity of the PDTPS, the experiment was carried out on enzyme, ion channel, GPCR, and nuclear receptor datasets by using five-fold cross-validation tests. The proposed PDTPS method achieved average accuracy of 97.73%, 93.12%, 86.78%, and 87.78% on enzyme, ion channel, GPCR and nuclear receptor datasets, respectively. The experimental results showed that our method has good prediction performance. Furthermore, in order to further evaluate the prediction performance of the proposed PDTPS method, we compared it with the state-of-the-art support vector machine (SVM) classifier on enzyme and ion channel datasets, and other exiting methods on four datasets. The promising comparison results further demonstrate that the efficiency and robust of the proposed PDTPS method. This makes it a useful tool and suitable for predicting DTI, as well as other bioinformatics tasks.
药物-靶点相互作用(DTI)的知识在发现新的候选药物中起着重要作用。不幸的是,存在一些不可避免的缺点,包括预测DTI的实验方法耗时且昂贵。因此,这促使我们开发一种基于蛋白质序列的有效计算方法来预测DTI。在本文中,我们提出了一种基于蛋白质序列的新型计算方法,即PDTPS(利用蛋白质序列预测药物靶点)来预测DTI。PDTPS方法将双词概率(BIGP)、位置特异性评分矩阵(PSSM)和主成分分析(PCA)与相关向量机(RVM)相结合。为了评估PDTPS的预测能力,通过五折交叉验证测试在酶、离子通道、GPCR和核受体数据集上进行了实验。所提出的PDTPS方法在酶、离子通道、GPCR和核受体数据集上分别达到了97.73%、93.12%、86.78%和87.78%的平均准确率。实验结果表明我们的方法具有良好的预测性能。此外,为了进一步评估所提出的PDTPS方法的预测性能,我们将其与酶和离子通道数据集上的最新支持向量机(SVM)分类器以及四个数据集上的其他现有方法进行了比较。有前景的比较结果进一步证明了所提出的PDTPS方法的效率和鲁棒性。这使其成为一个有用的工具,适用于预测DTI以及其他生物信息学任务。