Wang Lei, You Zhu-Hong, Chen Xing, Yan Xin, Liu Gang, Zhang Wei
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 221116, China.
College of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277100, China.
Curr Protein Pept Sci. 2018;19(5):445-454. doi: 10.2174/1389203718666161114111656.
Identification of interaction between drugs and target proteins plays an important role in discovering new drug candidates. However, through the experimental method to identify the drug-target interactions remain to be extremely time-consuming, expensive and challenging even nowadays. Therefore, it is urgent to develop new computational methods to predict potential drugtarget interactions (DTI).
In this article, a novel computational model is developed for predicting potential drug-target interactions under the theory that each drug-target interaction pair can be represented by the structural properties from drugs and evolutionary information derived from proteins. Specifically, the protein sequences are encoded as Position-Specific Scoring Matrix (PSSM) descriptor which contains information of biological evolutionary and the drug molecules are encoded as fingerprint feature vector which represents the existence of certain functional groups or fragments.
Four benchmark datasets involving enzymes, ion channels, GPCRs and nuclear receptors, are independently used for establishing predictive models with Rotation Forest (RF) model. The proposed method achieved the prediction accuracy of 91.3%, 89.1%, 84.1% and 71.1% for four datasets respectively. In order to make our method more persuasive, we compared our classifier with the state-of-theart Support Vector Machine (SVM) classifier. We also compared the proposed method with other excellent methods.
Experimental results demonstrate that the proposed method is effective in the prediction of DTI, and can provide assistance for new drug research and development.
药物与靶蛋白之间相互作用的识别在发现新的候选药物中起着重要作用。然而,即使在当今,通过实验方法来识别药物 - 靶标相互作用仍然极其耗时、昂贵且具有挑战性。因此,迫切需要开发新的计算方法来预测潜在的药物 - 靶标相互作用(DTI)。
在本文中,基于每个药物 - 靶标相互作用对可以由药物的结构特性和蛋白质的进化信息来表示这一理论,开发了一种用于预测潜在药物 - 靶标相互作用的新型计算模型。具体而言,蛋白质序列被编码为包含生物进化信息的位置特异性评分矩阵(PSSM)描述符,药物分子被编码为代表某些官能团或片段存在的指纹特征向量。
四个涉及酶、离子通道、G蛋白偶联受体(GPCR)和核受体的基准数据集被独立用于使用旋转森林(RF)模型建立预测模型。所提出的方法对四个数据集的预测准确率分别达到了91.3%、89.1%、84.1%和71.1%。为了使我们的方法更具说服力,我们将我们的分类器与最先进的支持向量机(SVM)分类器进行了比较。我们还将所提出的方法与其他优秀方法进行了比较。
实验结果表明,所提出的方法在DTI预测中是有效的,并且可以为新药研发提供帮助。