Shaikh Naeem, Sharma Mahesh, Garg Prabha
Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research (NIPER), S. A. S. Nagar, Punjab 160062, India.
Mol Biosyst. 2016 Mar;12(3):1006-14. doi: 10.1039/c5mb00650c.
Proteochemometric (PCM) methods, which use descriptors of both the interacting species, i.e. drug and the target, are being successfully employed for the prediction of drug-target interactions (DTI). However, unavailability of non-interacting dataset and determining the applicability domain (AD) of model are a main concern in PCM modeling. In the present study, traditional PCM modeling was improved by devising novel methodologies for reliable negative dataset generation and fingerprint based AD analysis. In addition, various types of descriptors and classifiers were evaluated for their performance. The Random Forest and Support Vector Machine models outperformed the other classifiers (accuracies >98% and >89% for 10-fold cross validation and external validation, respectively). The type of protein descriptors had negligible effect on the developed models, encouraging the use of sequence-based descriptors over the structure-based descriptors. To establish the practical utility of built models, targets were predicted for approved anticancer drugs of natural origin. The molecular recognition interactions between the predicted drug-target pair were quantified with the help of a reverse molecular docking approach. The majority of predicted targets are known for anticancer therapy. These results thus correlate well with anticancer potential of the selected drugs. Interestingly, out of all predicted DTIs, thirty were found to be reported in the ChEMBL database, further validating the adopted methodology. The outcome of this study suggests that the proposed approach, involving use of the improved PCM methodology and molecular docking, can be successfully employed to elucidate the intricate mode of action for drug molecules as well as repositioning them for new therapeutic applications.
蛋白质化学计量学(PCM)方法利用相互作用的两种物质(即药物和靶点)的描述符,已成功用于预测药物-靶点相互作用(DTI)。然而,非相互作用数据集的不可用以及确定模型的适用域(AD)是PCM建模中的主要问题。在本研究中,通过设计用于可靠生成阴性数据集和基于指纹的AD分析的新方法,对传统的PCM建模进行了改进。此外,还评估了各种类型的描述符和分类器的性能。随机森林和支持向量机模型的表现优于其他分类器(10折交叉验证和外部验证的准确率分别>98%和>89%)。蛋白质描述符的类型对所开发的模型影响可忽略不计,这鼓励使用基于序列的描述符而非基于结构的描述符。为了确定所构建模型的实际效用,对天然来源的已批准抗癌药物的靶点进行了预测。借助反向分子对接方法对预测的药物-靶点对之间的分子识别相互作用进行了量化。大多数预测的靶点在抗癌治疗中是已知的。因此,这些结果与所选药物的抗癌潜力密切相关。有趣的是,在所有预测的DTI中,有30个在ChEMBL数据库中被报道,进一步验证了所采用的方法。本研究的结果表明,所提出的方法,包括使用改进的PCM方法和分子对接,可以成功地用于阐明药物分子的复杂作用模式以及将它们重新定位用于新的治疗应用。