School of Computer Science and Engineering, Nanyang Technological University, Singapore.
Institute for Infocomm Research (I(2)R), A*Star, Singapore.
Methods. 2017 Oct 1;129:81-88. doi: 10.1016/j.ymeth.2017.05.016. Epub 2017 May 24.
Experimental prediction of drug-target interactions is expensive, time-consuming and tedious. Fortunately, computational methods help narrow down the search space for interaction candidates to be further examined via wet-lab techniques. Nowadays, the number of attributes/features for drugs and targets, as well as the amount of their interactions, are increasing, making these computational methods inefficient or occasionally prohibitive. This motivates us to derive a reduced feature set for prediction. In addition, since ensemble learning techniques are widely used to improve the classification performance, it is also worthwhile to design an ensemble learning framework to enhance the performance for drug-target interaction prediction. In this paper, we propose a framework for drug-target interaction prediction leveraging both feature dimensionality reduction and ensemble learning. First, we conducted feature subspacing to inject diversity into the classifier ensemble. Second, we applied three different dimensionality reduction methods to the subspaced features. Third, we trained homogeneous base learners with the reduced features and then aggregated their scores to derive the final predictions. For base learners, we selected two classifiers, namely Decision Tree and Kernel Ridge Regression, resulting in two variants of ensemble models, EnsemDT and EnsemKRR, respectively. In our experiments, we utilized AUC (Area under ROC Curve) as an evaluation metric. We compared our proposed methods with various state-of-the-art methods under 5-fold cross validation. Experimental results showed EnsemKRR achieving the highest AUC (94.3%) for predicting drug-target interactions. In addition, dimensionality reduction helped improve the performance of EnsemDT. In conclusion, our proposed methods produced significant improvements for drug-target interaction prediction.
实验预测药物-靶标相互作用既昂贵又耗时,且乏味。幸运的是,计算方法有助于缩小候选相互作用的搜索空间,以便进一步通过湿实验室技术进行检查。如今,药物和靶标的属性/特征数量以及它们相互作用的数量都在增加,这使得这些计算方法效率低下,或者偶尔不切实际。这促使我们得出一个减少的特征集进行预测。此外,由于集成学习技术被广泛用于提高分类性能,因此设计一个集成学习框架来增强药物-靶标相互作用预测的性能也是值得的。在本文中,我们提出了一个利用特征降维和集成学习的药物-靶标相互作用预测框架。首先,我们进行了特征子空间划分,为分类器集成注入多样性。其次,我们将三种不同的降维方法应用于子空间特征。然后,我们使用减少的特征训练同质基础学习者,然后聚合他们的分数来得出最终的预测。对于基础学习者,我们选择了两种分类器,即决策树和核岭回归,从而分别产生了两种集成模型的变体,即 EnsemDT 和 EnsemKRR。在我们的实验中,我们使用 AUC(ROC 曲线下的面积)作为评估指标。我们在 5 折交叉验证下将我们提出的方法与各种最先进的方法进行了比较。实验结果表明,EnsemKRR 预测药物-靶标相互作用的 AUC(94.3%)最高。此外,降维有助于提高 EnsemDT 的性能。总之,我们提出的方法为药物-靶标相互作用预测带来了显著的改进。