Li Jiang, Tong Xin-Yu, Zhu Li-Da, Zhang Hong-Yu
Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China.
Front Genet. 2020 Aug 25;11:1000. doi: 10.3389/fgene.2020.01000. eCollection 2020.
Drug combination is now a hot research topic in the pharmaceutical industry, but experiment-based methodologies are extremely costly in time and money. Many computational methods have been proposed to address these problems by starting from existing drug combinations. However, in most cases, only molecular structure information is included, which covers too limited a set of drug characteristics to efficiently screen drug combinations. Here, we integrated similarity-based multifeature drug data to improve the prediction accuracy by using the neighbor recommender method combined with ensemble learning algorithms. By conducting feature assessment analysis, we selected the most useful drug features and achieved 0.964 AUC in the ensemble models. The comparison results showed that the ensemble models outperform traditional machine learning algorithms such as support vector machine (SVM), naïve Bayes (NB), and logistic regression (GLM). Furthermore, we predicted 7 candidate drug combinations for a specific drug, paclitaxel, and successfully verified that the two of the predicted combinations have promising effects.
药物组合目前是制药行业的一个热门研究课题,但基于实验的方法在时间和金钱上成本极高。已经提出了许多计算方法,通过从现有的药物组合入手来解决这些问题。然而,在大多数情况下,只包含了分子结构信息,所涵盖的药物特征集过于有限,无法有效地筛选药物组合。在此,我们整合了基于相似性的多特征药物数据,通过结合邻居推荐方法和集成学习算法来提高预测准确性。通过进行特征评估分析,我们选择了最有用的药物特征,并在集成模型中实现了0.964的曲线下面积(AUC)。比较结果表明,集成模型优于传统机器学习算法,如支持向量机(SVM)、朴素贝叶斯(NB)和逻辑回归(GLM)。此外,我们为特定药物紫杉醇预测了7种候选药物组合,并成功验证了其中两种预测组合具有良好的效果。