Xu Qian, Xiong Yi, Dai Hao, Kumari Kotni Meena, Xu Qin, Ou Hong-Yu, Wei Dong-Qing
State Key Laboratory of Microbial Metabolism, and College of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.
State Key Laboratory of Microbial Metabolism, and College of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.
J Theor Biol. 2017 Mar 21;417:1-7. doi: 10.1016/j.jtbi.2017.01.019. Epub 2017 Jan 16.
Combinatorial therapy is a promising strategy for combating complex diseases by improving the efficacy and reducing the side effects. To facilitate the identification of drug combinations in pharmacology, we proposed a new computational model, termed PDC-SGB, to predict effective drug combinations by integrating biological, chemical and pharmacological information based on a stochastic gradient boosting algorithm. To begin with, a set of 352 golden positive samples were collected from the public drug combination database. Then, a set of 732 dimensional feature vector involving biological, chemical and pharmaceutical information was constructed for each drug combination to describe its properties. To avoid overfitting, the maximum relevance & minimum redundancy (mRMR) method was performed to extract useful ones by removing redundant subsets. Based on the selected features, the three different type of classification algorithms were employed to build the drug combination prediction models. Our results demonstrated that the model based on the stochastic gradient boosting algorithm yield out the best performance. Furthermore, it is indicated that the feature patterns of therapy had powerful ability to discriminate effective drug combinations from non-effective ones. By analyzing various features, it is shown that the enriched features occurred frequently in golden positive samples can help predict novel drug combinations.
联合治疗是一种通过提高疗效和减少副作用来对抗复杂疾病的有前景的策略。为了便于在药理学中识别药物组合,我们提出了一种新的计算模型,称为PDC-SGB,通过基于随机梯度提升算法整合生物学、化学和药理学信息来预测有效的药物组合。首先,从公共药物组合数据库中收集了一组352个黄金阳性样本。然后,为每个药物组合构建了一组包含生物学、化学和药学信息的732维特征向量,以描述其性质。为了避免过拟合,采用最大相关最小冗余(mRMR)方法通过去除冗余子集来提取有用的特征。基于所选特征,采用三种不同类型的分类算法构建药物组合预测模型。我们的结果表明,基于随机梯度提升算法的模型表现最佳。此外,结果表明治疗的特征模式具有强大的能力来区分有效的药物组合和无效的药物组合。通过分析各种特征,结果表明在黄金阳性样本中频繁出现的富集特征有助于预测新的药物组合。