Lu Jing, Lu Dong, Zhang Xiaochen, Bi Yi, Cheng Keguang, Zheng Mingyue, Luo Xiaomin
School of Pharmacy, Key Laboratory of Molecular Pharmacology and Drug Evaluation (Yantai University), Ministry of Education, Collaborative Innovation Center of Advanced Drug Delivery System and Biotech Drugs in Universities of Shandong, Yantai University, 32 Qingquan Road, Yantai 264005, PR China.
Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, PR China; Stake Key Laboratory of Natural and Biomimetic Drugs, Peking University, 38 Xueyuan Road, Beijing 100191, PR China; University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, PR China.
Biochim Biophys Acta. 2016 Nov;1860(11 Pt B):2664-71. doi: 10.1016/j.bbagen.2016.05.019. Epub 2016 May 20.
Elimination half-life is an important pharmacokinetic parameter that determines exposure duration to approach steady state of drugs and regulates drug administration. The experimental evaluation of half-life is time-consuming and costly. Thus, it is attractive to build an accurate prediction model for half-life.
In this study, several machine learning methods, including gradient boosting machine (GBM), support vector regressions (RBF-SVR and Linear-SVR), local lazy regression (LLR), SA, SR, and GP, were employed to build high-quality prediction models. Two strategies of building consensus models were explored to improve the accuracy of prediction. Moreover, the applicability domains (ADs) of the models were determined by using the distance-based threshold.
Among seven individual models, GBM showed the best performance (R(2)=0.820 and RMSE=0.555 for the test set), and Linear-SVR produced the inferior prediction accuracy (R(2)=0.738 and RMSE=0.672). The use of distance-based ADs effectively determined the scope of QSAR models. However, the consensus models by combing the individual models could not improve the prediction performance. Some essential descriptors relevant to half-life were identified and analyzed.
An accurate prediction model for elimination half-life was built by GBM, which was superior to the reference model (R(2)=0.723 and RMSE=0.698).
Encouraged by the promising results, we expect that the GBM model for elimination half-life would have potential applications for the early pharmacokinetic evaluations, and provide guidance for designing drug candidates with favorable in vivo exposure profile. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang.
消除半衰期是一个重要的药代动力学参数,它决定了药物达到稳态的暴露持续时间并调节药物给药。半衰期的实验评估既耗时又昂贵。因此,建立一个准确的半衰期预测模型很有吸引力。
在本研究中,采用了几种机器学习方法,包括梯度提升机(GBM)、支持向量回归(RBF-SVR和线性SVR)、局部懒惰回归(LLR)、SA、SR和GP,来建立高质量的预测模型。探索了两种构建共识模型的策略以提高预测准确性。此外,通过使用基于距离的阈值来确定模型的适用域(ADs)。
在七个单独模型中,GBM表现最佳(测试集的R(2)=0.820,RMSE=0.555),而线性SVR的预测准确性较差(R(2)=0.738,RMSE=0.672)。基于距离的ADs的使用有效地确定了QSAR模型的范围。然而,通过组合单个模型得到的共识模型并不能提高预测性能。识别并分析了一些与半衰期相关的重要描述符。
GBM建立了一个准确的消除半衰期预测模型,该模型优于参考模型(R(2)=0.723,RMSE=0.698)。
受这些有前景的结果鼓舞,我们期望消除半衰期的GBM模型在早期药代动力学评估中具有潜在应用,并为设计具有良好体内暴露特征的候选药物提供指导。本文是名为“系统遗传学”的特刊的一部分 客座编辑:蔡宇东博士和黄涛博士。