Yap C W, Li Z R, Chen Y Z
Department of Computational Science, National University of Singapore, Blk SOC1, Level 7, 3 Science Drive 2, Singapore 117543, Singapore.
J Mol Graph Model. 2006 Mar;24(5):383-95. doi: 10.1016/j.jmgm.2005.10.004. Epub 2005 Nov 14.
Quantitative structure-pharmacokinetic relationships (QSPkR) have increasingly been used for the prediction of the pharmacokinetic properties of drug leads. Several QSPkR models have been developed to predict the total clearance (CL(tot)) of a compound. These models give good prediction accuracy but they are primarily based on a limited number of related compounds which are significantly lesser in number and diversity than the 503 compounds with known CL(tot) described in the literature. It is desirable to examine whether these and other statistical learning methods can be used for predicting the CL(tot) of a more diverse set of compounds. In this work, three statistical learning methods, general regression neural network (GRNN), support vector regression (SVR) and k-nearest neighbour (KNN) were explored for modeling the CL(tot) of all of the 503 known compounds. Six different sets of molecular descriptors, DS-MIXED, DS-3DMoRSE, DS-ATS, DS-GETAWAY, DS-RDF and DS-WHIM, were evaluated for their usefulness in the prediction of CL(tot). GRNN-, SVR- and KNN-developed models have average-fold errors in the range of 1.63 to 1.96, 1.66-1.95 and 1.90-2.23, respectively. For the best GRNN-, SVR- and KNN-developed models, the percentage of compounds with predicted CL(tot) within two-fold error of actual values are in the range of 61.9-74.3% and are comparable or slightly better than those of earlier studies. QSPkR models developed by using DS-MIXED, which is a collection of constitutional, geometrical, topological and electrotopological descriptors, generally give better prediction accuracies than those developed by using other descriptor sets. These results suggest that GRNN, SVR, and their consensus model are potentially useful for predicting QSPkR properties of drug leads.
定量构效关系(QSPkR)已越来越多地用于预测药物先导物的药代动力学性质。已经开发了几种QSPkR模型来预测化合物的总清除率(CL(tot))。这些模型具有良好的预测准确性,但它们主要基于数量有限的相关化合物,这些化合物的数量和多样性明显少于文献中描述的503种具有已知CL(tot)的化合物。研究这些以及其他统计学习方法是否可用于预测更多种类化合物的CL(tot)是很有必要的。在这项工作中,探索了三种统计学习方法,即广义回归神经网络(GRNN)、支持向量回归(SVR)和k近邻(KNN),用于对所有503种已知化合物的CL(tot)进行建模。评估了六组不同的分子描述符,即DS-MIXED、DS-3DMoRSE、DS-ATS、DS-GETAWAY、DS-RDF和DS-WHIM,它们在预测CL(tot)方面的有用性。GRNN、SVR和KNN开发的模型的平均倍数误差分别在1.63至1.96、1.66 - 1.95和1.90 - 2.23范围内。对于最佳的GRNN、SVR和KNN开发的模型,预测的CL(tot)在实际值两倍误差范围内的化合物百分比在61.9 - 74.3%之间,与早期研究的结果相当或略好。使用DS-MIXED(一种由结构、几何、拓扑和电子拓扑描述符组成的集合)开发的QSPkR模型通常比使用其他描述符集开发的模型具有更好的预测准确性。这些结果表明,GRNN、SVR及其共识模型在预测药物先导物的QSPkR性质方面可能是有用的。