Liu H X, Hu R J, Zhang R S, Yao X J, Liu M C, Hu Z D, Fan B T
Department of Chemistry, Lanzhou University, Lanzhou 730000, P.R. China.
J Comput Aided Mol Des. 2005 Jan;19(1):33-46. doi: 10.1007/s10822-005-0095-8.
Support vector machine (SVM), as a novel machine learning technique, was used for the prediction of the human oral absorption for a large and diverse data set using the five descriptors calculated from the molecular structure alone. The molecular descriptors were selected by heuristic method (HM) implemented in CODESSA. At the same time, in order to show the influence of different molecular descriptors on absorption and to well understand the absorption mechanism, HM was used to build several multivariable linear models using different numbers of molecular descriptors. Both the linear and non-linear model can give satisfactory prediction results: the square of correlation coefficient R(2) was 0.78 and 0.86 for the training set, and 0.70 and 0.73 for the test set respectively. In addition, this paper provides a new and effective method for predicting the absorption of the drugs from their structures and gives some insight into structural features related to the absorption of the drugs.
支持向量机(SVM)作为一种新型机器学习技术,被用于依据仅从分子结构计算得出的五个描述符,对一个庞大且多样的数据集进行人体口服吸收预测。分子描述符通过CODESSA中实施的启发式方法(HM)进行选择。同时,为了展示不同分子描述符对吸收的影响并深入理解吸收机制,HM被用于构建使用不同数量分子描述符的多个多变量线性模型。线性和非线性模型均能给出令人满意的预测结果:训练集的相关系数R²分别为0.78和0.86,测试集的相关系数R²分别为0.70和0.73。此外,本文提供了一种从药物结构预测其吸收的新的有效方法,并对与药物吸收相关的结构特征给出了一些见解。