Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, P R China.
Int J Mol Sci. 2008 Oct;9(10):1961-76. doi: 10.3390/ijms9101961. Epub 2008 Oct 20.
QSAR (Quantitative Structure Activity Relationships) models for the prediction of human intestinal absorption (HIA) were built with molecular descriptors calculated by ADRIANA.Code, Cerius(2) and a combination of them. A dataset of 552 compounds covering a wide range of current drugs with experimental HIA values was investigated. A Genetic Algorithm feature selection method was applied to select proper descriptors. A Kohonen's self-organizing Neural Network (KohNN) map was used to split the whole dataset into a training set including 380 compounds and a test set consisting of 172 compounds. First, the six selected descriptors from ADRIANA.Code and the six selected descriptors from Cerius(2) were used as the input descriptors for building quantitative models using Partial Least Square (PLS) analysis and Support Vector Machine (SVM) Regression. Then, another two models were built based on nine descriptors selected by a combination of ADRIANA.Code and Cerius(2) descriptors using PLS and SVM, respectively. For the three SVM models, correlation coefficients (r) of 0.87, 0.89 and 0.88 were achieved; and standard deviations (s) of 10.98, 9.72 and 9.14 were obtained for the test set.
QSAR(定量构效关系)模型用于预测人类肠道吸收(HIA),使用 ADRIANA.Code、Cerius(2)和它们的组合计算的分子描述符进行构建。研究了涵盖广泛的当前药物的实验 HIA 值的 552 种化合物数据集。应用遗传算法特征选择方法选择合适的描述符。Kohonen 的自组织神经网络(KohNN)图用于将整个数据集分为包含 380 种化合物的训练集和由 172 种化合物组成的测试集。首先,使用偏最小二乘(PLS)分析和支持向量机(SVM)回归,将来自 ADRIANA.Code 的六个选定描述符和来自 Cerius(2)的六个选定描述符用作构建定量模型的输入描述符。然后,基于 ADRIANA.Code 和 Cerius(2)描述符组合选择的九个描述符,分别使用 PLS 和 SVM 构建另外两个模型。对于三个 SVM 模型,获得了 0.87、0.89 和 0.88 的相关系数(r);对于测试集,获得了 10.98、9.72 和 9.14 的标准偏差(s)。