Yamashita Fumiyoshi, Wanchana Suchada, Hashida Mitsuru
Department of Drug Delivery Research, Graduate School of Pharmaceutical Sciences, Kyoto University, Yoshidashimoadachi-cho, Sakyo-ku, Kyoto 606-8501, Japan.
J Pharm Sci. 2002 Oct;91(10):2230-9. doi: 10.1002/jps.10214.
Caco-2 cell monolayers are widely used systems for predicting human intestinal absorption. This study was carried out to develop a quantitative structure-property relationship (QSPR) model of Caco-2 permeability using a novel genetic algorithm-based partial least squares (GA-PLS) method. The Caco-2 permeability data for 73 compounds were taken from the literature. Molconn-Z descriptors of these compounds were calculated as molecular descriptors, and the optimal subset of the descriptors was explored by GA-PLS analysis. A fitness function considering both goodness-of-fit to the training data and predictability of the testing data was adopted throughout the genetic algorithm-driven optimization procedure. The final PLS model consisting of 24 descriptors gave a correlation coefficient (r) of 0.886 for the entire dataset and a predictive correlation coefficient (r(pred)) of 0.825 that was evaluated by a leave-some-out cross-validation procedure. Thus, the GA-PLS analysis proved to be a reasonable QSPR modeling approach for predicting Caco-2 permeability.
Caco-2细胞单层是广泛用于预测人体肠道吸收的系统。本研究旨在使用一种基于新型遗传算法的偏最小二乘法(GA-PLS)开发Caco-2通透性的定量结构-性质关系(QSPR)模型。73种化合物的Caco-2通透性数据取自文献。计算这些化合物的Molconn-Z描述符作为分子描述符,并通过GA-PLS分析探索描述符的最佳子集。在整个遗传算法驱动的优化过程中,采用了一种兼顾训练数据拟合优度和测试数据可预测性的适应度函数。由24个描述符组成的最终PLS模型对整个数据集的相关系数(r)为0.886,通过留一法交叉验证程序评估的预测相关系数(r(pred))为0.825。因此,GA-PLS分析被证明是预测Caco-2通透性的一种合理的QSPR建模方法。