Falcón-Cano Gabriela, Molina Christophe, Cabrera-Pérez Miguel Ángel
Unidad de Modelación y Experimentación Biofarmacéutica, Centro de Bioactivos Químicos, Universidad Central "Marta Abreu" de las Villas, Santa Clara 54830, Cuba.
PIKAÏROS, S.A., 31650 Saint-Orens-de-Gameville, France.
Pharmaceutics. 2022 Sep 21;14(10):1998. doi: 10.3390/pharmaceutics14101998.
The heterogeneity of the Caco-2 cell line and differences in experimental protocols for permeability assessment using this cell-based method have resulted in the high variability of Caco-2 permeability measurements. These problems have limited the generation of large datasets to develop accurate and applicable regression models. This study presents a QSPR approach developed on the KNIME analytical platform and based on a structurally diverse dataset of over 4900 molecules. Interpretable models were obtained using random forest supervised recursive algorithms for data cleaning and feature selection. The development of a conditional consensus model based on regional and global regression random forest produced models with RMSE values between 0.43-0.51 for all validation sets. The potential applicability of the model as a surrogate for the in vitro Caco-2 assay was demonstrated through blind prediction of 32 drugs recommended by the International Council for the Harmonization of Technical Requirements for Pharmaceuticals (ICH) for validation of in vitro permeability methods. The model was validated for the preliminary estimation of the BCS/BDDCS class. The KNIME workflow developed to automate new drug prediction is freely available. The results suggest that this automated prediction platform is a reliable tool for identifying the most promising compounds with high intestinal permeability during the early stages of drug discovery.
Caco-2细胞系的异质性以及使用这种基于细胞的方法进行渗透性评估的实验方案差异,导致了Caco-2渗透性测量的高变异性。这些问题限制了大型数据集的生成,从而难以开发准确且适用的回归模型。本研究提出了一种在KNIME分析平台上开发的定量构效关系(QSPR)方法,该方法基于一个包含4900多个分子的结构多样的数据集。使用随机森林监督递归算法进行数据清理和特征选择,从而获得了可解释的模型。基于区域和全局回归随机森林开发的条件共识模型,在所有验证集中产生的均方根误差(RMSE)值在0.43至0.51之间。通过对国际药品技术要求协调理事会(ICH)推荐用于体外渗透性方法验证的32种药物进行盲预测,证明了该模型作为体外Caco-2试验替代方法的潜在适用性。该模型针对生物药剂学分类系统(BCS)/生物药物处置分类系统(BDDCS)类别的初步估计进行了验证。为实现新药预测自动化而开发的KNIME工作流程可免费获取。结果表明,这个自动化预测平台是在药物发现早期阶段识别具有高肠道渗透性的最有前景化合物的可靠工具。