Di Filippo Juan I, Bollini Mariela, Cavasotto Claudio N
Computational Drug Design and Biomedical Informatics Laboratory, Instituto de Investigaciones en Medicina Traslacional (IIMT), CONICET-Universidad Austral, Pilar, Argentina.
Facultad de Ciencias Biomédicas and Facultad de Ingeniería, Universidad Austral, Pilar, Argentina.
Front Chem. 2021 Jul 20;9:714678. doi: 10.3389/fchem.2021.714678. eCollection 2021.
The development of computational models for assessing the transfer of chemicals across the placental membrane would be of the utmost importance in drug discovery campaigns, in order to develop safe therapeutic options. We have developed a low-dimensional machine learning model capable of classifying compounds according to whether they can cross or not the placental barrier. To this aim, we compiled a database of 248 compounds with experimental information about their placental transfer, characterizing each compound with a set of ∼5.4 thousand descriptors, including physicochemical properties and structural features. We evaluated different machine learning classifiers and implemented a genetic algorithm, in a five cross validation scheme, to perform feature selection. The optimization was guided towards models displaying a low number of false positives (molecules that actually cross the placental barrier, but are predicted as not crossing it). A Linear Discriminant Analysis model trained with only four structural features resulted to be robust for this task, exhibiting only one false positive case across all testing folds. This model is expected to be useful in predicting placental drug transfer during pregnancy, and thus could be used as a filter for chemical libraries in virtual screening campaigns.
在药物研发过程中,开发用于评估化学物质跨胎盘膜转运的计算模型对于开发安全的治疗方案至关重要。我们开发了一种低维机器学习模型,该模型能够根据化合物是否能够穿过胎盘屏障对其进行分类。为此,我们编制了一个包含248种化合物的数据库,这些化合物具有关于其胎盘转运的实验信息,并用一组约5400个描述符对每种化合物进行表征,包括物理化学性质和结构特征。我们评估了不同的机器学习分类器,并在五重交叉验证方案中实施了遗传算法以进行特征选择。优化的目标是使模型显示出低数量的假阳性(即实际上穿过胎盘屏障但被预测为未穿过的分子)。仅用四个结构特征训练的线性判别分析模型对于这项任务具有鲁棒性,在所有测试折叠中仅出现一例假阳性情况。该模型有望用于预测孕期胎盘药物转运,因此可在虚拟筛选活动中用作化学文库的过滤器。