Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain.
Universidad Regional Amazónica IKIAM, Tena, Napo 150150, Ecuador.
Mol Pharm. 2022 Jul 4;19(7):2151-2163. doi: 10.1021/acs.molpharmaceut.2c00029. Epub 2022 Jun 7.
Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research.
抗菌药物 (AD) 改变细菌的代谢状态,导致细菌死亡。然而,抗生素耐药性和多药耐药菌的出现增加了对抗生素代谢网络 (MN) 突变和 AD-MN 相互作用的理解的兴趣。在这项研究中,我们在来自 ChEMBL 数据库的庞大数据集上使用了 IFPTML = 信息融合 (IF) + 扰动理论 (PT) + 机器学习 (ML) 算法,该数据集包含超过 155,000 种 AD 测定值和超过 40 种多种细菌的 MN。我们建立了一个线性判别分析 (LDA) 和 17 个以线性指标为中心的 ML 模型,并基于原子预测抗菌化合物。IFPTML-LDA 模型在训练子集中呈现出以下结果:特异性 (Sp) = 70,000 例中的 76%,敏感性 (Sn) = 70%,准确性 (Acc) = 73%。相同的模型在验证子集中也呈现出以下结果:Sp = 76%,Sn = 70%,Acc = 73.1%。在 IFPTML 非线性模型中,k 最近邻 (KNN) 显示出最佳结果,Sn = 99.2%,Sp = 95.5%,Acc = 97.4%,训练集的 AUROC = 0.998。在验证系列中,随机森林的结果最佳:Sn = 93.96%,Sp = 87.02%(AUROC = 0.945)。IFPTML 线性和非线性模型关于 AD-MN 的统计参数较好,它们可以有助于发现抗生素耐药性中的新代谢突变,并减少抗菌药物研究中的时间/成本。