Marrero-Ponce Yovani, Iyarreta-Veitía Maité, Montero-Torres Alina, Romero-Zaldivar Carlos, Brandt Carlos A, Avila Priscilla E, Kirchgatter Karin, Machado Yanetsy
Department of Pharmacy, Faculty of Chemical Pharmacy and Department of Drug Design, Chemical Bioactive Center, Central University of Las Villas, Santa Clara, 54830 Villa Clara, Cuba.
J Chem Inf Model. 2005 Jul-Aug;45(4):1082-100. doi: 10.1021/ci050085t.
Malaria has been one of the most significant public health problems for centuries. It affects many tropical and subtropical regions of the world. The increasing resistance of Plasmodium spp. to existing therapies has heightened alarms about malaria in the international health community. Nowadays, there is a pressing need for identifying and developing new drug-based antimalarial therapies. In an effort to overcome this problem, the main purpose of this study is to develop simple linear discriminant-based quantitative structure-activity relationship (QSAR) models for the classification and prediction of antimalarial activity using some of the TOMOCOMD-CARDD (TOpological MOlecular COMputer Design-Computer Aided "Rational" Drug Design) fingerprints, so as to enable computational screening from virtual combinatorial datasets. In this sense, a database of 1562 organic chemicals having great structural variability, 597 of them antimalarial agents and 965 compounds having other clinical uses, was analyzed and presented as a helpful tool, not only for theoretical chemists but also for other researchers in this area. This series of compounds was processed by a k-means cluster analysis in order to design training and predicting sets. Afterward, two linear classification functions were derived in order to discriminate between antimalarial and nonantimalarial compounds. The models (including nonstochastic and stochastic indices) correctly classify more than 93% of the compound set, in both training and external prediction datasets. They showed high Matthews' correlation coefficients, 0.889 and 0.866 for the training set and 0.855 and 0.857 for the test one. The models' predictivity was also assessed and validated by the random removal of 10% of the compounds to form a new test set, for which predictions were made using the models. The overall means of the correct classification for this process (leave group 10% full-out cross validation) using the equations with nonstochastic and stochastic atom-based quadratic fingerprints were 93.93% and 92.77%, respectively. The quadratic maps-based TOMOCOMD-CARDD approach implemented in this work was successfully compared with four of the most useful models for antimalarials selection reported to date. The developed models were then used in a simulation of a virtual search for Ras FTase (FTase = farnesyltransferase) inhibitors with antimalarial activity; 70% and 100% of the 10 inhibitors used in this virtual search were correctly classified, showing the ability of the models to identify new lead antimalarials. Finally, these two QSAR models were used in the identification of previously unknown antimalarials. In this sense, three synthetic intermediaries of quinolinic compounds were evaluated as active/inactive ones using the developed models. The synthesis and biological evaluation of these chemicals against two malaria strains, using chloroquine as a reference, was performed. An accuracy of 100% with the theoretical predictions was observed. Compound 3 showed antimalarial activity, being the first report of an arylaminomethylenemalonate having such behavior. This result opens a door to a virtual study considering a higher variability of the structural core already evaluated, as well as of other chemicals not included in this study. We conclude that the approach described here seems to be a promising QSAR tool for the molecular discovery of novel classes of antimalarial drugs, which may meet the dual challenges posed by drug-resistant parasites and the rapid progression of malaria illnesses.
几个世纪以来,疟疾一直是最为严重的公共卫生问题之一。它影响着世界上许多热带和亚热带地区。疟原虫对现有疗法的耐药性不断增强,这在国际卫生界引发了对疟疾的高度警觉。如今,迫切需要识别和开发基于新药的抗疟疗法。为努力克服这一问题,本研究的主要目的是利用一些TOMOCOMD - CARDD(拓扑分子计算机设计 - 计算机辅助“理性”药物设计)指纹,开发基于简单线性判别分析的定量构效关系(QSAR)模型,用于抗疟活性的分类和预测,以便能够从虚拟组合数据集中进行计算筛选。从这个意义上讲,分析了一个包含1562种结构差异很大的有机化学品的数据库,其中597种是抗疟剂,965种化合物具有其他临床用途,并将其作为一个有用的工具呈现出来,这不仅对理论化学家,而且对该领域的其他研究人员都有帮助。对这一系列化合物进行k均值聚类分析,以设计训练集和预测集。之后,推导了两个线性分类函数,以区分抗疟化合物和非抗疟化合物。这些模型(包括非随机和随机指标)在训练集和外部预测数据集中,对超过93%的化合物集进行了正确分类。它们显示出较高的马修斯相关系数,训练集的系数为0.889和0.866,测试集的系数为0.855和0.857。通过随机去除10%的化合物以形成新的测试集,并使用模型进行预测,对模型的预测能力进行了评估和验证。使用基于非随机和随机原子的二次指纹方程进行此过程(留一法10%全出交叉验证)的正确分类总体均值分别为93.93%和92.77%。在这项工作中实施的基于二次映射的TOMOCOMD - CARDD方法成功地与迄今为止报道的四种最有用的抗疟药选择模型进行了比较。然后将开发出的模型用于模拟虚拟搜索具有抗疟活性的Ras法尼基转移酶(FTase = farnesyltransferase)抑制剂;在这次虚拟搜索中使用的10种抑制剂中有70%和100%被正确分类,表明这些模型能够识别新的抗疟先导化合物。最后,这两个QSAR模型被用于识别先前未知的抗疟药。从这个意义上讲,并使用开发出的模型评估了喹啉化合物的三种合成中间体是有活性/无活性的。以氯喹为参考,对这些化学品针对两种疟原虫菌株进行了合成和生物学评估。观察到理论预测的准确率为100%。化合物3显示出抗疟活性,这是关于具有这种行为的芳基氨基亚甲基丙二酸酯的首次报道。这一结果为虚拟研究打开了一扇门,该虚拟研究考虑了已评估的结构核心以及本研究未包括的其他化学品的更高变异性。我们得出结论,这里描述的方法似乎是一种有前景的QSAR工具,可用于分子发现新型抗疟药物,这可能应对耐药寄生虫和疟疾疾病快速发展带来的双重挑战。