Department of Chemistry, University of Calicut, Malappuram, 673635, India.
Department of Chemistry, University of Calicut, Malappuram, 673635, India.
Comput Biol Chem. 2018 Dec;77:154-166. doi: 10.1016/j.compbiolchem.2018.10.002. Epub 2018 Oct 5.
Developments of novel inhibitors to prevent the function of 5-lipoxygenase (5-LOX) proteins that are responsible for a variety of inflammatory and allergic disease are a major challenge in the scientific community. In this study, robust QSAR classification models for predicting 5-LOX activity were developed using machine learning algorithms. The Support Vector Machines (SVM), Logistic Regression, k-Nearest Neighbour (NN) and Decision Trees were adopted to improve the prediction ability of the classification models. The most informative molecular descriptors that contribute to the prediction of 5-LOX activity are screened from e-Dragon, Ochem, PowerMV and Combined databases using Filter-based feature selection methods such as Correlation Feature Selection (CFS) and Information Gain (IG). Performances of the models were measured by 5-fold cross-validation and external test sets prediction. Evaluation of performance of feature selection revealed that the CFS method outperforms the IG method for all descriptor databases except for PowerMV database. The best ensemble classification model was obtained with the IG filtered 'PowerMV' descriptor database using kNN (k = 5) algorithm which displayed an overall accuracy of 76.6% for the training set and 77.9% for the test set. Finally, we employed this model as a virtual screening tool for identifying potential 5-LOX inhibitors from the e-Drug3D drug database and found 43 potential hit candidates. This top screened hits containing one known 5-LOX inhibitors zileuton as well as novel scaffolds. These compounds further screened by applying molecular docking simulation and identified four potential hits such as Belinostat, Masoprocol, Mefloquine and Sitagliptin having a comparable binding affinity to zileuton.
开发新型抑制剂以防止负责多种炎症和过敏疾病的 5-脂氧合酶 (5-LOX) 蛋白的功能是科学界的主要挑战。在这项研究中,使用机器学习算法为预测 5-LOX 活性开发了强大的 QSAR 分类模型。支持向量机 (SVM)、逻辑回归、k-最近邻 (NN) 和决策树被采用以提高分类模型的预测能力。从 e-Dragon、Ochem、PowerMV 和 Combined 数据库中采用基于过滤的特征选择方法(如相关特征选择 (CFS) 和信息增益 (IG))从电子龙、Ochem、PowerMV 和联合数据库中筛选出对预测 5-LOX 活性贡献最大的信息分子描述符。使用 5 折交叉验证和外部测试集预测来衡量模型的性能。特征选择性能的评估表明,除了 PowerMV 数据库外,CFS 方法在所有描述符数据库中均优于 IG 方法。使用 kNN (k=5) 算法对具有 IG 过滤的“PowerMV”描述符数据库进行最佳集成分类模型,该模型在训练集和测试集上的整体准确性分别为 76.6%和 77.9%。最后,我们将该模型用作从 e-Drug3D 药物数据库中识别潜在 5-LOX 抑制剂的虚拟筛选工具,并发现了 43 个潜在的命中候选物。这些顶级筛选的命中物包含一种已知的 5-LOX 抑制剂齐留通以及新型支架。进一步对这些化合物进行分子对接模拟筛选,并确定了四种潜在的命中物,如贝林司他、美法仑、甲氟喹和西他列汀,它们与齐留通具有相当的结合亲和力。