State Key Laboratory of Chemical Resource Engineering Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, P. R. China.
SAR QSAR Environ Res. 2021 May;32(5):411-431. doi: 10.1080/1062936X.2021.1910862. Epub 2021 Apr 26.
Leukotriene A4 hydrolase (LTA4H) is an important anti-inflammatory target which can convert leukotriene A4 (LTA4) into pro-inflammatory substance leukotriene B4 (LTB4). In this paper, we built 18 classification models for 463 LTA4H inhibitors by using support vector machine (SVM), random forest (RF) and K-Nearest Neighbour (KNN). The best classification model (Model 2A) was built from RF and MACCS fingerprints. The prediction accuracy of 88.96% and the Matthews correlation coefficient (MCC) of 0.74 had been achieved on the test set. We also divided the 463 LTA4H inhibitors into six subsets using K-Means. We found that the highly active LTA4H inhibitors mostly contained diphenylmethane or diphenyl ether as the scaffold and pyridine or piperidine as the side chain. In addition, six quantitative structure-activity relationship (QSAR) models for 172 LTA4H inhibitors were built by multiple linear regression (MLR) and SVM. The best QSAR model (Model 6A) was built by using SVM and CORINA Symphony descriptors. The coefficients of determination of the training set and the test set were equal to 0.81 and 0.79, respectively. Classification and QSAR models could be used for subsequent virtual screening, and the obtained fragments that were important for highly active inhibitors would be helpful for designing new LTA4H inhibitors.
白细胞三烯 A4 水解酶(LTA4H)是一种重要的抗炎靶点,它可以将白细胞三烯 A4(LTA4)转化为促炎物质白细胞三烯 B4(LTB4)。在本文中,我们使用支持向量机(SVM)、随机森林(RF)和 K-最近邻(KNN)为 463 种 LTA4H 抑制剂构建了 18 种分类模型。最佳分类模型(模型 2A)是基于 RF 和 MACCS 指纹构建的。在测试集上,预测准确率达到了 88.96%,马修斯相关系数(MCC)为 0.74。我们还使用 K-Means 将 463 种 LTA4H 抑制剂分为六个子集。我们发现,高活性的 LTA4H 抑制剂大多含有二苯甲烷或二苯醚作为支架,吡啶或哌啶作为侧链。此外,我们还通过多元线性回归(MLR)和 SVM 为 172 种 LTA4H 抑制剂构建了六个定量构效关系(QSAR)模型。最佳 QSAR 模型(模型 6A)是基于 SVM 和 CORINA Symphony 描述符构建的。训练集和测试集的决定系数分别为 0.81 和 0.79。分类和 QSAR 模型可用于后续的虚拟筛选,获得的对高活性抑制剂重要的片段将有助于设计新型 LTA4H 抑制剂。