Suppr超能文献

采用机器学习方法对白细胞三烯 A4 水解酶(LTA4H)抑制剂进行分类和定量构效关系模型研究。

Classification and QSAR models of leukotriene A4 hydrolase (LTA4H) inhibitors by machine learning methods.

机构信息

State Key Laboratory of Chemical Resource Engineering Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, P. R. China.

出版信息

SAR QSAR Environ Res. 2021 May;32(5):411-431. doi: 10.1080/1062936X.2021.1910862. Epub 2021 Apr 26.

Abstract

Leukotriene A4 hydrolase (LTA4H) is an important anti-inflammatory target which can convert leukotriene A4 (LTA4) into pro-inflammatory substance leukotriene B4 (LTB4). In this paper, we built 18 classification models for 463 LTA4H inhibitors by using support vector machine (SVM), random forest (RF) and K-Nearest Neighbour (KNN). The best classification model (Model 2A) was built from RF and MACCS fingerprints. The prediction accuracy of 88.96% and the Matthews correlation coefficient (MCC) of 0.74 had been achieved on the test set. We also divided the 463 LTA4H inhibitors into six subsets using K-Means. We found that the highly active LTA4H inhibitors mostly contained diphenylmethane or diphenyl ether as the scaffold and pyridine or piperidine as the side chain. In addition, six quantitative structure-activity relationship (QSAR) models for 172 LTA4H inhibitors were built by multiple linear regression (MLR) and SVM. The best QSAR model (Model 6A) was built by using SVM and CORINA Symphony descriptors. The coefficients of determination of the training set and the test set were equal to 0.81 and 0.79, respectively. Classification and QSAR models could be used for subsequent virtual screening, and the obtained fragments that were important for highly active inhibitors would be helpful for designing new LTA4H inhibitors.

摘要

白细胞三烯 A4 水解酶(LTA4H)是一种重要的抗炎靶点,它可以将白细胞三烯 A4(LTA4)转化为促炎物质白细胞三烯 B4(LTB4)。在本文中,我们使用支持向量机(SVM)、随机森林(RF)和 K-最近邻(KNN)为 463 种 LTA4H 抑制剂构建了 18 种分类模型。最佳分类模型(模型 2A)是基于 RF 和 MACCS 指纹构建的。在测试集上,预测准确率达到了 88.96%,马修斯相关系数(MCC)为 0.74。我们还使用 K-Means 将 463 种 LTA4H 抑制剂分为六个子集。我们发现,高活性的 LTA4H 抑制剂大多含有二苯甲烷或二苯醚作为支架,吡啶或哌啶作为侧链。此外,我们还通过多元线性回归(MLR)和 SVM 为 172 种 LTA4H 抑制剂构建了六个定量构效关系(QSAR)模型。最佳 QSAR 模型(模型 6A)是基于 SVM 和 CORINA Symphony 描述符构建的。训练集和测试集的决定系数分别为 0.81 和 0.79。分类和 QSAR 模型可用于后续的虚拟筛选,获得的对高活性抑制剂重要的片段将有助于设计新型 LTA4H 抑制剂。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验