• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过机器学习算法预测微粒体前列腺素E合酶-1抑制剂的生物活性

Prediction of bioactivities of microsomal prostaglandin E synthase-1 inhibitors by machine learning algorithms.

作者信息

Tian Yujia, Yang Zhenwu, Wang Hongzhao, Yan Aixia

机构信息

Department of Pharmaceutical Engineering, State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing, People's Republic of China.

出版信息

Chem Biol Drug Des. 2023 Jun;101(6):1307-1321. doi: 10.1111/cbdd.14214. Epub 2023 Feb 20.

DOI:10.1111/cbdd.14214
PMID:36752697
Abstract

There is a strong interest in the development of microsomal prostaglandin E2 synthase-1 (mPGES-1) inhibitors of their potential to safely and effectively treat inflammation. Herein, 70 QSAR models were built on the dataset (735 mPGES-1 inhibitors) characterized with RDKit descriptors by multiple linear regression (MLR), support vector machine (SVM), random forest (RF), deep neural networks (DNN), and eXtreme Gradient Boosting (XGBoost). The other three regression models on the dataset are represented by SMILES using self-attention recurrent neural networks (RNN) and Graph Convolutional Networks (GCN). For the best model (Model C2), which was developed by SVM with RDKit descriptors, the coefficient of determination (R ) of 0.861 and root mean squared error (RMSE) of 0.235 were achieved for the test set. Additionally, R of 0.692 and RMSE of 0.383 were obtained on the external test set. We investigated the applicability domain (AD) of Model C2 with the rivality index (RI), the prediction of Model C2 on 78.92% of molecules in the test set, and 78.33% of molecules in the external test set were reliable. After dissecting the RDKit descriptors of Model C2, we found important physicochemical properties of highly active mPGES-1 inhibitors. Besides, by analyzing the attention weight of each atom of each inhibitor from the attention layer, we found that the benzamide group and the trifluoromethyl cyclohexane group are favorable substructures for mPGES-1 inhibitors.

摘要

人们对开发微粒体前列腺素E2合酶-1(mPGES-1)抑制剂以安全有效地治疗炎症的潜力有着浓厚兴趣。在此,基于由RDKit描述符表征的数据集(735种mPGES-1抑制剂),通过多元线性回归(MLR)、支持向量机(SVM)、随机森林(RF)、深度神经网络(DNN)和极端梯度提升(XGBoost)构建了70个定量构效关系(QSAR)模型。该数据集上的其他三个回归模型由使用自注意力循环神经网络(RNN)和图卷积网络(GCN)的SMILES表示。对于由带有RDKit描述符的SVM开发的最佳模型(模型C2),测试集的决定系数(R)为0.861,均方根误差(RMSE)为0.235。此外,在外部测试集上获得的R为0.692,RMSE为0.383。我们用竞争指数(RI)研究了模型C2的适用域(AD),模型C2对测试集中78.92%的分子以及外部测试集中78.33%的分子的预测是可靠的。在剖析模型C2的RDKit描述符后,我们发现了高活性mPGES-1抑制剂的重要物理化学性质。此外,通过分析注意力层中每种抑制剂每个原子的注意力权重,我们发现苯甲酰胺基团和三氟甲基环己烷基团是mPGES-1抑制剂的有利子结构。

相似文献

1
Prediction of bioactivities of microsomal prostaglandin E synthase-1 inhibitors by machine learning algorithms.通过机器学习算法预测微粒体前列腺素E合酶-1抑制剂的生物活性
Chem Biol Drug Des. 2023 Jun;101(6):1307-1321. doi: 10.1111/cbdd.14214. Epub 2023 Feb 20.
2
Discovering the Active Ingredients of Medicine and Food Homologous Substances for Inhibiting the Cyclooxygenase-2 Metabolic Pathway by Machine Learning Algorithms.利用机器学习算法发现抑制环氧化酶-2代谢途径的药食同源物质的有效成分。
Molecules. 2023 Sep 23;28(19):6782. doi: 10.3390/molecules28196782.
3
Computational models for the classification of mPGES-1 inhibitors with fingerprint descriptors.基于指纹描述符的 mPGES-1 抑制剂分类的计算模型。
Mol Divers. 2017 Aug;21(3):661-675. doi: 10.1007/s11030-017-9743-x. Epub 2017 May 8.
4
Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets.我们是否需要不同的机器学习算法来进行定量构效关系建模?对 16 种机器学习算法在 14 个定量构效关系数据集上的综合评估。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa321.
5
A SAR and QSAR study on 3CLpro inhibitors of SARS-CoV-2 using machine learning methods.基于机器学习方法的 SARS-CoV-2 3CLpro 抑制剂的 SAR 和 QSAR 研究。
SAR QSAR Environ Res. 2024 Jul;35(7):531-563. doi: 10.1080/1062936X.2024.2375513. Epub 2024 Jul 30.
6
Classification models and SAR analysis on thromboxane A synthase inhibitors by machine learning methods.基于机器学习方法的血栓素 A 合酶抑制剂的分类模型和 SAR 分析。
SAR QSAR Environ Res. 2022 Jun;33(6):429-462. doi: 10.1080/1062936X.2022.2078880. Epub 2022 Jun 9.
7
Classification models and SAR analysis on CysLT1 receptor antagonists using machine learning algorithms.使用机器学习算法对半胱氨酰白三烯1受体拮抗剂进行分类模型和构效关系分析。
Mol Divers. 2021 Aug;25(3):1597-1616. doi: 10.1007/s11030-020-10165-4. Epub 2021 Feb 3.
8
SAR and QSAR research on tyrosinase inhibitors using machine learning methods.基于机器学习方法的酪氨酸酶抑制剂的 SAR 和 QSAR 研究。
SAR QSAR Environ Res. 2021 Feb;32(2):85-110. doi: 10.1080/1062936X.2020.1862297. Epub 2021 Feb 1.
9
Classification and QSAR models of leukotriene A4 hydrolase (LTA4H) inhibitors by machine learning methods.采用机器学习方法对白细胞三烯 A4 水解酶(LTA4H)抑制剂进行分类和定量构效关系模型研究。
SAR QSAR Environ Res. 2021 May;32(5):411-431. doi: 10.1080/1062936X.2021.1910862. Epub 2021 Apr 26.
10
Classification models and SAR analysis on HDAC1 inhibitors using machine learning methods.使用机器学习方法对 HDAC1 抑制剂进行分类模型和 SAR 分析。
Mol Divers. 2023 Jun;27(3):1037-1051. doi: 10.1007/s11030-022-10466-w. Epub 2022 Jun 23.

引用本文的文献

1
BGATT-GR: accurate identification of glucocorticoid receptor antagonists based on data augmentation combined with BiGRU-attention.BGATT-GR:基于数据增强结合双向门控循环单元-注意力机制的糖皮质激素受体拮抗剂准确识别
Sci Rep. 2025 Jul 1;15(1):21402. doi: 10.1038/s41598-025-05839-8.
2
Discovering the Active Ingredients of Medicine and Food Homologous Substances for Inhibiting the Cyclooxygenase-2 Metabolic Pathway by Machine Learning Algorithms.利用机器学习算法发现抑制环氧化酶-2代谢途径的药食同源物质的有效成分。
Molecules. 2023 Sep 23;28(19):6782. doi: 10.3390/molecules28196782.