• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

M3S-GRPred:一种使用多步堆叠策略对糖皮质激素受体拮抗剂进行可解释预测的新型集成学习方法。

M3S-GRPred: a novel ensemble learning approach for the interpretable prediction of glucocorticoid receptor antagonists using a multi-step stacking strategy.

作者信息

Schaduangrat Nalini, Chuntakaruk Hathaichanok, Rungrotmongkol Thanyada, Mookdarsanit Pakpoom, Shoombuatong Watshara

机构信息

Faculty of Medical Technology, Center for Research Innovation and Biomedical Informatics, Mahidol University, Bangkok, 10700, Thailand.

Program in Bioinformatics and Computational Biology, Graduate School, Chulalongkorn University, Bangkok, 10330, Thailand.

出版信息

BMC Bioinformatics. 2025 Apr 30;26(1):117. doi: 10.1186/s12859-025-06132-1.

DOI:10.1186/s12859-025-06132-1
PMID:40307679
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12044944/
Abstract

Accelerating drug discovery for glucocorticoid receptor (GR)-related disorders, including innovative machine learning (ML)-based approaches, holds promise in advancing therapeutic development, optimizing treatment efficacy, and mitigating adverse effects. While experimental methods can accurately identify GR antagonists, they are often not cost-effective for large-scale drug discovery. Thus, computational approaches leveraging SMILES information for precise in silico identification of GR antagonists are crucial, enabling efficient and scalable drug discovery. Here, we develop a new ensemble learning approach using a multi-step stacking strategy (M3S), termed M3S-GRPred, aimed at rapidly and accurately discovering novel GR antagonists. To the best of our knowledge, M3S-GRPred is the first SMILES-based predictor designed to identify GR antagonists without the use of 3D structural information. In M3S-GRPred, we first constructed different balanced subsets using an under-sampling approach. Using these balanced subsets, we explored and evaluated heterogeneous base-classifiers trained with a variety of SMILES-based feature descriptors coupled with popular ML algorithms. Finally, M3S-GRPred was constructed by integrating probabilistic feature from the selected base-classifiers derived from a two-step feature selection technique. Our comparative experiments demonstrate that M3S-GRPred can precisely identify GR antagonists and effectively address the imbalanced dataset. Compared to traditional ML classifiers, M3S-GRPred attained superior performance in terms of both the training and independent test datasets. Additionally, M3S-GRPred was applied to identify potential GR antagonists among FDA-approved drugs confirmed through molecular docking, followed by detailed MD simulation studies for drug repurposing in Cushing's syndrome. We anticipate that M3S-GRPred will serve as an efficient screening tool for discovering novel GR antagonists from vast libraries of unknown compounds in a cost-effective manner.

摘要

加速针对糖皮质激素受体(GR)相关疾病的药物发现,包括基于创新机器学习(ML)的方法,有望推动治疗开发、优化治疗效果并减轻不良反应。虽然实验方法可以准确识别GR拮抗剂,但对于大规模药物发现而言,它们往往成本效益不高。因此,利用SMILES信息进行GR拮抗剂精确虚拟识别的计算方法至关重要,可实现高效且可扩展的药物发现。在此,我们开发了一种新的集成学习方法,采用多步堆叠策略(M3S),称为M3S-GRPred,旨在快速准确地发现新型GR拮抗剂。据我们所知,M3S-GRPred是首个基于SMILES设计的预测器,旨在在不使用3D结构信息的情况下识别GR拮抗剂。在M3S-GRPred中,我们首先使用欠采样方法构建不同的平衡子集。利用这些平衡子集,我们探索并评估了使用各种基于SMILES的特征描述符与流行的ML算法训练的异构基分类器培训。最后,通过集成来自两步特征选择技术衍生的选定基分类器的概率特征来构建M3S-GRPred。我们的比较实验表明,M3S-GRPred可以精确识别GR拮抗剂并有效解决不平衡数据集问题。与传统ML分类器相比,M3S-GRPred在训练数据集和独立测试数据集方面均表现出卓越性能。此外,M3S-GRPred被应用于在FDA批准的药物中识别潜在的GR拮抗剂,通过分子对接确认,随后进行详细的分子动力学模拟研究,以用于库欣综合征的药物重新利用。我们预计M3S-GRPred将成为一种高效的筛选工具,以经济高效的方式从大量未知化合物库中发现新型GR拮抗剂。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/bf6a7555b551/12859_2025_6132_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/d5ae9598340b/12859_2025_6132_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/99c249684d37/12859_2025_6132_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/38c64d0a772d/12859_2025_6132_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/ce1f46770b92/12859_2025_6132_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/780d99d2a2c8/12859_2025_6132_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/f8ea747292b7/12859_2025_6132_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/bf6a7555b551/12859_2025_6132_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/d5ae9598340b/12859_2025_6132_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/99c249684d37/12859_2025_6132_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/38c64d0a772d/12859_2025_6132_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/ce1f46770b92/12859_2025_6132_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/780d99d2a2c8/12859_2025_6132_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/f8ea747292b7/12859_2025_6132_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d352/12044944/bf6a7555b551/12859_2025_6132_Fig7_HTML.jpg

相似文献

1
M3S-GRPred: a novel ensemble learning approach for the interpretable prediction of glucocorticoid receptor antagonists using a multi-step stacking strategy.M3S-GRPred:一种使用多步堆叠策略对糖皮质激素受体拮抗剂进行可解释预测的新型集成学习方法。
BMC Bioinformatics. 2025 Apr 30;26(1):117. doi: 10.1186/s12859-025-06132-1.
2
MetaCGRP is a high-precision meta-model for large-scale identification of CGRP inhibitors using multi-view information.MetaCGRP 是一种高精度的元模型,用于使用多视图信息大规模识别 CGRP 抑制剂。
Sci Rep. 2024 Oct 21;14(1):24764. doi: 10.1038/s41598-024-75487-x.
3
Stack-AVP: A Stacked Ensemble Predictor Based on Multi-view Information for Fast and Accurate Discovery of Antiviral Peptides.堆叠式抗血管加压素:一种基于多视图信息的堆叠集成预测器,用于快速准确地发现抗病毒肽。
J Mol Biol. 2025 Mar 15;437(6):168853. doi: 10.1016/j.jmb.2024.168853. Epub 2024 Nov 6.
4
SCORPION is a stacking-based ensemble learning framework for accurate prediction of phage virion proteins.SCORPION 是一个基于堆叠的集成学习框架,用于准确预测噬菌体病毒蛋白。
Sci Rep. 2022 Mar 8;12(1):4106. doi: 10.1038/s41598-022-08173-5.
5
StackER: a novel SMILES-based stacked approach for the accelerated and efficient discovery of ERα and ERβ antagonists.StackER:一种基于 SMILES 的新型堆叠方法,用于加速和高效发现 ERα 和 ERβ 拮抗剂。
Sci Rep. 2023 Dec 27;13(1):22994. doi: 10.1038/s41598-023-50393-w.
6
StackTTCA: a stacking ensemble learning-based framework for accurate and high-throughput identification of tumor T cell antigens.StackTTCA:一种基于堆叠集成学习的框架,用于准确、高通量地鉴定肿瘤 T 细胞抗原。
BMC Bioinformatics. 2023 Jul 28;24(1):301. doi: 10.1186/s12859-023-05421-x.
7
DeepAR: a novel deep learning-based hybrid framework for the interpretable prediction of androgen receptor antagonists.DeepAR:一种基于深度学习的新型混合框架,用于雄激素受体拮抗剂的可解释预测。
J Cheminform. 2023 May 6;15(1):50. doi: 10.1186/s13321-023-00721-z.
8
Deepstack-ACE: A deep stacking-based ensemble learning framework for the accelerated discovery of ACE inhibitory peptides.深度堆叠-ACE:一种基于深度堆叠的集成学习框架,用于加速发现ACE抑制肽。
Methods. 2025 Feb;234:131-140. doi: 10.1016/j.ymeth.2024.12.005. Epub 2024 Dec 19.
9
An ensemble learning method combined with multiple feature representation strategies to predict lncRNA subcellular localizations.一种结合多种特征表示策略的集成学习方法,用于预测长链非编码RNA的亚细胞定位。
Comput Biol Chem. 2025 Apr;115:108336. doi: 10.1016/j.compbiolchem.2024.108336. Epub 2025 Jan 1.
10
A Machine Learning Approach for Drug-target Interaction Prediction using Wrapper Feature Selection and Class Balancing.基于包装特征选择和类别平衡的药物-靶标相互作用预测的机器学习方法。
Mol Inform. 2020 May;39(5):e1900062. doi: 10.1002/minf.201900062. Epub 2020 Feb 11.

引用本文的文献

1
Advancing the accuracy of clathrin protein prediction through multi-source protein language models.通过多源蛋白质语言模型提高网格蛋白蛋白质预测的准确性。
Sci Rep. 2025 Jul 8;15(1):24403. doi: 10.1038/s41598-025-08510-4.

本文引用的文献

1
Leveraging a meta-learning approach to advance the accuracy of Na blocking peptides prediction.利用元学习方法提高 Na 阻断肽预测的准确性。
Sci Rep. 2024 Feb 23;14(1):4463. doi: 10.1038/s41598-024-55160-z.
2
Accelerating the identification of the allergenic potential of plant proteins using a stacked ensemble-learning framework.使用堆叠集成学习框架加速植物蛋白致敏潜力的鉴定。
J Biomol Struct Dyn. 2024 Feb 22:1-13. doi: 10.1080/07391102.2024.2318482.
3
StackER: a novel SMILES-based stacked approach for the accelerated and efficient discovery of ERα and ERβ antagonists.
StackER:一种基于 SMILES 的新型堆叠方法,用于加速和高效发现 ERα 和 ERβ 拮抗剂。
Sci Rep. 2023 Dec 27;13(1):22994. doi: 10.1038/s41598-023-50393-w.
4
Development of in silico classification models for binding affinity to the glucocorticoid receptor.糖皮质激素受体结合亲和力的计算机分类模型的开发。
Chemosphere. 2023 Sep;336:139147. doi: 10.1016/j.chemosphere.2023.139147. Epub 2023 Jun 8.
5
DeepAR: a novel deep learning-based hybrid framework for the interpretable prediction of androgen receptor antagonists.DeepAR:一种基于深度学习的新型混合框架,用于雄激素受体拮抗剂的可解释预测。
J Cheminform. 2023 May 6;15(1):50. doi: 10.1186/s13321-023-00721-z.
6
PSRQSP: An effective approach for the interpretable prediction of quorum sensing peptide using propensity score representation learning.PSRQSP:一种使用倾向得分表示学习对群体感应肽进行可解释预测的有效方法。
Comput Biol Med. 2023 May;158:106784. doi: 10.1016/j.compbiomed.2023.106784. Epub 2023 Mar 14.
7
A machine learning-integrated stepwise method to discover novel anti-obesity phytochemicals that antagonize the glucocorticoid receptor.一种整合机器学习的逐步方法,用于发现拮抗糖皮质激素受体的新型抗肥胖植物化学物质。
Food Funct. 2023 Feb 21;14(4):1869-1883. doi: 10.1039/d2fo03466b.
8
Targeting an evolutionarily conserved "E-L-L" motif in spike protein to identify a small molecule fusion inhibitor against SARS-CoV-2.靶向刺突蛋白中一个进化保守的“E-L-L”基序以鉴定一种抗SARS-CoV-2的小分子融合抑制剂。
PNAS Nexus. 2022 Oct 22;1(5):pgac198. doi: 10.1093/pnasnexus/pgac198. eCollection 2022 Nov.
9
StackPR is a new computational approach for large-scale identification of progesterone receptor antagonists using the stacking strategy.StackPR 是一种使用堆叠策略进行大规模鉴定孕激素受体拮抗剂的新计算方法。
Sci Rep. 2022 Sep 30;12(1):16435. doi: 10.1038/s41598-022-20143-5.
10
Structure-based virtual screening, molecular docking, molecular dynamics simulation and MM/PBSA calculations towards identification of steroidal and non-steroidal selective glucocorticoid receptor modulators.基于结构的虚拟筛选、分子对接、分子动力学模拟以及MM/PBSA计算用于鉴定甾体和非甾体选择性糖皮质激素受体调节剂。
J Biomol Struct Dyn. 2023 Sep-Oct;41(16):7640-7650. doi: 10.1080/07391102.2022.2123392. Epub 2022 Sep 22.