自适应增强集成决策树桩的配体分类器（LiCABEDS）及其在 5HT 亚型 GPCR 家族配体功能建模中的应用。

Ligand Classifier of Adaptively Boosting Ensemble Decision Stumps (LiCABEDS) and its application on modeling ligand functionality for 5HT-subtype GPCR families.

机构信息

Department of Computational Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, USA.

出版信息

J Chem Inf Model. 2011 Mar 28;51(3):521-31. doi: 10.1021/ci100399j. Epub 2011 Mar 7.

DOI:10.1021/ci100399j

PMID:21381738

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3065508/

Abstract

Advanced high-throughput screening (HTS) technologies generate great amounts of bioactivity data, and this data needs to be analyzed and interpreted with attention to understand how these small molecules affect biological systems. As such, there is an increasing demand to develop and adapt cheminformatics algorithms and tools in order to predict molecular and pharmacological properties on the basis of these large data sets. In this manuscript, we report a novel machine-learning-based ligand classification algorithm, named Ligand Classifier of Adaptively Boosting Ensemble Decision Stumps (LiCABEDS), for data-mining and modeling of large chemical data sets to predict pharmacological properties in an efficient and accurate manner. The performance of LiCABEDS was evaluated through predicting GPCR ligand functionality (agonist or antagonist) using four different molecular fingerprints, including Maccs, FP2, Unity, and Molprint 2D fingerprints. Our studies showed that LiCABEDS outperformed two other popular techniques, classification tree and Naive Bayes classifier, on all four types of molecular fingerprints. Parameters in LiCABEDS, including the number of boosting iterations, initialization condition, and a "reject option" boundary, were thoroughly explored and discussed to demonstrate the capability of handling imbalanced data sets, as well as its robustness and flexibility. In addition, the detailed mathematical concepts and theory are also given to address the principle behind statistical prediction models. The LiCABEDS algorithm has been implemented into a user-friendly software package that is accessible online at http://www.cbligand.org/LiCABEDS/ .

摘要

高通量筛选 (HTS) 技术会产生大量的生物活性数据，这些数据需要经过分析和解释，以便了解这些小分子如何影响生物系统。因此，人们越来越需要开发和适应化学信息学算法和工具，以便根据这些大数据集来预测分子和药理学性质。在本文中，我们报告了一种新的基于机器学习的配体分类算法，名为 Ligand Classifier of Adaptively Boosting Ensemble Decision Stumps (LiCABEDS)，用于挖掘和建模大型化学数据集，以高效、准确地预测药理学性质。通过使用四种不同的分子指纹图谱（Maccs、FP2、Unity 和 Molprint 2D 指纹图谱）预测 GPCR 配体功能（激动剂或拮抗剂），评估了 LiCABEDS 的性能。我们的研究表明，LiCABEDS 在所有四种类型的分子指纹图谱上的性能均优于另外两种流行的技术，即分类树和朴素贝叶斯分类器。深入探讨和讨论了 LiCABEDS 中的参数，包括提升迭代次数、初始化条件和“拒绝选项”边界，以展示其处理不平衡数据集的能力以及其稳健性和灵活性。此外，还给出了详细的数学概念和理论，以解决统计预测模型背后的原理。LiCABEDS 算法已被实现为一个用户友好的软件包，并可在 http://www.cbligand.org/LiCABEDS/ 上在线访问。

相似文献

Ligand Classifier of Adaptively Boosting Ensemble Decision Stumps (LiCABEDS) and its application on modeling ligand functionality for 5HT-subtype GPCR families.

J Chem Inf Model. 2011 Mar 28;51(3):521-31. doi: 10.1021/ci100399j. Epub 2011 Mar 7.

LiCABEDS II. Modeling of ligand selectivity for G-protein-coupled cannabinoid receptors.

J Chem Inf Model. 2013 Jan 28;53(1):11-26. doi: 10.1021/ci3003914. Epub 2013 Jan 15.

Development and validation of a novel protein-ligand fingerprint to mine chemogenomic space: application to G protein-coupled receptors and their ligands.

J Chem Inf Model. 2009 Apr;49(4):1049-62. doi: 10.1021/ci800447g.

AiGPro: a multi-tasks model for profiling of GPCRs for agonist and antagonist.

J Cheminform. 2025 Jan 29;17(1):12. doi: 10.1186/s13321-024-00945-7.

Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence.

Biomolecules. 2021 Jun 11;11(6):870. doi: 10.3390/biom11060870.

GPCR-MPredictor: multi-level prediction of G protein-coupled receptors using genetic ensemble.

Amino Acids. 2012 May;42(5):1809-23. doi: 10.1007/s00726-011-0902-6. Epub 2011 Apr 20.

Heterogeneous classifier fusion for ligand-based virtual screening: or, how decision making by committee can be a good thing.

J Chem Inf Model. 2013 Nov 25;53(11):2829-36. doi: 10.1021/ci400466r. Epub 2013 Nov 14.

A Machine Learning Approach for the Discovery of Ligand-Specific Functional Mechanisms of GPCRs.

Molecules. 2019 Jun 2;24(11):2097. doi: 10.3390/molecules24112097.

Binding Activity Prediction of Cyclin-Dependent Inhibitors.

J Chem Inf Model. 2015 Jul 27;55(7):1469-82. doi: 10.1021/ci500633c. Epub 2015 Jul 10.

GPCRVS - AI-driven Decision Support System for GPCR Virtual Screening.

Int J Mol Sci. 2025 Feb 27;26(5):2160. doi: 10.3390/ijms26052160.

引用本文的文献

GPCR-A17 MAAP: mapping modulators, agonists, and antagonists to predict the next bioactive target.

J Cheminform. 2025 Jul 11;17(1):102. doi: 10.1186/s13321-025-01050-z.

GraphDeep-hERG: Graph Neural Network PharmacoAnalytics for Assessing hERG-Related Cardiotoxicity.

Pharm Res. 2025 Apr;42(4):579-591. doi: 10.1007/s11095-025-03848-w. Epub 2025 Mar 26.

Leveraging Artificial Intelligence in GPCR Activation Studies: Computational Prediction Methods as Key Drivers of Knowledge.

Methods Mol Biol. 2025;2870:183-220. doi: 10.1007/978-1-0716-4213-9_10.

Distinct activation mechanisms regulate subtype selectivity of Cannabinoid receptors.

Commun Biol. 2023 May 5;6(1):485. doi: 10.1038/s42003-023-04868-1.

Curated Database and Preliminary AutoML QSAR Model for 5-HT1A Receptor.

Pharmaceutics. 2021 Oct 16;13(10):1711. doi: 10.3390/pharmaceutics13101711.

Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence.

Biomolecules. 2021 Jun 11;11(6):870. doi: 10.3390/biom11060870.

Discovery of a Novel Acetylcholinesterase Inhibitor by Fragment-Based Design and Virtual Screening.

Molecules. 2021 Apr 3;26(7):2058. doi: 10.3390/molecules26072058.

Galantamine-Curcumin Hybrids as Dual-Site Binding Acetylcholinesterase Inhibitors.

Molecules. 2020 Jul 23;25(15):3341. doi: 10.3390/molecules25153341.

Virus-CKB: an integrated bioinformatics platform and analysis resource for COVID-19 research.

Brief Bioinform. 2021 Mar 22;22(2):882-895. doi: 10.1093/bib/bbaa155.

Analysis of substance use and its outcomes by machine learning I. Childhood evaluation of liability to substance use disorder.

Drug Alcohol Depend. 2020 Jan 1;206:107605. doi: 10.1016/j.drugalcdep.2019.107605. Epub 2019 Oct 22.

本文引用的文献

Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.

J Am Chem Soc. 1988 Aug 1;110(18):5959-67. doi: 10.1021/ja00226a005.

Exploiting PubChem for Virtual Screening.

Expert Opin Drug Discov. 2010 Dec;5(12):1205-1220. doi: 10.1517/17460441.2010.524924.

Searching for target-selective compounds using different combinations of multiclass support vector machine ranking methods, kernel functions, and fingerprint descriptors.

J Chem Inf Model. 2009 Mar;49(3):582-92. doi: 10.1021/ci800441c.

Methods for computer-aided chemical biology. Part 3: analysis of structure-selectivity relationships through single- or dual-step selectivity searching and Bayesian classification.

Chem Biol Drug Des. 2008 Jun;71(6):518-28. doi: 10.1111/j.1747-0285.2008.00670.x. Epub 2008 May 9.

Prediction of PAH mutagenicity in human cells by QSAR classification.

SAR QSAR Environ Res. 2008 Jan-Mar;19(1-2):115-27. doi: 10.1080/10629360701843482.

GLIDA: GPCR--ligand database for chemical genomics drug discovery--database and tools update.

Nucleic Acids Res. 2008 Jan;36(Database issue):D907-12. doi: 10.1093/nar/gkm948. Epub 2007 Nov 5.

Recent developments of the chemistry development kit (CDK) - an open-source java library for chemo- and bioinformatics.

Curr Pharm Des. 2006;12(17):2111-20. doi: 10.2174/138161206777585274.

Classification tree models for the prediction of blood-brain barrier passage of drugs.

J Chem Inf Model. 2006 May-Jun;46(3):1410-9. doi: 10.1021/ci050518s.

Assessing different classification methods for virtual screening.

J Chem Inf Model. 2006 May-Jun;46(3):1098-106. doi: 10.1021/ci050519k.

Virtual screening using binary kernel discrimination: analysis of pesticide data.

J Chem Inf Model. 2006 Mar-Apr;46(2):471-7. doi: 10.1021/ci050397w.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

自适应增强集成决策树桩的配体分类器（LiCABEDS）及其在 5HT 亚型 GPCR 家族配体功能建模中的应用。

Ligand Classifier of Adaptively Boosting Ensemble Decision Stumps (LiCABEDS) and its application on modeling ligand functionality for 5HT-subtype GPCR families.

机构信息

Department of Computational Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, USA.

出版信息

J Chem Inf Model. 2011 Mar 28;51(3):521-31. doi: 10.1021/ci100399j. Epub 2011 Mar 7.

DOI:10.1021/ci100399j

PMID:21381738

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3065508/

Abstract

摘要

自适应增强集成决策树桩的配体分类器（LiCABEDS）及其在 5HT 亚型 GPCR 家族配体功能建模中的应用。

Ligand Classifier of Adaptively Boosting Ensemble Decision Stumps (LiCABEDS) and its application on modeling ligand functionality for 5HT-subtype GPCR families.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

自适应增强集成决策树桩的配体分类器（LiCABEDS）及其在 5HT 亚型 GPCR 家族配体功能建模中的应用。

Ligand Classifier of Adaptively Boosting Ensemble Decision Stumps (LiCABEDS) and its application on modeling ligand functionality for 5HT-subtype GPCR families.

机构信息

出版信息