一种用于联合特征选择和分类器设计的贝叶斯方法。

A Bayesian approach to joint feature selection and classifier design.

作者信息

Krishnapuram Balaji, Hartemink Alexander J, Carin Lawrence, Figueiredo Mário A T

机构信息

Department of Electrical Engineering, Duke University, Durham, NC 27708-0291, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2004 Sep;26(9):1105-11. doi: 10.1109/TPAMI.2004.55.

DOI:10.1109/TPAMI.2004.55

PMID:15742887

Abstract

This paper adopts a Bayesian approach to simultaneously learn both an optimal nonlinear classifier and a subset of predictor variables (or features) that are most relevant to the classification task. The approach uses heavy-tailed priors to promote sparsity in the utilization of both basis functions and features; these priors act as regularizers for the likelihood function that rewards good classification on the training data. We derive an expectation-maximization (EM) algorithm to efficiently compute a maximum a posteriori (MAP) point estimate of the various parameters. The algorithm is an extension of recent state-of-the-art sparse Bayesian classifiers, which in turn can be seen as Bayesian counterparts of support vector machines. Experimental comparisons using kernel classifiers demonstrate both parsimonious feature selection and excellent classification accuracy on a range of synthetic and benchmark data sets.

摘要

本文采用贝叶斯方法同时学习最优非线性分类器和与分类任务最相关的预测变量（或特征）子集。该方法使用重尾先验来促进基函数和特征利用的稀疏性；这些先验作为似然函数的正则化项，对训练数据上的良好分类给予奖励。我们推导了一种期望最大化（EM）算法，以有效地计算各种参数的最大后验（MAP）点估计。该算法是最近最先进的稀疏贝叶斯分类器的扩展，而稀疏贝叶斯分类器又可视为支持向量机的贝叶斯对应物。使用核分类器的实验比较表明，在一系列合成数据集和基准数据集上，该方法既能进行简约的特征选择，又具有出色的分类准确率。

相似文献

A Bayesian approach to joint feature selection and classifier design.

IEEE Trans Pattern Anal Mach Intell. 2004 Sep;26(9):1105-11. doi: 10.1109/TPAMI.2004.55.

Bayesian Gaussian process classification with the EM-EP algorithm.

IEEE Trans Pattern Anal Mach Intell. 2006 Dec;28(12):1948-59. doi: 10.1109/TPAMI.2006.238.

Sparse multinomial logistic regression: fast algorithms and generalization bounds.

IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):957-68. doi: 10.1109/TPAMI.2005.127.

A novel kernel method for clustering.

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):801-5. doi: 10.1109/TPAMI.2005.88.

On utilizing search methods to select subspace dimensions for kernel-based nonlinear subspace classifiers.

IEEE Trans Pattern Anal Mach Intell. 2005 Jan;27(1):136-41. doi: 10.1109/TPAMI.2005.15.

Gene selection in cancer classification using sparse logistic regression with Bayesian regularization.

Bioinformatics. 2006 Oct 1;22(19):2348-55. doi: 10.1093/bioinformatics/btl386. Epub 2006 Jul 14.

Bias in error estimation when using cross-validation for model selection.

BMC Bioinformatics. 2006 Feb 23;7:91. doi: 10.1186/1471-2105-7-91.

One-shot learning of object categories.

IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):594-611. doi: 10.1109/TPAMI.2006.79.

The relevance sample-feature machine: a sparse Bayesian learning approach to joint feature-sample selection.

IEEE Trans Cybern. 2013 Dec;43(6):2241-54. doi: 10.1109/TCYB.2013.2260736.

Cancer classification and prediction using logistic regression with Bayesian gene selection.

J Biomed Inform. 2004 Aug;37(4):249-59. doi: 10.1016/j.jbi.2004.07.009.

引用本文的文献

A comparative analysis of gene expression profiling by statistical and machine learning approaches.

Bioinform Adv. 2024 Dec 18;5(1):vbae199. doi: 10.1093/bioadv/vbae199. eCollection 2025.

Exploring an immune cells-related molecule in STEMI by bioinformatics analysis.

BMC Med Genomics. 2023 Jun 30;16(1):151. doi: 10.1186/s12920-023-01579-8.

Graph convolutional network-based feature selection for high-dimensional and low-sample size data.

Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad135.

Machine learning algorithm-based identification and verification of characteristic genes in acute kidney injury.

Front Med (Lausanne). 2022 Oct 13;9:1016459. doi: 10.3389/fmed.2022.1016459. eCollection 2022.

Sparse feature selection identifies H2A.Z as a novel, pattern-specific biomarker for asymmetrically self-renewing distributed stem cells.

Stem Cell Res. 2015 Mar;14(2):144-54. doi: 10.1016/j.scr.2014.12.007. Epub 2015 Jan 6.

Accurate prediction of coronary artery disease using reliable diagnosis system.

J Med Syst. 2012 Oct;36(5):3353-73. doi: 10.1007/s10916-012-9828-0. Epub 2012 Feb 12.

Evolving a Bayesian Classifier for ECG-based Age Classification in Medical Applications.

Appl Soft Comput. 2008 Jan;8(1):599-608. doi: 10.1016/j.asoc.2007.03.009.

Classification of arrayCGH data using fused SVM.

Bioinformatics. 2008 Jul 1;24(13):i375-82. doi: 10.1093/bioinformatics/btn188.

A hierarchical Naïve Bayes Model for handling sample heterogeneity in classification problems: an application to tissue microarrays.

BMC Bioinformatics. 2006 Nov 24;7:514. doi: 10.1186/1471-2105-7-514.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于联合特征选择和分类器设计的贝叶斯方法。

A Bayesian approach to joint feature selection and classifier design.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献