RBOOST: RIEMANNIAN DISTANCE BASED REGULARIZED BOOSTING.

Author information

Liu Meizhu, Vemuri Baba C

Affiliation

Department of CISE, University of Florida, Gainesville, FL 32611.

Publication information

Proc IEEE Int Symp Biomed Imaging. 2011 Mar 30;2011:1831-1834. doi: 10.1109/ISBI.2011.5872763.

DOI:10.1109/ISBI.2011.5872763

PMID:21927643

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3173974/

Abstract

Boosting is a versatile machine learning technique with numerous applications, including but not limited to image processing, computer vision, and data mining. It is based on the premise that the classification performance of a set of weak learners can be boosted by some weighted combination of them. A number of boosting methods have been proposed in the literature, such as AdaBoost, LPBoost, SoftBoost and their variations. However, the learning update strategies used in these methods usually lead to overfitting and instabilities in the classification accuracy. Improved boosting methods via regularization can overcome such difficulties. In this paper, we propose a Riemannian distance regularized LPBoost, dubbed RBoost. RBoost uses the Riemannian distance between two square-root densities, which represent the distribution over the training data and the classification error respectively, to regularize the error distribution in an iterative update formula. Since this distance is available in closed form, RBoost requires much less computation than other regularized boosting algorithms. We present several experimental results comparing the performance of our algorithm with recently published methods, LPBoost and CAVIAR, on a variety of datasets including the publicly available OASIS database, a home-grown Epilepsy database and the well-known UCI repository. The results show that the RBoost algorithm performs better than the competing methods in terms of accuracy and efficiency.
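The abstract does not spell out the closed-form distance. As a minimal sketch, assume it is the standard geodesic distance on the unit sphere of square-root densities: for discrete distributions p and q, the arc cosine of the inner product of sqrt(p) and sqrt(q). The function name riemannian_distance and the normalization step are illustrative assumptions, not taken from the paper.

```python
import math


def riemannian_distance(p, q):
    """Geodesic distance between two discrete distributions.

    Assumption (not stated in the abstract): each distribution is mapped to
    its square-root density, which lies on the unit sphere, and the distance
    is the great-circle arc length between the two points.
    """
    if len(p) != len(q):
        raise ValueError("p and q must have the same length")
    if any(x < 0 for x in p) or any(x < 0 for x in q):
        raise ValueError("weights must be non-negative")

    total_p, total_q = sum(p), sum(q)
    if total_p <= 0 or total_q <= 0:
        raise ValueError("each distribution needs positive total mass")

    # Inner product of the two square-root densities after normalization.
    inner = sum(
        math.sqrt((a / total_p) * (b / total_q)) for a, b in zip(p, q)
    )
    # Clamp against floating-point drift before taking the arc cosine.
    inner = min(1.0, max(0.0, inner))
    return math.acos(inner)


if __name__ == "__main__":
    uniform = [0.25, 0.25, 0.25, 0.25]
    skewed = [0.7, 0.1, 0.1, 0.1]
    print(riemannian_distance(uniform, skewed))
```

Under this assumption the distance needs only one pass over the sample weights, with no iterative optimization, which is consistent with the abstract's claim that a closed form keeps the regularizer cheap to evaluate.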

References cited in this article

CAVIAR: CLASSIFICATION VIA AGGREGATED REGRESSION AND ITS APPLICATION IN CLASSIFYING OASIS BRAIN DATABASE.

Proc IEEE Int Symp Biomed Imaging. 2010 Apr 14;2010:1337-1340. doi: 10.1109/ISBI.2010.5490244.

Open Access Series of Imaging Studies (OASIS): cross-sectional MRI data in young, middle aged, nondemented, and demented older adults.

J Cogn Neurosci. 2007 Sep;19(9):1498-507. doi: 10.1162/jocn.2007.19.9.1498.

Kernel Fisher discriminant for shape-based classification in epilepsy.

Med Image Anal. 2007 Feb;11(1):79-90. doi: 10.1016/j.media.2006.10.002. Epub 2006 Dec 6.

Unbiased diffeomorphic atlas construction for computational anatomy.

Neuroimage. 2004;23 Suppl 1:S151-60. doi: 10.1016/j.neuroimage.2004.07.068.
