Qian Chengde, Tran-Dinh Quoc, Fu Sheng, Zou Changliang, Liu Yufeng
School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, P. R. China.
Department of Statistics and Operations Research, The University of North Carolina at Chapel Hill.
Math Program. 2019 Jul;176(1-2):429-463. doi: 10.1007/s10107-019-01386-z. Epub 2019 Mar 28.
We consider the classification problem when the input features are represented as matrices rather than vectors. To preserve the intrinsic structure for classification, a successful method is the Support Matrix Machine (SMM) in [19], which optimizes an objective function consisting of a hinge loss plus a so-called spectral elastic net penalty. However, extending SMM to multicategory classification remains an open issue. Moreover, in practice the training data are often contaminated by outlying observations, which can undermine the robustness of existing matrix classification methods. In this paper, we address these issues by introducing a robust angle-based classifier, which reduces binary and multicategory problems to a unified framework. Benefiting from the use of truncated hinge loss functions, the proposed classifier achieves a degree of robustness to outliers. The underlying optimization model becomes nonconvex, but admits a natural DC (difference of two convex functions) representation. We develop a new and efficient algorithm by combining the DC algorithm with primal-dual first-order methods. The proposed DC algorithm adaptively chooses the accuracy of the subproblem at each iteration while guaranteeing the overall convergence of the algorithm. The primal-dual methods sidestep the complexity introduced by the linear operator in the subproblems, requiring only the proximal operators of the objective functions and matrix-vector operations. This advantage allows us to solve large-scale problems efficiently. Theoretical and numerical results indicate that, for problems with potential outliers, our method is highly competitive with existing methods.
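To make the DC representation concrete, the following is a minimal sketch of a truncated hinge loss written as a difference of two convex hinge functions, in the standard form T_s(u) = H_1(u) - H_s(u) with H_s(u) = max(0, s - u). The truncation level s here is an illustrative choice; the paper's actual angle-based multicategory loss may take a different form.

```python
import numpy as np

def hinge(u, s=1.0):
    # H_s(u) = max(0, s - u): a convex, piecewise-linear function of u.
    return np.maximum(0.0, s - u)

def truncated_hinge(u, s=-1.0):
    # DC decomposition T_s(u) = H_1(u) - H_s(u), the difference of two
    # convex hinges. The loss is capped at 1 - s for u <= s, so a single
    # badly misclassified point (very negative u) contributes a bounded
    # amount, which is the source of robustness to outliers.
    return hinge(u, 1.0) - hinge(u, s)
```

For example, with s = -1 the loss equals the usual hinge 1 - u on [-1, 1], vanishes for u >= 1, and is capped at 2 for u <= -1; a DC algorithm exploits this split by linearizing the subtracted convex term H_s at each iteration and solving the resulting convex subproblem.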