IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7253-7265. doi: 10.1109/TPAMI.2021.3092177. Epub 2022 Sep 14.
Support vector machines (SVM) have drawn wide attention for the last two decades due to its extensive applications, so a vast body of work has developed optimization algorithms to solve SVM with various soft-margin losses. To distinguish all, in this paper, we aim at solving an ideal soft-margin loss SVM: L soft-margin loss SVM (dubbed as L-SVM). Many of the existing (non)convex soft-margin losses can be viewed as one of the surrogates of the L soft-margin loss. Despite its discrete nature, we manage to establish the optimality theory for the L-SVM including the existence of the optimal solutions, the relationship between them and P-stationary points. These not only enable us to deliver a rigorous definition of L support vectors but also allow us to define a working set. Integrating such a working set, a fast alternating direction method of multipliers is then proposed with its limit point being a locally optimal solution to the L-SVM. Finally, numerical experiments demonstrate that our proposed method outperforms some leading classification solvers from SVM communities, in terms of faster computational speed and a fewer number of support vectors. The bigger the data size is, the more evident its advantage appears.
支持向量机(Support Vector Machine,简称 SVM)由于其广泛的应用,在过去的二十年中引起了广泛的关注,因此已经开发出了大量的优化算法来解决具有各种软间隔损失的 SVM。为了区分它们,在本文中,我们旨在解决理想的软间隔损失 SVM:L 软间隔损失 SVM(简称 L-SVM)。许多现有的(非)凸软间隔损失可以看作是 L 软间隔损失的一个替代。尽管它是离散的,但我们成功地为 L-SVM 建立了最优性理论,包括最优解的存在性、它们之间的关系以及 P-稳定点。这些不仅使我们能够对 L 支持向量进行严格的定义,还使我们能够定义一个工作集。然后,结合这样的工作集,提出了一种快速交替方向乘子法,其极限点是 L-SVM 的局部最优解。最后,数值实验表明,我们提出的方法在计算速度和支持向量数量方面优于一些来自 SVM 社区的领先分类求解器。数据量越大,其优势越明显。