Zhang Chong, Pham Minh, Fu Sheng, Liu Yufeng
Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada.
Statistical and Applied Mathematical Sciences Institute (SAMSI), Durham, NC, USA.
Math Program. 2018 May;169(1):277-305. Epub 2017 Nov 29.
The Support Vector Machine (SVM) is one of the most popular classification methods in the machine learning literature. Binary SVM methods have been extensively studied, and have achieved many successes in various disciplines. However, generalization to Multicategory SVM (MSVM) methods can be very challenging. Many existing methods estimate functions for classes with an explicit sum-to-zero constraint. It was shown recently that such a formulation can be suboptimal. Moreover, many existing MSVMs are not Fisher consistent, or do not take into account the effect of outliers. In this paper, we focus on classification in the angle-based framework, which is free of the explicit sum-to-zero constraint, hence more efficient, and propose two robust MSVM methods using truncated hinge loss functions. We show that our new classifiers can enjoy Fisher consistency, and simultaneously alleviate the impact of outliers to achieve more stable classification performance. To implement our proposed classifiers, we employ the difference convex algorithm (DCA) for efficient computation. Theoretical and numerical results obtained indicate that for problems with potential outliers, our robust angle-based MSVMs can be very competitive among existing methods.
支持向量机(SVM)是机器学习文献中最流行的分类方法之一。二元支持向量机方法已得到广泛研究,并在各个学科中取得了许多成功。然而,将其推广到多类支持向量机(MSVM)方法可能极具挑战性。许多现有方法通过明确的和为零约束来估计类别函数。最近的研究表明,这种公式可能不是最优的。此外,许多现有的多类支持向量机不满足Fisher一致性,或者没有考虑异常值的影响。在本文中,我们专注于基于角度的框架中的分类,该框架没有明确的和为零约束,因此效率更高,并提出了两种使用截断铰链损失函数的鲁棒多类支持向量机方法。我们表明,我们的新分类器可以实现Fisher一致性,同时减轻异常值的影响,以获得更稳定的分类性能。为了实现我们提出的分类器,我们采用差异凸算法(DCA)进行高效计算。理论和数值结果表明,对于存在潜在异常值的问题,我们基于角度的鲁棒多类支持向量机在现有方法中具有很强的竞争力。