Hsu Chih-Wei, Lin Chih-Jen
Dept. of Comput. Sci. and Inf. Eng., Nat. Taiwan Univ., Taipei.
IEEE Trans Neural Netw. 2002;13(2):415-25. doi: 10.1109/72.991427.
Support vector machines (SVMs) were originally designed for binary classification. How to effectively extend it for multiclass classification is still an ongoing research issue. Several methods have been proposed where typically we construct a multiclass classifier by combining several binary classifiers. Some authors also proposed methods that consider all classes at once. As it is computationally more expensive to solve multiclass problems, comparisons of these methods using large-scale problems have not been seriously conducted. Especially for methods solving multiclass SVM in one step, a much larger optimization problem is required so up to now experiments are limited to small data sets. In this paper we give decomposition implementations for two such "all-together" methods. We then compare their performance with three methods based on binary classifications: "one-against-all," "one-against-one," and directed acyclic graph SVM (DAGSVM). Our experiments indicate that the "one-against-one" and DAG methods are more suitable for practical use than the other methods. Results also show that for large problems methods by considering all data at once in general need fewer support vectors.
支持向量机(SVM)最初是为二分类设计的。如何有效地将其扩展用于多分类仍然是一个正在研究的问题。已经提出了几种方法,通常我们通过组合多个二分类器来构建一个多分类器。一些作者还提出了一次性考虑所有类别的方法。由于解决多分类问题的计算成本更高,因此尚未认真进行这些方法在大规模问题上的比较。特别是对于一步解决多分类SVM的方法,需要一个大得多的优化问题,所以到目前为止实验仅限于小数据集。在本文中,我们给出了两种此类“一次性处理”方法的分解实现。然后我们将它们的性能与基于二分类的三种方法进行比较:“一对多”、“一对一”和有向无环图支持向量机(DAGSVM)。我们的实验表明,“一对一”和DAG方法比其他方法更适合实际应用。结果还表明,对于大规模问题,一次性考虑所有数据的方法通常需要更少的支持向量。