Holden S B, Niranjan M
Cambridge University Engineering Department, England.
Neural Comput. 1995 Nov;7(6):1265-88. doi: 10.1162/neco.1995.7.6.1265.
This article addresses the question of whether some recent Vapnik-Chervonenkis (VC) dimension-based bounds on sample complexity can be regarded as a practical design tool. Specifically, we are interested in bounds on the sample complexity for the problem of training a pattern classifier such that we can expect it to perform valid generalization. Early results using the VC dimension, while being extremely powerful, suffered from the fact that their sample complexity predictions were rather impractical. More recent results have begun to improve the situation by attempting to take specific account of the precise algorithm used to train the classifier. We perform a series of experiments based on a task involving the classification of sets of vowel formant frequencies. The results of these experiments indicate that the more recent theories provide sample complexity predictions that are significantly more applicable in practice than those provided by earlier theories; however, we also find that the recent theories still have significant shortcomings.
本文探讨了一些最近基于Vapnik-Chervonenkis(VC)维数的样本复杂度界限是否可被视为一种实用设计工具的问题。具体而言,我们关注训练模式分类器问题的样本复杂度界限,以便我们能够期望它进行有效的泛化。早期使用VC维数的结果虽然极其强大,但存在样本复杂度预测相当不切实际的问题。最近的结果通过尝试具体考虑用于训练分类器的精确算法,开始改善这种情况。我们基于一项涉及元音共振峰频率集分类的任务进行了一系列实验。这些实验结果表明,与早期理论相比,最近的理论提供的样本复杂度预测在实践中更具适用性;然而,我们也发现最近的理论仍然存在显著缺点。