Verzi Stephen J, Heileman Gregory L, Georgiopoulos Michael
Computer Science Department University of New Mexico, Albuquerque, NM 87131, USA.
Neural Netw. 2006 May;19(4):446-68. doi: 10.1016/j.neunet.2005.08.013. Epub 2005 Dec 15.
In this paper, several modifications to the Fuzzy ARTMAP neural network architecture are proposed for conducting classification in complex, possibly noisy, environments. The goal of these modifications is to improve upon the generalization performance of Fuzzy ART-based neural networks, such as Fuzzy ARTMAP, in these situations. One of the major difficulties of employing Fuzzy ARTMAP on such learning problems involves over-fitting of the training data. Structural risk minimization is a machine-learning framework that addresses the issue of over-fitting by providing a backbone for analysis as well as an impetus for the design of better learning algorithms. The theory of structural risk minimization reveals a trade-off between training error and classifier complexity in reducing generalization error, which will be exploited in the learning algorithms proposed in this paper. Boosted ART extends Fuzzy ART by allowing the spatial extent of each cluster formed to be adjusted independently. Boosted ARTMAP generalizes upon Fuzzy ARTMAP by allowing non-zero training error in an effort to reduce the hypothesis complexity and hence improve overall generalization performance. Although Boosted ARTMAP is strictly speaking not a boosting algorithm, the changes it encompasses were motivated by the goals that one strives to achieve when employing boosting. Boosted ARTMAP is an on-line learner, it does not require excessive parameter tuning to operate, and it reduces precisely to Fuzzy ARTMAP for particular parameter values. Another architecture described in this paper is Structural Boosted ARTMAP, which uses both Boosted ART and Boosted ARTMAP to perform structural risk minimization learning. Structural Boosted ARTMAP will allow comparison of the capabilities of off-line versus on-line learning as well as empirical risk minimization versus structural risk minimization using Fuzzy ARTMAP-based neural network architectures. Both empirical and theoretical results are presented to enhance the understanding of these architectures.
本文提出了对模糊ARTMAP神经网络架构的若干修改,以便在复杂且可能存在噪声的环境中进行分类。这些修改的目标是在这些情况下提高基于模糊ART的神经网络(如模糊ARTMAP)的泛化性能。在这类学习问题中应用模糊ARTMAP的一个主要困难涉及训练数据的过拟合。结构风险最小化是一个机器学习框架,它通过提供分析的主干以及设计更好学习算法的动力来解决过拟合问题。结构风险最小化理论揭示了在降低泛化误差时训练误差与分类器复杂度之间的权衡,本文提出的学习算法将利用这一点。增强ART通过允许独立调整形成的每个聚类的空间范围来扩展模糊ART。增强ARTMAP在模糊ARTMAP的基础上进行了推广,允许非零训练误差,以降低假设复杂度,从而提高整体泛化性能。虽然严格来说增强ARTMAP不是一种增强算法,但它所包含的变化是受使用增强时力求实现的目标所驱动的。增强ARTMAP是一种在线学习器,它不需要过多的参数调整即可运行,并且对于特定参数值,它精确地简化为模糊ARTMAP。本文描述的另一种架构是结构增强ARTMAP,它使用增强ART和增强ARTMAP来执行结构风险最小化学习。结构增强ARTMAP将允许比较离线学习与在线学习的能力,以及使用基于模糊ARTMAP的神经网络架构进行经验风险最小化与结构风险最小化的能力。本文给出了实证和理论结果,以增进对这些架构的理解。