IEEE Trans Neural Netw Learn Syst. 2021 Nov;32(11):4901-4915. doi: 10.1109/TNNLS.2020.3026114. Epub 2021 Oct 27.
Conventional artificial neural network (ANN) learning algorithms for classification tasks, either derivative-based optimization algorithms or derivative-free optimization algorithms work by training ANN first (or training and validating ANN) and then testing ANN, which are a two-stage and one-pass learning mechanism. Thus, this learning mechanism may not guarantee the generalization ability of a trained ANN. In this article, a novel bilevel learning model is constructed for self-organizing feed-forward neural network (FFNN), in which the training and testing processes are integrated into a unified framework. In this bilevel model, the upper level optimization problem is built for testing error on testing data set and network architecture based on network complexity, whereas the lower level optimization problem is constructed for network weights based on training error on training data set. For the bilevel framework, an interactive learning algorithm is proposed to optimize the architecture and weights of an FFNN with consideration of both training error and testing error. In this interactive learning algorithm, a hybrid binary particle swarm optimization (BPSO) taken as an upper level optimizer is used to self-organize network architecture, whereas the Levenberg-Marquardt (LM) algorithm as a lower level optimizer is utilized to optimize the connection weights of an FFNN. The bilevel learning model and algorithm have been tested on 20 benchmark classification problems. Experimental results demonstrate that the bilevel learning algorithm can significantly produce more compact FFNNs with more excellent generalization ability when compared with conventional learning algorithms.
传统的人工神经网络 (ANN) 分类任务学习算法,无论是基于导数的优化算法还是无导数的优化算法,都是通过先训练 ANN(或训练和验证 ANN),然后再测试 ANN 来工作的,这是一种两阶段和单步学习机制。因此,这种学习机制可能无法保证训练好的 ANN 的泛化能力。本文为自组织前馈神经网络 (FFNN) 构建了一种新的双层学习模型,其中训练和测试过程集成到一个统一的框架中。在这个双层模型中,上层优化问题是基于测试数据集上的测试误差和基于网络复杂度的网络架构构建的,而下层优化问题是基于训练数据集上的训练误差构建的,用于网络权重。对于双层框架,提出了一种交互式学习算法,该算法在考虑训练误差和测试误差的情况下,对 FFNN 的结构和权重进行优化。在这个交互式学习算法中,使用混合二进制粒子群优化 (BPSO) 作为上层优化器来自组织网络架构,而 Levenberg-Marquardt (LM) 算法作为下层优化器用于优化 FFNN 的连接权重。双层学习模型和算法已经在 20 个基准分类问题上进行了测试。实验结果表明,与传统学习算法相比,双层学习算法可以显著产生更紧凑的 FFNN,并具有更好的泛化能力。