Xue Yu, Han Xiaolong, Neri Ferrante, Qin Jiafeng, Pelusi Danilo
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4345-4357. doi: 10.1109/TNNLS.2024.3371432. Epub 2025 Feb 28.
Neural architecture search (NAS) is a popular method that can automatically design deep neural network structures. However, designing a neural network using NAS is computationally expensive. This article proposes a gradient-guided evolutionary NAS (GENAS) to design convolutional neural networks (CNNs) for image classification. GENAS is a hybrid algorithm that combines evolutionary global and local search operators to evolve a population of subnets sampled from a supernet. Each candidate architecture is encoded as a table describing which operations are associated with the edges between nodes signifying feature maps. Besides, evolutionary optimization uses novel crossover and mutation operators to manipulate the subnets using the proposed tabular encoding. Every generations, the candidate architectures undergo a local search inspired by differentiable NAS. GENAS is designed to overcome the limitations of both evolutionary and gradient descent NAS. This algorithmic structure enables the performance assessment of the candidate architecture without retraining, thus limiting the NAS calculation time. Furthermore, subnet individuals are decoupled during evaluation to prevent strong coupling of operations in the supernet. The experimental results indicate that the searched structures achieve test errors of 2.45%, 16.86%, and 23.9% on CIFAR-10/100/ImageNet datasets and it costs only 0.26 GPU days on a graphic card. GENAS can effectively expedite the training and evaluation processes and obtain high-performance network structures.
神经架构搜索(NAS)是一种流行的方法,可自动设计深度神经网络结构。然而,使用NAS设计神经网络在计算上代价高昂。本文提出了一种梯度引导的进化NAS(GENAS),用于设计用于图像分类的卷积神经网络(CNN)。GENAS是一种混合算法,它结合了进化全局和局部搜索算子,以进化从超网中采样的子网种群。每个候选架构都被编码为一个表格,描述哪些操作与表示特征图的节点之间的边相关联。此外,进化优化使用新颖的交叉和变异算子,利用所提出的表格编码来操纵子网。每一代,候选架构都会经历受可微NAS启发的局部搜索。GENAS旨在克服进化NAS和梯度下降NAS的局限性。这种算法结构能够在不重新训练的情况下对候选架构进行性能评估,从而限制NAS计算时间。此外,在评估期间子网个体是解耦的,以防止超网中操作的强耦合。实验结果表明,搜索到的结构在CIFAR - 10/100/ImageNet数据集上实现了2.45%、16.86%和23.9%的测试误差,并且在一张图形卡上仅花费0.26 GPU天。GENAS可以有效地加快训练和评估过程,并获得高性能的网络结构。