Surrogate-Assisted Hybrid-Model Estimation of Distribution Algorithm for Mixed-Variable Hyperparameters Optimization in Convolutional Neural Networks.

Author information

Li Jian-Yu, Zhan Zhi-Hui, Xu Jin, Kwong Sam, Zhang Jun

Publication information

IEEE Trans Neural Netw Learn Syst. 2023 May;34(5):2338-2352. doi: 10.1109/TNNLS.2021.3106399. Epub 2023 May 2.

DOI:10.1109/TNNLS.2021.3106399

Abstract

The performance of a convolutional neural network (CNN) depends heavily on its hyperparameters. However, finding a suitable hyperparameter configuration is difficult and computationally expensive because of three issues: 1) the mixed-variable problem posed by different types of hyperparameters; 2) the large-scale search space; and 3) the expensive computational cost of evaluating a candidate hyperparameter configuration. This article addresses these three issues and proposes a novel estimation of distribution algorithm (EDA) for efficient hyperparameter optimization, with three major contributions in the algorithm design. First, a hybrid-model EDA is proposed to deal with the mixed-variable difficulty efficiently. It uses a mixed-variable encoding scheme to encode the mixed-variable hyperparameters and adopts an adaptive hybrid-model learning (AHL) strategy to optimize the mixed variables efficiently. Second, an orthogonal initialization (OI) strategy is proposed to handle the large-scale search space. Third, a surrogate-assisted multi-level evaluation (SME) method is proposed to reduce the expensive computational cost. The resulting algorithm is named surrogate-assisted hybrid-model EDA (SHEDA). In the experimental studies, SHEDA is verified on widely used classification benchmark problems and compared with various state-of-the-art methods. Moreover, a case study on aortic dissection (AD) diagnosis is carried out to evaluate its performance. Experimental results show that SHEDA is effective and efficient for hyperparameter optimization: it finds a satisfactory hyperparameter configuration for CIFAR10, CIFAR100, and AD diagnosis with only 0.58, 0.97, and 1.18 GPU days, respectively.
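
The abstract names the algorithm's components but gives no algorithmic detail, so the following is only a minimal sketch of the general idea behind a mixed-variable EDA, not SHEDA itself. It assumes one continuous hyperparameter modeled by a Gaussian and one categorical hyperparameter modeled by a probability vector; the toy objective, the function names, and all parameter values are hypothetical stand-ins. The AHL, OI, and SME strategies are not reproduced here.

```python
import numpy as np


def toy_objective(x_cont, x_cat):
    """Hypothetical stand-in for a validation error.

    Minimized at x_cont = 0.3 with category 2; it is NOT a real CNN evaluation.
    """
    return (x_cont - 0.3) ** 2 + (0.0 if x_cat == 2 else 0.5)


def mixed_variable_eda(n_gen=30, pop_size=40, elite_frac=0.25, n_choices=4, seed=0):
    """Generic EDA loop with one model per variable type (illustrative only)."""
    rng = np.random.default_rng(seed)

    # Continuous variable: Gaussian model. Categorical variable: probability vector.
    mu, sigma = 0.5, 0.3
    probs = np.full(n_choices, 1.0 / n_choices)

    n_elite = max(2, int(pop_size * elite_frac))
    best = (np.inf, None, None)  # (fitness, continuous value, category)

    for _ in range(n_gen):
        # Sample a population from the current hybrid model.
        x_cont = np.clip(rng.normal(mu, sigma, size=pop_size), 0.0, 1.0)
        x_cat = rng.choice(n_choices, size=pop_size, p=probs)

        # Evaluate every candidate and select the best-performing ones.
        fitness = np.array([toy_objective(c, k) for c, k in zip(x_cont, x_cat)])
        order = np.argsort(fitness)
        elite = order[:n_elite]

        top = order[0]
        if fitness[top] < best[0]:
            best = (float(fitness[top]), float(x_cont[top]), int(x_cat[top]))

        # Re-estimate the Gaussian from the elite continuous values.
        mu = float(x_cont[elite].mean())
        sigma = float(x_cont[elite].std()) + 1e-3  # floor keeps sampling alive

        # Re-estimate the categorical model from elite counts (add-one smoothing).
        counts = np.bincount(x_cat[elite], minlength=n_choices) + 1.0
        probs = counts / counts.sum()

    return best


if __name__ == "__main__":
    fitness, cont_value, category = mixed_variable_eda()
    print(f"best fitness={fitness:.5f}, continuous={cont_value:.3f}, category={category}")
```

Keeping a separate probabilistic model per variable type avoids forcing categorical choices into a continuous relaxation. In a real hyperparameter search each fitness call would train and validate a CNN, which is the cost that a surrogate-assisted evaluation such as SME is meant to reduce.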

Similar articles

An optimized deep learning architecture for breast cancer diagnosis based on improved marine predators algorithm.

Neural Comput Appl. 2022;34(20):18015-18033. doi: 10.1007/s00521-022-07445-5. Epub 2022 Jun 8.

MFBCNNC: Momentum factor biogeography convolutional neural network for COVID-19 detection via chest X-ray images.

Knowl Based Syst. 2021 Nov 28;232:107494. doi: 10.1016/j.knosys.2021.107494. Epub 2021 Sep 15.

Surrogate-Assisted Particle Swarm Optimization for Evolving Variable-Length Transferable Blocks for Image Classification.

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3727-3740. doi: 10.1109/TNNLS.2021.3054400. Epub 2022 Aug 3.

A Cell-Based Fast Memetic Algorithm for Automated Convolutional Neural Architecture Design.

IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):9040-9053. doi: 10.1109/TNNLS.2022.3155230. Epub 2023 Oct 27.

Adaptive habitat biogeography-based optimizer for optimizing deep CNN hyperparameters in image classification.

Heliyon. 2024 Mar 21;10(7):e28147. doi: 10.1016/j.heliyon.2024.e28147. eCollection 2024 Apr 15.

Optimizing Deep Learning Models with Improved BWO for TEC Prediction.

Biomimetics (Basel). 2024 Sep 22;9(9):575. doi: 10.3390/biomimetics9090575.

Improving classification accuracy of fine-tuned CNN models: Impact of hyperparameter optimization.

Heliyon. 2024 Feb 23;10(5):e26586. doi: 10.1016/j.heliyon.2024.e26586. eCollection 2024 Mar 15.

Robust optimization of convolutional neural networks with a uniform experiment design method: a case of phonocardiogram testing in patients with heart diseases.

BMC Bioinformatics. 2021 Nov 8;22(Suppl 5):92. doi: 10.1186/s12859-021-04032-8.

Brain Tumor Detection and Classification Using Deep Learning and Sine-Cosine Fitness Grey Wolf Optimization.

Bioengineering (Basel). 2022 Dec 22;10(1):18. doi: 10.3390/bioengineering10010018.

Cited by

Efficient Design of Broadband and Low-Profile Multilayer Absorbing Materials on Cobalt-Iron Magnetic Alloy Doped with Rare Earth Element.

Nanomaterials (Basel). 2024 Jun 27;14(13):1107. doi: 10.3390/nano14131107.

Efficiently handling constraints in mixed-integer nonlinear programming problems using gradient-based repair differential evolution.

PeerJ Comput Sci. 2024 May 31;10:e2095. doi: 10.7717/peerj-cs.2095. eCollection 2024.
