Application of Meta-Heuristic Algorithms for Training Neural Networks and Deep Learning Architectures: A Comprehensive Review.

Author Information

Mehrdad Kaveh, Mohammad Saadi Mesgari

Affiliation

Department of Geodesy and Geomatics, K. N. Toosi University of Technology, Tehran 19967-15433, Iran.

Publication Information

Neural Process Lett. 2022 Oct 31:1-104. doi: 10.1007/s11063-022-11055-6.

Abstract

The learning process and hyper-parameter optimization of artificial neural networks (ANNs) and deep learning (DL) architectures are considered among the most challenging machine learning problems. Many past studies have used gradient-based back-propagation methods to train DL architectures. However, gradient-based methods have major drawbacks: they can become stuck in local minima of multi-objective cost functions, they incur expensive execution times because gradient information must be computed over thousands of iterations, and they require the cost function to be continuous. Since training ANNs and DLs is an NP-hard optimization problem, optimizing their structure and parameters with meta-heuristic (MH) algorithms has attracted considerable attention. MH algorithms can accurately formulate the optimal estimation of DL components (such as hyper-parameters, weights, number of layers, number of neurons, learning rate, etc.). This paper provides a comprehensive review of the optimization of ANNs and DLs using MH algorithms. We review the latest developments in the use of MH algorithms in DL and ANN methods, present their advantages and disadvantages, and point out research directions to fill the gaps between MH and DL methods. Moreover, we explain that evolutionary hybrid architectures still have limited applicability in the literature. This paper also classifies the latest MH algorithms in the literature to demonstrate their effectiveness in DL and ANN training across various applications. Most researchers tend to develop novel hybrid algorithms by combining MHs to optimize the hyper-parameters of DLs and ANNs. The development of hybrid MHs helps improve algorithm performance and makes them capable of solving complex optimization problems. In general, an optimally performing MH should achieve a suitable trade-off between its exploration and exploitation features. Hence, this paper summarizes various MH algorithms in terms of convergence trend, exploration, exploitation, and the ability to avoid local minima. The integration of MHs with DLs is expected to accelerate the training process in the coming few years; however, relevant publications along this line are still rare.
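
To ground the training paradigm the review surveys, the following is a minimal, self-contained sketch of one popular MH, particle swarm optimization (PSO), training the weights of a tiny neural network with no gradient information. The task (XOR), the network size, and all PSO coefficients (inertia W, cognitive C1, social C2, swarm size, iteration count) are illustrative assumptions, not settings taken from the paper; the velocity update makes explicit the exploration-exploitation trade-off the abstract highlights.

```python
# Illustrative sketch: PSO as a gradient-free trainer for a tiny MLP on XOR.
# All constants below are assumptions for illustration, not from the paper.
import numpy as np

rng = np.random.default_rng(0)

# XOR data: a small non-convex toy problem for gradient-free training.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0.0, 1.0, 1.0, 0.0])

N_HIDDEN = 4
# Flat parameter layout: W1 (2x4), b1 (4), W2 (4), b2 (1) -> 17 parameters.
DIM = 2 * N_HIDDEN + N_HIDDEN + N_HIDDEN + 1

def forward(params, x):
    """Decode a flat parameter vector into the MLP and run it."""
    w1 = params[:2 * N_HIDDEN].reshape(2, N_HIDDEN)
    b1 = params[2 * N_HIDDEN:3 * N_HIDDEN]
    w2 = params[3 * N_HIDDEN:4 * N_HIDDEN]
    b2 = params[-1]
    h = np.tanh(x @ w1 + b1)                       # hidden layer
    return 1.0 / (1.0 + np.exp(-(h @ w2 + b2)))    # sigmoid output

def loss(params):
    """Mean squared error over the four XOR patterns (the cost function)."""
    return float(np.mean((forward(params, X) - y) ** 2))

# --- PSO: each particle is one candidate weight vector -------------------
N_PARTICLES, N_ITERS = 30, 300
W, C1, C2 = 0.7, 1.5, 1.5      # inertia, cognitive, social coefficients

pos = rng.uniform(-1, 1, (N_PARTICLES, DIM))
vel = np.zeros_like(pos)
pbest = pos.copy()
pbest_val = np.array([loss(p) for p in pos])
gbest = pbest[pbest_val.argmin()].copy()

for _ in range(N_ITERS):
    r1, r2 = rng.random((2, N_PARTICLES, DIM))
    # The inertia term keeps particles moving (exploration), while the
    # pulls toward personal and global bests refine known good regions
    # (exploitation) -- the trade-off the abstract emphasizes.
    vel = W * vel + C1 * r1 * (pbest - pos) + C2 * r2 * (gbest - pos)
    pos = pos + vel
    for i in range(N_PARTICLES):
        v = loss(pos[i])
        if v < pbest_val[i]:
            pbest_val[i], pbest[i] = v, pos[i].copy()
    gbest = pbest[pbest_val.argmin()].copy()

print("best MSE:", pbest_val.min())
print("predictions:", np.round(forward(gbest, X), 3))
```

With these illustrative settings the swarm typically drives the mean squared error close to zero within a few hundred iterations, despite never computing a gradient; the same decode-and-evaluate pattern extends to the hyper-parameters (layer counts, neuron counts, learning rates) the review discusses.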

Fig. 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5307/9628382/75042e997715/11063_2022_11055_Fig1_HTML.jpg
