• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的多策略自学习粒子群优化算法

Multi-strategy self-learning particle swarm optimization algorithm based on reinforcement learning.

作者信息

Meng Xiaoding, Li Hecheng, Chen Anshan

机构信息

School of Computer Science and Technology, Qinghai Normal University, Xining 810008, China.

School of Mathematics and Statistics, Qinghai Normal University, Xining 810008, China.

出版信息

Math Biosci Eng. 2023 Mar 3;20(5):8498-8530. doi: 10.3934/mbe.2023373.

DOI:10.3934/mbe.2023373
PMID:37161209
Abstract

The trade-off between exploitation and exploration is a dilemma inherent to particle swarm optimization (PSO) algorithms. Therefore, a growing body of PSO variants is devoted to solving the balance between the two. Among them, the method of self-adaptive multi-strategy selection plays a crucial role in improving the performance of PSO algorithms but has yet to be well exploited. In this research, with the aid of the reinforcement learning technique to guide the generation of offspring, a novel self-adaptive multi-strategy selection mechanism is designed, and then a multi-strategy self-learning PSO algorithm based on reinforcement learning (MPSORL) is proposed. First, the fitness value of particles is regarded as a set of states that are divided into several state subsets non-uniformly. Second, the ε-greedy strategy is employed to select the optimal strategy for each particle. The personal best particle and the global best particle are then updated after executing the strategy. Subsequently, the next state is determined. Thus, the value of the Q-table, as a scheme adopted in self-learning, is reshaped by the reward value, the action and the state in a non-stationary environment. Finally, the proposed algorithm is compared with other state-of-the-art algorithms on two well-known benchmark suites and a real-world problem. Extensive experiments indicate that MPSORL has better performance in terms of accuracy, convergence speed and non-parametric tests in most cases. The multi-strategy selection mechanism presented in the manuscript is effective.

摘要

探索与利用之间的权衡是粒子群优化(PSO)算法固有的两难问题。因此,越来越多的PSO变体致力于解决两者之间的平衡。其中,自适应多策略选择方法在提高PSO算法性能方面起着关键作用,但尚未得到充分利用。在本研究中,借助强化学习技术来指导后代的生成,设计了一种新颖的自适应多策略选择机制,然后提出了一种基于强化学习的多策略自学习PSO算法(MPSORL)。首先,将粒子的适应度值视为一组状态,这些状态被非均匀地划分为几个状态子集。其次,采用ε-贪婪策略为每个粒子选择最优策略。在执行策略后更新个体最优粒子和全局最优粒子。随后,确定下一个状态。因此,作为自学习中采用的一种方案,Q表的值在非平稳环境中由奖励值、动作和状态重新塑造。最后,将所提出的算法与其他最先进的算法在两个著名的基准测试集和一个实际问题上进行比较。大量实验表明,在大多数情况下,MPSORL在准确性、收敛速度和非参数测试方面具有更好的性能。手稿中提出的多策略选择机制是有效的。

相似文献

1
Multi-strategy self-learning particle swarm optimization algorithm based on reinforcement learning.基于强化学习的多策略自学习粒子群优化算法
Math Biosci Eng. 2023 Mar 3;20(5):8498-8530. doi: 10.3934/mbe.2023373.
2
A novel particle swarm optimization based on hybrid-learning model.一种基于混合学习模型的新型粒子群优化算法。
Math Biosci Eng. 2023 Feb 9;20(4):7056-7087. doi: 10.3934/mbe.2023305.
3
A multi-sample particle swarm optimization algorithm based on electric field force.基于电场力的多样本粒子群优化算法。
Math Biosci Eng. 2021 Aug 31;18(6):7464-7489. doi: 10.3934/mbe.2021369.
4
A Multi-Strategy Adaptive Comprehensive Learning PSO Algorithm and Its Application.一种多策略自适应综合学习粒子群优化算法及其应用
Entropy (Basel). 2022 Jun 28;24(7):890. doi: 10.3390/e24070890.
5
Multi-objective particle swarm optimization with reverse multi-leaders.具有反向多领导者的多目标粒子群优化算法
Math Biosci Eng. 2023 May 9;20(7):11732-11762. doi: 10.3934/mbe.2023522.
6
Lévy flight-based inverse adaptive comprehensive learning particle swarm optimization.基于 Lévy 飞行的逆自适应综合学习粒子群优化算法。
Math Biosci Eng. 2022 Mar 23;19(5):5241-5268. doi: 10.3934/mbe.2022246.
7
Particle Swarm Optimization With Interswarm Interactive Learning Strategy.粒子群优化的群间交互学习策略。
IEEE Trans Cybern. 2016 Oct;46(10):2238-2251. doi: 10.1109/TCYB.2015.2474153. Epub 2015 Sep 9.
8
An improved predator-prey particle swarm optimization algorithm for Nash equilibrium solution.改进的纳什均衡求解捕食者-猎物粒子群优化算法。
PLoS One. 2021 Nov 24;16(11):e0260231. doi: 10.1371/journal.pone.0260231. eCollection 2021.
9
Adaptive Multiobjective Particle Swarm Optimization Based on Evolutionary State Estimation.基于进化状态估计的自适应多目标粒子群优化。
IEEE Trans Cybern. 2021 Jul;51(7):3738-3751. doi: 10.1109/TCYB.2019.2949204. Epub 2021 Jun 23.
10
Adaptive particle swarm optimization.自适应粒子群优化算法
IEEE Trans Syst Man Cybern B Cybern. 2009 Dec;39(6):1362-81. doi: 10.1109/TSMCB.2009.2015956. Epub 2009 Apr 7.