A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation.

Affiliation

School of Mechatronic Engineering and Automation, Shanghai University, 99 Shangda Road, Shanghai 200444, China.

Publication information

Neural Netw. 2024 Aug;176:106359. doi: 10.1016/j.neunet.2024.106359. Epub 2024 May 3.

DOI: 10.1016/j.neunet.2024.106359
PMID: 38733797

Abstract

As a special type of multi-objective combinatorial optimization problem (MOCOP), the multi-objective traveling salesman problem (MOTSP) plays an important role in practical fields such as transportation and robot control. However, because its solution space is complex and its objectives conflict, it is difficult to obtain satisfactory solutions in a short time. This paper proposes an end-to-end algorithm framework for solving MOTSP based on deep reinforcement learning (DRL). Using a decomposition strategy, solving MOTSP is transformed into solving multiple single-objective optimization subproblems. Through a linear transformation, the features of the MOTSP are combined with the weights of the objective function. A modified graph pointer network (GPN) model is then used to solve the decomposed subproblems. Compared with previous DRL models, the proposed algorithm can solve all the subproblems using only one model, without adding weight information as input features. Furthermore, the algorithm outputs a corresponding solution for each weight, which increases the diversity of solutions. To verify its performance, the proposed algorithm is compared with four classical evolutionary algorithms and two DRL algorithms on several MOTSP instances. The comparison shows that the proposed algorithm outperforms the compared algorithms in both training time and the quality of the resulting solutions.
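The decomposition step described in the abstract can be illustrated with a minimal sketch, assuming a bi-objective instance and a weighted-sum scalarization. This is not the authors' implementation: the abstract does not specify the exact linear transformation or the GPN architecture, so the sketch applies the weights to per-objective cost matrices and uses a greedy nearest-neighbor heuristic as a stand-in for the learned solver. All function names (`scalarize`, `nearest_neighbor_tour`, `solve_motsp`) and the random instance are hypothetical.

```python
import math
import random


def distance_matrix(points):
    """Euclidean distance matrix for a list of (x, y) points."""
    return [[math.dist(p, q) for q in points] for p in points]


def scalarize(matrices, weights):
    """Weighted-sum decomposition: merge per-objective costs into one matrix.

    Assumption: the weights act on cost matrices. The paper combines weights
    with input features through a linear transformation whose exact form is
    not given in the abstract.
    """
    n = len(matrices[0])
    return [
        [sum(w * m[i][j] for w, m in zip(weights, matrices)) for j in range(n)]
        for i in range(n)
    ]


def nearest_neighbor_tour(cost, start=0):
    """Greedy stand-in for the learned GPN solver (not the paper's model)."""
    tour = [start]
    unvisited = set(range(len(cost))) - {start}
    while unvisited:
        last = tour[-1]
        # Break ties by city index so the result is deterministic.
        nxt = min(unvisited, key=lambda j: (cost[last][j], j))
        tour.append(nxt)
        unvisited.remove(nxt)
    return tour


def tour_length(cost, tour):
    """Length of the closed tour under the given cost matrix."""
    return sum(cost[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))


def solve_motsp(matrices, num_weights=5):
    """Solve one single-objective subproblem per weight vector."""
    solutions = []
    for k in range(num_weights):
        w1 = k / (num_weights - 1)
        weights = (w1, 1.0 - w1)
        tour = nearest_neighbor_tour(scalarize(matrices, weights))
        objectives = tuple(tour_length(m, tour) for m in matrices)
        solutions.append((weights, tour, objectives))
    return solutions


# Hypothetical instance: each city has one coordinate pair per objective.
rng = random.Random(0)
n_cities = 8
coords_a = [(rng.random(), rng.random()) for _ in range(n_cities)]
coords_b = [(rng.random(), rng.random()) for _ in range(n_cities)]
matrices = [distance_matrix(coords_a), distance_matrix(coords_b)]

solutions = solve_motsp(matrices)
for weights, tour, objectives in solutions:
    print(weights, tour, [round(v, 3) for v in objectives])
```

Each weight vector yields its own tour and its own pair of objective values, which mirrors the abstract's point that one solution is produced per weight. In the proposed framework a single trained model would replace the greedy heuristic for all subproblems.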

Similar articles

1. A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation.
Neural Netw. 2024 Aug;176:106359. doi: 10.1016/j.neunet.2024.106359. Epub 2024 May 3.
2. Deep Reinforcement Learning for Multiobjective Optimization.
IEEE Trans Cybern. 2021 Jun;51(6):3103-3114. doi: 10.1109/TCYB.2020.2977661. Epub 2021 May 18.
3. Multiobjective Combinatorial Optimization Using a Single Deep Reinforcement Learning Model.
IEEE Trans Cybern. 2024 Mar;54(3):1984-1996. doi: 10.1109/TCYB.2023.3312476. Epub 2024 Feb 9.
4. Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.
Neural Netw. 2024 Mar;171:61-72. doi: 10.1016/j.neunet.2023.11.063. Epub 2023 Dec 1.
5. Hybrid pointer networks for traveling salesman problems optimization.
PLoS One. 2021 Dec 14;16(12):e0260995. doi: 10.1371/journal.pone.0260995. eCollection 2021.
6. Solving Traveling Salesman Problems Based on Artificial Cooperative Search Algorithm.
Comput Intell Neurosci. 2022 Apr 12;2022:1008617. doi: 10.1155/2022/1008617. eCollection 2022.
7. Memory-efficient Transformer-based network model for Traveling Salesman Problem.
Neural Netw. 2023 Apr;161:589-597. doi: 10.1016/j.neunet.2023.02.014. Epub 2023 Feb 16.
8. Set-Based Discrete Particle Swarm Optimization Based on Decomposition for Permutation-Based Multiobjective Combinatorial Optimization Problems.
IEEE Trans Cybern. 2018 Jul;48(7):2139-2153. doi: 10.1109/TCYB.2017.2728120. Epub 2017 Aug 7.
9. Research on improved ant colony optimization for traveling salesman problem.
Math Biosci Eng. 2022 Jun 6;19(8):8152-8186. doi: 10.3934/mbe.2022381.
10. An accelerated end-to-end method for solving routing problems.
Neural Netw. 2023 Jul;164:535-545. doi: 10.1016/j.neunet.2023.05.003. Epub 2023 May 10.