基于特征变换的求解多目标旅行商问题的深度强化学习算法框架。

A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation.

机构信息

School of Mechatronic Engineering and Automation, Shanghai University, 99 Shangda Road, Shanghai 200444, China.

出版信息

Neural Netw. 2024 Aug;176:106359. doi: 10.1016/j.neunet.2024.106359. Epub 2024 May 3.

DOI:10.1016/j.neunet.2024.106359

Abstract

As a special type of multi-objective combinatorial optimization problems (MOCOPs), the multi-objective traveling salesman problem (MOTSP) plays an important role in practical fields such as transportation and robot control. However, due to the complexity of its solution space and the conflicts between different objectives, it is difficult to obtain satisfactory solutions in a short time. This paper proposes an end-to-end algorithm framework for solving MOTSP based on deep reinforcement learning (DRL). By decomposing strategies, solving MOTSP is transformed into solving multiple single-objective optimization subproblems. Through linear transformation, the features of the MOTSP are combined with the weights of the objective function. Subsequently, a modified graph pointer network (GPN) model is used to solve the decomposed subproblems. Compared with the previous DRL model, the proposed algorithm can solve all the subproblems using only one model without adding weight information as input features. Furthermore, our algorithm can output a corresponding solution for each weight, which increases the diversity of solutions. In order to verify the performance of our proposed algorithm, it is compared with four classical evolutionary algorithms and two DRL algorithms on several MOTSP instances. The comparison shows that our proposed algorithm outperforms the compared algorithms both in terms of training time and the quality of the resulting solutions.

摘要

作为一类特殊的多目标组合优化问题（MOCOPs），多目标旅行商问题（MOTSP）在交通和机器人控制等实际领域中发挥着重要作用。然而，由于其解空间的复杂性以及不同目标之间的冲突，很难在短时间内获得满意的解决方案。本文提出了一种基于深度强化学习（DRL）的 MOTSP 端到端算法框架。通过策略分解，将求解 MOTSP 转化为求解多个单目标优化子问题。通过线性变换，将 MOTSP 的特征与目标函数的权重结合起来。然后，使用改进的图指针网络（GPN）模型来求解分解后的子问题。与之前的 DRL 模型相比，所提出的算法可以仅使用一个模型来解决所有的子问题，而无需添加权重信息作为输入特征。此外，我们的算法可以为每个权重输出相应的解决方案，增加了解决方案的多样性。为了验证所提出算法的性能，将其与四种经典的进化算法和两种 DRL 算法在几个 MOTSP 实例上进行了比较。比较结果表明，在所提出的算法在训练时间和得到的解决方案的质量方面都优于比较算法。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于特征变换的求解多目标旅行商问题的深度强化学习算法框架。

A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation.

机构信息

出版信息

相似文献

基于特征变换的求解多目标旅行商问题的深度强化学习算法框架。

A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation.

机构信息

出版信息

相似文献