• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于解决异构容量车辆路径问题的深度强化学习

Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem.

作者信息

Li Jingwen, Ma Yining, Gao Ruize, Cao Zhiguang, Lim Andrew, Song Wen, Zhang Jie

出版信息

IEEE Trans Cybern. 2022 Dec;52(12):13572-13585. doi: 10.1109/TCYB.2021.3111082. Epub 2022 Nov 18.

DOI:10.1109/TCYB.2021.3111082
PMID:34554923
Abstract

Existing deep reinforcement learning (DRL)-based methods for solving the capacitated vehicle routing problem (CVRP) intrinsically cope with a homogeneous vehicle fleet, in which the fleet is assumed as repetitions of a single vehicle. Hence, their key to construct a solution solely lies in the selection of the next node (customer) to visit excluding the selection of vehicle. However, vehicles in real-world scenarios are likely to be heterogeneous with different characteristics that affect their capacity (or travel speed), rendering existing DRL methods less effective. In this article, we tackle heterogeneous CVRP (HCVRP), where vehicles are mainly characterized by different capacities. We consider both min-max and min-sum objectives for HCVRP, which aim to minimize the longest or total travel time of the vehicle(s) in the fleet. To solve those problems, we propose a DRL method based on the attention mechanism with a vehicle selection decoder accounting for the heterogeneous fleet constraint and a node selection decoder accounting for the route construction, which learns to construct a solution by automatically selecting both a vehicle and a node for this vehicle at each step. Experimental results based on randomly generated instances show that, with desirable generalization to various problem sizes, our method outperforms the state-of-the-art DRL method and most of the conventional heuristics, and also delivers competitive performance against the state-of-the-art heuristic method, that is, slack induction by string removal. In addition, the results of extended experiments demonstrate that our method is also able to solve CVRPLib instances with satisfactory performance.

摘要

现有的基于深度强化学习(DRL)解决容量受限车辆路径问题(CVRP)的方法本质上处理的是同构车辆车队,即车队被假定为单一车辆的重复。因此,它们构建解决方案的关键仅在于选择下一个要访问的节点(客户),而不包括车辆的选择。然而,现实场景中的车辆可能具有不同特性从而呈现异构性,这些特性会影响其容量(或行驶速度),使得现有的深度强化学习方法效果不佳。在本文中,我们处理异构CVRP(HCVRP),其中车辆主要以不同容量为特征。我们考虑了HCVRP的最小最大和最小和目标,其旨在最小化车队中车辆的最长或总行驶时间。为了解决这些问题,我们提出了一种基于注意力机制的深度强化学习方法,该方法具有一个考虑异构车队约束的车辆选择解码器和一个考虑路径构建的节点选择解码器,它通过在每一步自动为车辆选择车辆和节点来学习构建解决方案。基于随机生成实例的实验结果表明,我们的方法对各种问题规模具有良好的泛化能力,优于现有最先进的深度强化学习方法和大多数传统启发式算法,并且与现有最先进的启发式方法(即通过字符串删除进行松弛归纳)相比也具有竞争力。此外,扩展实验结果表明,我们的方法也能够以令人满意的性能解决CVRPLib实例。

相似文献

1
Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem.用于解决异构容量车辆路径问题的深度强化学习
IEEE Trans Cybern. 2022 Dec;52(12):13572-13585. doi: 10.1109/TCYB.2021.3111082. Epub 2022 Nov 18.
2
Learning Improvement Heuristics for Solving Routing Problems.用于解决路由问题的学习改进启发式算法
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):5057-5069. doi: 10.1109/TNNLS.2021.3068828. Epub 2022 Aug 31.
3
Deep Reinforcement Learning for Solving Vehicle Routing Problems With Backhauls.
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4779-4793. doi: 10.1109/TNNLS.2024.3371781. Epub 2025 Feb 28.
4
Hybrid modified ant system with sweep algorithm and path relinking for the capacitated vehicle routing problem.结合扫描算法和路径重连的混合改进蚁群系统求解容量受限车辆路径问题
Heliyon. 2021 Sep 21;7(9):e08029. doi: 10.1016/j.heliyon.2021.e08029. eCollection 2021 Sep.
5
Genetic Programming Hyper-Heuristics with Vehicle Collaboration for Uncertain Capacitated Arc Routing Problems.遗传编程超启发式算法与车辆协作求解不确定容量弧路由问题。
Evol Comput. 2020 Winter;28(4):563-593. doi: 10.1162/evco_a_00267. Epub 2019 Nov 15.
6
A modified coronavirus herd immunity optimizer for capacitated vehicle routing problem.一种用于容量受限车辆路径问题的改进型冠状病毒群体免疫优化器。
J King Saud Univ Comput Inf Sci. 2022 Sep;34(8):4782-4795. doi: 10.1016/j.jksuci.2021.06.013. Epub 2021 Jun 24.
7
Learning Feature Embedding Refiner for Solving Vehicle Routing Problems.用于解决车辆路径问题的学习特征嵌入优化器
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):15279-15291. doi: 10.1109/TNNLS.2023.3285077. Epub 2024 Oct 29.
8
Heuristics for Two Depot Heterogeneous Unmanned Vehicle Path Planning to Minimize Maximum Travel Cost.用于双配送中心异构无人车路径规划以最小化最大行程成本的启发式算法
Sensors (Basel). 2019 May 29;19(11):2461. doi: 10.3390/s19112461.
9
Reinforcement Learning With Multiple Relational Attention for Solving Vehicle Routing Problems.
IEEE Trans Cybern. 2022 Oct;52(10):11107-11120. doi: 10.1109/TCYB.2021.3089179. Epub 2022 Sep 19.
10
A discrete wild horse optimizer for capacitated vehicle routing problem.一种用于容量车辆路径问题的离散野马优化器。
Sci Rep. 2024 Sep 11;14(1):21277. doi: 10.1038/s41598-024-72242-0.

引用本文的文献

1
Enhancing Last-Mile Logistics: AI-Driven Fleet Optimization, Mixed Reality, and Large Language Model Assistants for Warehouse Operations.加强最后一英里物流:人工智能驱动的车队优化、混合现实以及用于仓库运营的大语言模型助手。
Sensors (Basel). 2025 Apr 24;25(9):2696. doi: 10.3390/s25092696.
2
Deep reinforcement learning based low energy consumption scheduling approach design for urban electric logistics vehicle networks.基于深度强化学习的城市电动物流车辆网络低能耗调度方法设计
Sci Rep. 2025 Mar 15;15(1):9003. doi: 10.1038/s41598-025-92916-7.
3
Generating large-scale real-world vehicle routing dataset with novel spatial data extraction tool.
利用新型空间数据提取工具生成大规模真实世界车辆路径数据集。
PLoS One. 2024 Jun 21;19(6):e0304422. doi: 10.1371/journal.pone.0304422. eCollection 2024.
4
A deep reinforcement learning algorithm for the rectangular strip packing problem.一种用于矩形带材打包问题的深度强化学习算法。
PLoS One. 2023 Mar 16;18(3):e0282598. doi: 10.1371/journal.pone.0282598. eCollection 2023.