• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用深度强化学习的混合指针网络解决接送问题。

Solving pickup and drop-off problem using hybrid pointer networks with deep reinforcement learning.

机构信息

Department of Mathematics, College of Science and Arts, Qassim University, Al Mithnab, Saudi Arabia.

Department of Electrical Engineering, School of Engineering and Technology, Badr University in Cairo (BUC), Cairo, Egypt.

出版信息

PLoS One. 2022 May 26;17(5):e0267199. doi: 10.1371/journal.pone.0267199. eCollection 2022.

DOI:10.1371/journal.pone.0267199
PMID:35617306
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9135236/
Abstract

In this study, we propose a general method for tackling the Pickup and Drop-off Problem (PDP) using Hybrid Pointer Networks (HPNs) and Deep Reinforcement Learning (DRL). Our aim is to reduce the overall tour length traveled by an agent while remaining within the truck's capacity restrictions and adhering to the node-to-node relationship. In such instances, the agent does not allow any drop-off points to be serviced if the truck is empty; conversely, if the vehicle is full, the agent does not allow any products to be picked up from pickup points. In our approach, this challenge is solved using machine learning-based models. Using HPNs as our primary model allows us to gain insight and tackle more complicated node interactions, which simplified our objective to obtaining state-of-art outcomes. Our experimental results demonstrate the effectiveness of the proposed neural network, as we achieve the state-of-art results for this problem as compared with the existing models. We deal with two types of demand patterns in a single type commodity problem. In the first pattern, all demands are assumed to sum up to zero (i.e., we have an equal number of backup and drop-off items). In the second pattern, we have an unequal number of backup and drop-off items, which is close to practical application, such as bike sharing system rebalancing. Our data, models, and code are publicly available at Solving Pickup and Dropoff Problem Using Hybrid Pointer Networks with Deep Reinforcement Learning.

摘要

在这项研究中,我们提出了一种使用混合指针网络(HPN)和深度强化学习(DRL)解决提货和交货问题(PDP)的通用方法。我们的目标是在保持卡车容量限制和遵守节点到节点关系的同时,减少代理行驶的总行程。在这种情况下,如果卡车为空,代理不允许为任何下车点提供服务;相反,如果车辆已满,代理不允许从提货点提货。在我们的方法中,使用基于机器学习的模型来解决这个挑战。使用 HPN 作为我们的主要模型,使我们能够深入了解和解决更复杂的节点交互,从而简化了我们的目标,以获得最先进的结果。我们的实验结果证明了所提出的神经网络的有效性,因为与现有模型相比,我们在这个问题上取得了最先进的结果。我们在单个商品问题中处理了两种类型的需求模式。在第一种模式中,假设所有需求的总和为零(即,我们有相同数量的备份和下车项目)。在第二种模式下,我们有备份和下车项目的数量不相等,这更接近实际应用,例如自行车共享系统的再平衡。我们的数据、模型和代码在使用深度强化学习的混合指针网络解决提货和交货问题中公开可用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/89785f6edb73/pone.0267199.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/7d3596f91b96/pone.0267199.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/dbd8ebf6cb90/pone.0267199.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/9955f931dcb1/pone.0267199.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/81b12033fef5/pone.0267199.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/6e1dcdc3ee61/pone.0267199.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/89785f6edb73/pone.0267199.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/7d3596f91b96/pone.0267199.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/dbd8ebf6cb90/pone.0267199.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/9955f931dcb1/pone.0267199.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/81b12033fef5/pone.0267199.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/6e1dcdc3ee61/pone.0267199.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c5/9135236/89785f6edb73/pone.0267199.g006.jpg

相似文献

1
Solving pickup and drop-off problem using hybrid pointer networks with deep reinforcement learning.使用深度强化学习的混合指针网络解决接送问题。
PLoS One. 2022 May 26;17(5):e0267199. doi: 10.1371/journal.pone.0267199. eCollection 2022.
2
Hybrid pointer networks for traveling salesman problems optimization.混合指针网络在旅行商问题优化中的应用。
PLoS One. 2021 Dec 14;16(12):e0260995. doi: 10.1371/journal.pone.0260995. eCollection 2021.
3
IHG-MA: Inductive heterogeneous graph multi-agent reinforcement learning for multi-intersection traffic signal control.IHG-MA:用于多交叉口交通信号控制的归纳异质图多智能体强化学习。
Neural Netw. 2021 Jul;139:265-277. doi: 10.1016/j.neunet.2021.03.015. Epub 2021 Mar 22.
4
Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem.用于解决异构容量车辆路径问题的深度强化学习
IEEE Trans Cybern. 2022 Dec;52(12):13572-13585. doi: 10.1109/TCYB.2021.3111082. Epub 2022 Nov 18.
5
Distributed forecasting and ant colony optimization for the bike-sharing rebalancing problem with unserved demands.分布式预测与蚁群优化在共享单车再平衡问题中的应用研究——考虑未满足需求
PLoS One. 2019 Dec 31;14(12):e0226204. doi: 10.1371/journal.pone.0226204. eCollection 2019.
6
An accelerated end-to-end method for solving routing problems.一种加速的端到端路由问题求解方法。
Neural Netw. 2023 Jul;164:535-545. doi: 10.1016/j.neunet.2023.05.003. Epub 2023 May 10.
7
Bike-Sharing Demand Prediction at Community Level under COVID-19 Using Deep Learning.基于深度学习的 COVID-19 下社区层面共享单车需求预测。
Sensors (Basel). 2022 Jan 29;22(3):1060. doi: 10.3390/s22031060.
8
Deep Reinforcement Learning for Multiobjective Optimization.用于多目标优化的深度强化学习
IEEE Trans Cybern. 2021 Jun;51(6):3103-3114. doi: 10.1109/TCYB.2020.2977661. Epub 2021 May 18.
9
Deep Reinforcement Learning for Solving Vehicle Routing Problems With Backhauls.
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4779-4793. doi: 10.1109/TNNLS.2024.3371781. Epub 2025 Feb 28.
10
A novel and effective method for solving the router nodes placement in wireless mesh networks using reinforcement learning.一种使用强化学习解决无线 mesh 网络中路由器节点放置问题的新颖有效方法。
PLoS One. 2024 Apr 10;19(4):e0301073. doi: 10.1371/journal.pone.0301073. eCollection 2024.

本文引用的文献

1
Hybrid pointer networks for traveling salesman problems optimization.混合指针网络在旅行商问题优化中的应用。
PLoS One. 2021 Dec 14;16(12):e0260995. doi: 10.1371/journal.pone.0260995. eCollection 2021.
2
Learning Improvement Heuristics for Solving Routing Problems.用于解决路由问题的学习改进启发式算法
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):5057-5069. doi: 10.1109/TNNLS.2021.3068828. Epub 2022 Aug 31.
3
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.