Department of Mathematics, College of Science and Arts, Qassim University, Al Mithnab, Saudi Arabia.
Department of Electrical Engineering, School of Engineering and Technology, Badr University in Cairo (BUC), Cairo, Egypt.
PLoS One. 2022 May 26;17(5):e0267199. doi: 10.1371/journal.pone.0267199. eCollection 2022.
In this study, we propose a general method for tackling the Pickup and Drop-off Problem (PDP) using Hybrid Pointer Networks (HPNs) and Deep Reinforcement Learning (DRL). Our aim is to reduce the overall tour length traveled by an agent while remaining within the truck's capacity restrictions and adhering to the node-to-node relationship. In such instances, the agent does not allow any drop-off points to be serviced if the truck is empty; conversely, if the vehicle is full, the agent does not allow any products to be picked up from pickup points. In our approach, this challenge is solved using machine learning-based models. Using HPNs as our primary model allows us to gain insight and tackle more complicated node interactions, which simplified our objective to obtaining state-of-art outcomes. Our experimental results demonstrate the effectiveness of the proposed neural network, as we achieve the state-of-art results for this problem as compared with the existing models. We deal with two types of demand patterns in a single type commodity problem. In the first pattern, all demands are assumed to sum up to zero (i.e., we have an equal number of backup and drop-off items). In the second pattern, we have an unequal number of backup and drop-off items, which is close to practical application, such as bike sharing system rebalancing. Our data, models, and code are publicly available at Solving Pickup and Dropoff Problem Using Hybrid Pointer Networks with Deep Reinforcement Learning.
在这项研究中,我们提出了一种使用混合指针网络(HPN)和深度强化学习(DRL)解决提货和交货问题(PDP)的通用方法。我们的目标是在保持卡车容量限制和遵守节点到节点关系的同时,减少代理行驶的总行程。在这种情况下,如果卡车为空,代理不允许为任何下车点提供服务;相反,如果车辆已满,代理不允许从提货点提货。在我们的方法中,使用基于机器学习的模型来解决这个挑战。使用 HPN 作为我们的主要模型,使我们能够深入了解和解决更复杂的节点交互,从而简化了我们的目标,以获得最先进的结果。我们的实验结果证明了所提出的神经网络的有效性,因为与现有模型相比,我们在这个问题上取得了最先进的结果。我们在单个商品问题中处理了两种类型的需求模式。在第一种模式中,假设所有需求的总和为零(即,我们有相同数量的备份和下车项目)。在第二种模式下,我们有备份和下车项目的数量不相等,这更接近实际应用,例如自行车共享系统的再平衡。我们的数据、模型和代码在使用深度强化学习的混合指针网络解决提货和交货问题中公开可用。