An Integrated Reinforcement Learning and Centralized Programming Approach for Online Taxi Dispatching

Authors

Liang Enming, Wen Kexin, Lam William H K, Sumalee Agachai, Zhong Renxin

Publication information

IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4742-4756. doi: 10.1109/TNNLS.2021.3060187. Epub 2022 Aug 31.

DOI:10.1109/TNNLS.2021.3060187

Abstract

Balancing the supply and demand for ride-sourcing companies is a challenging issue, especially with real-time requests and stochastic traffic conditions of large-scale congested road networks. To tackle this challenge, this article proposes a robust and scalable approach that integrates reinforcement learning (RL) and a centralized programming (CP) structure to promote real-time taxi operations. Both real-time order matching decisions and vehicle relocation decisions at the microscopic network scale are integrated within a Markov decision process framework. The RL component learns the decomposed state-value function, which represents the taxi drivers' experience, the off-line historical demand pattern, and the traffic network congestion. The CP component plans nonmyopic decisions for drivers collectively under the prescribed system constraints to explicitly realize cooperation. Furthermore, to circumvent sparse reward and sample imbalance problems over the microscopic road network, this article proposes a temporal-difference learning algorithm with prioritized gradient descent and adaptive exploration techniques. A simulator is built and trained with the Manhattan road network and New York City yellow taxi data to simulate the real-time vehicle dispatching environment. Both centralized and decentralized taxi dispatching policies are examined with the simulator. This case study shows that the proposed approach can further improve taxi drivers' profits while reducing customers' waiting times compared to several existing vehicle dispatching algorithms.
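The interplay between a learned state-value function and a centralized matching step can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract gives no formulas, so the tabular TD(0) update, the scoring rule (fare plus discounted destination value, minus the driver's current-state value and a pickup cost), the brute-force search standing in for the CP solver, and all names (`td_update`, `match_score`, `dispatch`, `GAMMA`, `ALPHA`, the `(zone, time_step)` state encoding) are assumptions made for illustration. Prioritized gradient descent and adaptive exploration are not modeled.

```python
from itertools import combinations, permutations

GAMMA = 0.9  # discount factor (assumed value)
ALPHA = 0.5  # learning rate (assumed value)


def td_update(values, state, reward, next_state):
    """One tabular TD(0) step: move V(state) toward reward + GAMMA * V(next_state).

    `values` maps hypothetical (zone, time_step) states to value estimates;
    unseen states default to 0.0. Returns the TD error.
    """
    target = reward + GAMMA * values.get(next_state, 0.0)
    error = target - values.get(state, 0.0)
    values[state] = values.get(state, 0.0) + ALPHA * error
    return error


def match_score(values, driver_state, order, pickup_cost):
    """Nonmyopic score of giving `order` to a driver in `driver_state`.

    Immediate fare plus the discounted value of the drop-off state, minus the
    value the driver gives up by leaving the current state and the pickup cost.
    """
    return (order["fare"]
            + GAMMA * values.get(order["dest_state"], 0.0)
            - values.get(driver_state, 0.0)
            - pickup_cost)


def dispatch(values, driver_states, orders, pickup):
    """Pick the one-to-one driver/order matching with the highest total score.

    `pickup[d][o]` is the assumed cost for driver d to reach order o.
    Exhaustive search, usable only for tiny inputs; a real system would hand
    the same objective to a mathematical-programming solver.
    """
    n_d, n_o = len(driver_states), len(orders)
    k = min(n_d, n_o)
    best_total, best_plan = float("-inf"), []
    for d_idx in combinations(range(n_d), k):
        for o_idx in permutations(range(n_o), k):
            plan = list(zip(d_idx, o_idx))
            total = sum(
                match_score(values, driver_states[d], orders[o], pickup[d][o])
                for d, o in plan
            )
            if total > best_total:
                best_total, best_plan = total, plan
    return best_plan, best_total


# Learning step: a driver in zone A at step 0 earns 10 and ends in zone B at step 1.
learned = {}
td_error = td_update(learned, ("A", 0), 10.0, ("B", 1))

# Planning step with hand-set values so the arithmetic is easy to follow.
values = {("A", 0): 5.0, ("C", 0): 1.0, ("B", 1): 2.0, ("A", 1): 4.0}
driver_states = [("A", 0), ("C", 0)]
orders = [
    {"fare": 8.0, "dest_state": ("B", 1)},
    {"fare": 6.0, "dest_state": ("A", 1)},
]
pickup = [[1.0, 4.0], [3.0, 1.0]]
plan, total = dispatch(values, driver_states, orders, pickup)
print(plan, round(total, 2))
```

The pickup cost is what makes the matching non-trivial here: without a term that depends on the driver/order pair, every complete matching would have the same total score under this scoring rule.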


