• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MAGT-toll:一种用于动态交通拥堵收费的多智能体强化学习方法。

MAGT-toll: A multi-agent reinforcement learning approach to dynamic traffic congestion pricing.

机构信息

School of Business Administration, Southwestern University of Finance and Economics, Chengdu, China.

School of Computing and Artificial Intelligence, Southwestern University of Finance and Economics, Chengdu, China.

出版信息

PLoS One. 2024 Nov 18;19(11):e0313828. doi: 10.1371/journal.pone.0313828. eCollection 2024.

DOI:10.1371/journal.pone.0313828
PMID:39556544
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11573182/
Abstract

Modern urban centers have one of the most critical challenges of congestion. Traditional electronic toll collection systems attempt to mitigate this issue through pre-defined static congestion pricing methods; however, they are inadequate in addressing the dynamic fluctuations in traffic demand. Dynamic congestion pricing has been identified as a promising approach, yet its implementation is hindered by the computational complexity involved in optimizing long-term objectives and the necessity for coordination across the traffic network. To address these challenges, we propose a novel dynamic traffic congestion pricing model utilizing multi-agent reinforcement learning with a transformer architecture. This architecture capitalizes on its encoder-decoder structure to transform the multi-agent reinforcement learning problem into a sequence modeling task. Drawing on insights from research on graph transformers, our model incorporates agent structures and positional encoding to enhance adaptability to traffic flow dynamics and network coordination. We have developed a microsimulation-based environment to implement a discrete toll-rate congestion pricing scheme on actual urban roads. Our extensive experimental results across diverse traffic demand scenarios demonstrate substantial improvements in congestion metrics and reductions in travel time, thereby effectively alleviating traffic congestion.

摘要

现代城市中心面临着最为严峻的拥堵挑战之一。传统的电子收费系统试图通过预先定义的静态拥堵定价方法来缓解这一问题,但在应对交通需求的动态波动方面并不理想。动态拥堵定价已被视为一种有前途的方法,但由于长期目标优化涉及到的计算复杂性以及交通网络协调的必要性,其实施受到了阻碍。为了解决这些挑战,我们提出了一种利用多智能体强化学习和变压器架构的新型动态交通拥堵定价模型。该架构利用其编码器-解码器结构将多智能体强化学习问题转化为序列建模任务。借鉴图转换器研究的见解,我们的模型采用了代理结构和位置编码,以增强对交通流动态和网络协调的适应性。我们已经开发了一个基于微观模拟的环境,以在实际的城市道路上实施离散的通行费拥堵定价方案。我们在不同的交通需求场景下进行了广泛的实验,结果表明,在拥堵指标和旅行时间方面都有了显著的改善,从而有效地缓解了交通拥堵。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/a754a46b5e53/pone.0313828.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/dc6cf44aa08f/pone.0313828.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/1097827df589/pone.0313828.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/2274c4aaa825/pone.0313828.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/64a7ff5a35ad/pone.0313828.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/81a9b21fc1d5/pone.0313828.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/25b1b26bb6fc/pone.0313828.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/d5ee008bed0d/pone.0313828.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/a754a46b5e53/pone.0313828.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/dc6cf44aa08f/pone.0313828.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/1097827df589/pone.0313828.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/2274c4aaa825/pone.0313828.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/64a7ff5a35ad/pone.0313828.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/81a9b21fc1d5/pone.0313828.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/25b1b26bb6fc/pone.0313828.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/d5ee008bed0d/pone.0313828.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c22/11573182/a754a46b5e53/pone.0313828.g008.jpg

相似文献

1
MAGT-toll: A multi-agent reinforcement learning approach to dynamic traffic congestion pricing.MAGT-toll:一种用于动态交通拥堵收费的多智能体强化学习方法。
PLoS One. 2024 Nov 18;19(11):e0313828. doi: 10.1371/journal.pone.0313828. eCollection 2024.
2
Sequence Decision Transformer for Adaptive Traffic Signal Control.用于自适应交通信号控制的序列决策变换器
Sensors (Basel). 2024 Sep 25;24(19):6202. doi: 10.3390/s24196202.
3
Pricing schemes of simultaneous optimization of congestion and air pollution: a case study in Iran.拥堵与空气污染同步优化的定价方案:以伊朗为例的研究
Environ Sci Pollut Res Int. 2023 Mar;30(12):33567-33586. doi: 10.1007/s11356-022-24424-4. Epub 2022 Dec 9.
4
Impact of Accurate Detection of Freeway Traffic Conditions on the Dynamic Pricing: A Case Study of I-95 Express Lanes.高速公路交通状况准确检测对动态定价的影响:以 I-95 快速车道为例。
Sensors (Basel). 2021 Sep 7;21(18):5997. doi: 10.3390/s21185997.
5
Trends and Insights from Transportation Congestion Pricing Policy Research: A Bibliometric Analysis.交通拥堵定价政策研究的趋势和洞察:文献计量分析。
Int J Environ Res Public Health. 2022 Jun 11;19(12):7189. doi: 10.3390/ijerph19127189.
6
Optimal dynamic pricing for public transportation considering consumer social learning.考虑消费者社会学习的公共交通最优动态定价。
PLoS One. 2024 Jan 31;19(1):e0296263. doi: 10.1371/journal.pone.0296263. eCollection 2024.
7
A Temporal Deep Q Learning for Optimal Load Balancing in Software-Defined Networks.用于软件定义网络中最优负载均衡的时态深度Q学习
Sensors (Basel). 2024 Feb 14;24(4):1216. doi: 10.3390/s24041216.
8
Deep Autoencoder Neural Networks for Short-Term Traffic Congestion Prediction of Transportation Networks.用于交通网络短期交通拥堵预测的深度自动编码器神经网络
Sensors (Basel). 2019 May 14;19(10):2229. doi: 10.3390/s19102229.
9
Design of a heuristic environment-friendly road pricing scheme for traffic emission control under uncertainty.启发式环保道路定价方案设计,以控制不确定性下的交通排放。
J Environ Manage. 2019 Apr 15;236:455-465. doi: 10.1016/j.jenvman.2018.11.042. Epub 2019 Jan 11.
10
Cordon screen: A cordon-based congestion pricing policy evaluation method for U.S. cities.警戒线筛选:一种针对美国城市基于警戒线的拥堵收费政策评估方法。
J Air Waste Manag Assoc. 2023 Jan;73(1):25-39. doi: 10.1080/10962247.2022.2100510. Epub 2022 Aug 19.

本文引用的文献

1
Effects analysis of reward functions on reinforcement learning for traffic signal control.奖励函数对交通信号控制强化学习的影响分析。
PLoS One. 2022 Nov 21;17(11):e0277813. doi: 10.1371/journal.pone.0277813. eCollection 2022.
2
Freeway ramp metering based on PSO-PID control.基于粒子群优化-比例积分微分(PSO-PID)控制的高速公路匝道控制。
PLoS One. 2021 Dec 9;16(12):e0260977. doi: 10.1371/journal.pone.0260977. eCollection 2021.
3
Deep Q-network-based traffic signal control models.基于深度 Q 网络的交通信号控制模型。
PLoS One. 2021 Sep 2;16(9):e0256405. doi: 10.1371/journal.pone.0256405. eCollection 2021.
4
Multiagent cooperation and competition with deep reinforcement learning.基于深度强化学习的多智能体合作与竞争
PLoS One. 2017 Apr 5;12(4):e0172395. doi: 10.1371/journal.pone.0172395. eCollection 2017.