• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多智能体深度强化学习算法的交通信号灯控制方法。

A traffic light control method based on multi-agent deep reinforcement learning algorithm.

机构信息

College of Data Science and Application, Inner Mongolia University of Technology, Inner Mongolia Autonomous Region Engineering and Technology Research Center of Big Data Based Software Service, Huhhot, 10080, Inner Monglia, China.

出版信息

Sci Rep. 2023 Jun 9;13(1):9396. doi: 10.1038/s41598-023-36606-2.

DOI:10.1038/s41598-023-36606-2
PMID:37296308
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10256792/
Abstract

Intelligent traffic light control (ITLC) algorithms are very efficient for relieving traffic congestion. Recently, many decentralized multi-agent traffic light control algorithms are proposed. These researches mainly focus on improving reinforcement learning method and coordination method. But, as all the agents need to communicate while coordinating with each other, the communication details should be improved as well. To guarantee communication effectiveness, two aspect should be considered. Firstly, a traffic condition description method need to be designed. By using this method, traffic condition can be described simply and clearly. Secondly, synchronization should be considered. As different intersections have different cycle lengths and message sending event happens at the end of each traffic signal cycle, every agent will receive messages of other agents at different time. So it is hard for an agent to decide which message is the latest one and the most valuable. Apart from communication details, reinforcement learning algorithm used for traffic signal timing should also be improved. In the traditional reinforcement learning based ITLC algorithms, either queue length of congested cars or waiting time of these cars is considered while calculating reward value. But, both of them are very important. So a new reward calculation method is needed. To solve all these problems, in this paper, a new ITLC algorithm is proposed. To improve communication efficiency, this algorithm adopts a new message sending and processing method. Besides, to measure traffic congestion in a more reasonable way, a new reward calculation method is proposed and used. This method takes both waiting time and queue length into consideration.

摘要

智能交通灯控制 (ITLC) 算法对于缓解交通拥堵非常有效。最近,提出了许多分散式多智能体交通灯控制算法。这些研究主要集中在改进强化学习方法和协调方法上。但是,由于所有智能体在相互协调时都需要进行通信,因此也需要改进通信细节。为了保证通信的有效性,需要考虑两个方面。首先,需要设计一种交通状况描述方法。通过使用这种方法,可以简单明了地描述交通状况。其次,需要考虑同步。由于不同的交叉路口具有不同的周期长度,并且消息发送事件发生在每个交通信号周期的末尾,因此每个智能体将在不同的时间接收其他智能体的消息。因此,智能体很难确定哪条消息是最新的、最有价值的。除了通信细节外,用于交通信号定时的强化学习算法也需要改进。在传统的基于强化学习的 ITLC 算法中,在计算奖励值时,要么考虑拥堵车辆的队列长度,要么考虑这些车辆的等待时间。但是,两者都非常重要。因此,需要一种新的奖励计算方法。为了解决所有这些问题,本文提出了一种新的 ITLC 算法。为了提高通信效率,该算法采用了一种新的消息发送和处理方法。此外,为了更合理地衡量交通拥堵程度,提出并使用了一种新的奖励计算方法。该方法同时考虑了等待时间和队列长度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/8c2ce79782bc/41598_2023_36606_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/ecacceffc1a2/41598_2023_36606_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/9e4d7101b703/41598_2023_36606_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/6b8db35c63fb/41598_2023_36606_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/a2e7eba7acfb/41598_2023_36606_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/a7171eac2889/41598_2023_36606_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/8c2ce79782bc/41598_2023_36606_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/ecacceffc1a2/41598_2023_36606_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/9e4d7101b703/41598_2023_36606_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/6b8db35c63fb/41598_2023_36606_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/a2e7eba7acfb/41598_2023_36606_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/a7171eac2889/41598_2023_36606_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01eb/10256792/8c2ce79782bc/41598_2023_36606_Fig6_HTML.jpg

相似文献

1
A traffic light control method based on multi-agent deep reinforcement learning algorithm.基于多智能体深度强化学习算法的交通信号灯控制方法。
Sci Rep. 2023 Jun 9;13(1):9396. doi: 10.1038/s41598-023-36606-2.
2
IHG-MA: Inductive heterogeneous graph multi-agent reinforcement learning for multi-intersection traffic signal control.IHG-MA:用于多交叉口交通信号控制的归纳异质图多智能体强化学习。
Neural Netw. 2021 Jul;139:265-277. doi: 10.1016/j.neunet.2021.03.015. Epub 2021 Mar 22.
3
Multi-Objective reinforcement learning approach for improving safety at intersections with adaptive traffic signal control.多目标强化学习方法在自适应交通信号控制中提高交叉口安全性。
Accid Anal Prev. 2020 Sep;144:105655. doi: 10.1016/j.aap.2020.105655. Epub 2020 Jul 14.
4
Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.基于双目标框架的多机器人编队分布式深度强化学习
Neural Netw. 2024 Mar;171:61-72. doi: 10.1016/j.neunet.2023.11.063. Epub 2023 Dec 1.
5
An Edge Based Multi-Agent Auto Communication Method for Traffic Light Control.一种基于边缘的多智能体交通灯控制自动通信方法。
Sensors (Basel). 2020 Jul 31;20(15):4291. doi: 10.3390/s20154291.
6
Self-learning adaptive traffic signal control for real-time safety optimization.自学习自适应交通信号控制实时安全优化。
Accid Anal Prev. 2020 Oct;146:105713. doi: 10.1016/j.aap.2020.105713. Epub 2020 Aug 18.
7
A deep reinforcement learning algorithm for the rectangular strip packing problem.一种用于矩形带材打包问题的深度强化学习算法。
PLoS One. 2023 Mar 16;18(3):e0282598. doi: 10.1371/journal.pone.0282598. eCollection 2023.
8
LJIR: Learning Joint-Action Intrinsic Reward in cooperative multi-agent reinforcement learning.LJIR:在合作多智能体强化学习中学习联合行动内在奖励
Neural Netw. 2023 Oct;167:450-459. doi: 10.1016/j.neunet.2023.08.016. Epub 2023 Aug 22.
9
Cooperative Traffic Signal Control with Traffic Flow Prediction in Multi-Intersection.多交叉口协同交通信号控制与交通流预测。
Sensors (Basel). 2019 Dec 24;20(1):137. doi: 10.3390/s20010137.
10
Adaptive urban traffic signal control based on enhanced deep reinforcement learning.基于增强深度强化学习的自适应城市交通信号控制
Sci Rep. 2024 Jun 19;14(1):14116. doi: 10.1038/s41598-024-64885-w.

引用本文的文献

1
A Dynamic Traffic Light Control Algorithm to Mitigate Traffic Congestion in Metropolitan Areas.一种缓解大城市地区交通拥堵的动态交通信号灯控制算法。
Sensors (Basel). 2024 Jun 19;24(12):3987. doi: 10.3390/s24123987.

本文引用的文献

1
Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning.基于新型多智能体强化学习的大规模交通信号控制
IEEE Trans Cybern. 2021 Jan;51(1):174-187. doi: 10.1109/TCYB.2020.3015811. Epub 2020 Dec 22.
2
Cooperative Deep Reinforcement Learning for Large-Scale Traffic Grid Signal Control.用于大规模交通网格信号控制的协作深度强化学习
IEEE Trans Cybern. 2020 Jun;50(6):2687-2700. doi: 10.1109/TCYB.2019.2904742. Epub 2019 Mar 29.