• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于强化学习的车辆编队控制策略,用于减少交通振荡中的能量消耗。

A Reinforcement Learning-Based Vehicle Platoon Control Strategy for Reducing Energy Consumption in Traffic Oscillations.

作者信息

Li Meng, Cao Zehong, Li Zhibin

出版信息

IEEE Trans Neural Netw Learn Syst. 2021 Dec;32(12):5309-5322. doi: 10.1109/TNNLS.2021.3071959. Epub 2021 Nov 30.

DOI:10.1109/TNNLS.2021.3071959
PMID:33882007
Abstract

The vehicle platoon will be the most dominant driving mode on future roads. To the best of our knowledge, few reinforcement learning (RL) algorithms have been applied in vehicle platoon control, which has large-scale action and state spaces. Some RL-based methods were applied to solve single-agent problems. If we need to tackle multiagent problems, we will use multiagent RL algorithms since the parameters space grows exponentially with the increasing number of agents involved. Previous multiagent RL algorithms generally may provide redundant information to agents, indicating a large amount of useless or unrelated information, which may cause to be difficult for convergence training and pattern extractions from shared information. Also, random actions usually contribute to crashes, especially at the beginning of training. In this study, a communication proximal policy optimization (CommPPO) algorithm was proposed to tackle the above issues. In specific, the CommPPO model adopts a parameter-sharing structure to allow the dynamic variation of agent numbers, which can well handle various platoon dynamics, including splitting and merging. The communication protocol of the CommPPO consists of two parts. In the state part, the widely used predecessor-leader follower typology in the platoon is adopted to transmit global and local state information to agents. In the reward part, a new reward communication channel is proposed to solve the spurious reward and "lazy agent" problems in some existing multiagent RLs. Moreover, a curriculum learning approach is adopted to reduce crashes and speed up training. To validate the proposed strategy for platoon control, two existing multiagent RLs and a traditional platoon control strategy were applied in the same scenarios for comparison. Results showed that the CommPPO algorithm gained more rewards and achieved the largest fuel consumption reduction (11.6%).

摘要

车辆编队将成为未来道路上最主要的驾驶模式。据我们所知,很少有强化学习(RL)算法应用于车辆编队控制,因为其具有大规模的动作和状态空间。一些基于RL的方法被用于解决单智能体问题。如果我们需要处理多智能体问题,我们将使用多智能体RL算法,因为参数空间会随着所涉及智能体数量的增加而呈指数增长。以前的多智能体RL算法通常可能会向智能体提供冗余信息,即大量无用或不相关的信息,这可能导致收敛训练困难以及从共享信息中提取模式变得困难。此外,随机动作通常会导致碰撞,尤其是在训练开始时。在本研究中,提出了一种通信近端策略优化(CommPPO)算法来解决上述问题。具体而言,CommPPO模型采用参数共享结构以允许智能体数量动态变化,这可以很好地处理各种编队动态,包括分裂和合并。CommPPO的通信协议由两部分组成。在状态部分,采用编队中广泛使用的前驱 - 领导者 - 跟随者类型学将全局和局部状态信息传输给智能体。在奖励部分,提出了一种新的奖励通信通道来解决一些现有多智能体RL中的虚假奖励和“懒惰智能体”问题。此外,采用课程学习方法来减少碰撞并加快训练。为了验证所提出的编队控制策略,将两种现有的多智能体RL和一种传统的编队控制策略应用于相同场景进行比较。结果表明,CommPPO算法获得了更多奖励,并实现了最大的燃油消耗降低(11.6%)。

相似文献

1
A Reinforcement Learning-Based Vehicle Platoon Control Strategy for Reducing Energy Consumption in Traffic Oscillations.一种基于强化学习的车辆编队控制策略,用于减少交通振荡中的能量消耗。
IEEE Trans Neural Netw Learn Syst. 2021 Dec;32(12):5309-5322. doi: 10.1109/TNNLS.2021.3071959. Epub 2021 Nov 30.
2
Reinforcement Learning-Based Approach for Minimizing Energy Loss of Driving Platoon Decisions.基于强化学习的驾驶车队决策最小化能量损失方法。
Sensors (Basel). 2023 Apr 21;23(8):4176. doi: 10.3390/s23084176.
3
Comparative Study of Cooperative Platoon Merging Control Based on Reinforcement Learning.基于强化学习的协同编队合并控制的对比研究。
Sensors (Basel). 2023 Jan 15;23(2):990. doi: 10.3390/s23020990.
4
Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning.基于新型多智能体强化学习的大规模交通信号控制
IEEE Trans Cybern. 2021 Jan;51(1):174-187. doi: 10.1109/TCYB.2020.3015811. Epub 2020 Dec 22.
5
Decentralized Platoon Join-in-Middle Protocol Considering Communication Delay for Connected and Automated Vehicle.考虑通信延迟的车联网中自动驾驶车辆的分布式编队中途加入协议
Sensors (Basel). 2021 Oct 27;21(21):7126. doi: 10.3390/s21217126.
6
Self-learning adaptive traffic signal control for real-time safety optimization.自学习自适应交通信号控制实时安全优化。
Accid Anal Prev. 2020 Oct;146:105713. doi: 10.1016/j.aap.2020.105713. Epub 2020 Aug 18.
7
Reinforcement Learning With Task Decomposition for Cooperative Multiagent Systems.用于协作多智能体系统的基于任务分解的强化学习
IEEE Trans Neural Netw Learn Syst. 2021 May;32(5):2054-2065. doi: 10.1109/TNNLS.2020.2996209. Epub 2021 May 3.
8
Quantifying effects of reverse linear perspective as a visual cue on vehicle and platoon crash risk variations in car-following using path analysis.运用路径分析量化逆向线性透视作为视觉线索对跟驰车辆和车队碰撞风险变化的影响。
Accid Anal Prev. 2021 Sep;159:106215. doi: 10.1016/j.aap.2021.106215. Epub 2021 Jun 12.
9
Modeling and Performance of the IEEE 802.11p Broadcasting for Intra-Platoon Communication.IEEE 802.11p 广播在车间通信中的建模与性能研究。
Sensors (Basel). 2018 Sep 6;18(9):2971. doi: 10.3390/s18092971.
10
Safe Adaptive Policy Transfer Reinforcement Learning for Distributed Multiagent Control.用于分布式多智能体控制的安全自适应策略转移强化学习
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1939-1946. doi: 10.1109/TNNLS.2023.3326867. Epub 2025 Jan 7.

引用本文的文献

1
Enhancing Mixed Traffic Flow with Platoon Control and Lane Management for Connected and Autonomous Vehicles.通过车联网和自动驾驶车辆的队列控制与车道管理增强混合交通流
Sensors (Basel). 2025 Jan 22;25(3):644. doi: 10.3390/s25030644.
2
Developing an eco-driving strategy in a hybrid traffic network using reinforcement learning.利用强化学习在混合交通网络中制定生态驾驶策略。
Sci Prog. 2024 Jul-Sep;107(3):368504241263406. doi: 10.1177/00368504241263406.
3
Graph Reinforcement Learning-Based Decision-Making Technology for Connected and Autonomous Vehicles: Framework, Review, and Future Trends.
基于图强化学习的联网和自动驾驶车辆决策技术:框架、综述与未来趋势
Sensors (Basel). 2023 Oct 3;23(19):8229. doi: 10.3390/s23198229.
4
A Comprehensive Survey on Multi-Agent Reinforcement Learning for Connected and Automated Vehicles.面向车联网和自动驾驶的多智能体强化学习综述
Sensors (Basel). 2023 May 12;23(10):4710. doi: 10.3390/s23104710.