• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图强化学习的联网和自动驾驶车辆决策技术:框架、综述与未来趋势

Graph Reinforcement Learning-Based Decision-Making Technology for Connected and Autonomous Vehicles: Framework, Review, and Future Trends.

作者信息

Liu Qi, Li Xueyuan, Tang Yujie, Gao Xin, Yang Fan, Li Zirui

机构信息

School of Mechanical Engineering, Beijing Institute of Technology, Beijing 100811, China.

Faculty of Computer Science, Dalhousie University, Halifax, NS B3H 4R2, Canada.

出版信息

Sensors (Basel). 2023 Oct 3;23(19):8229. doi: 10.3390/s23198229.

DOI:10.3390/s23198229
PMID:37837063
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10575438/
Abstract

The proper functioning of connected and autonomous vehicles (CAVs) is crucial for the safety and efficiency of future intelligent transport systems. Meanwhile, transitioning to fully autonomous driving requires a long period of mixed autonomy traffic, including both CAVs and human-driven vehicles. Thus, collaborative decision-making technology for CAVs is essential to generate appropriate driving behaviors to enhance the safety and efficiency of mixed autonomy traffic. In recent years, deep reinforcement learning (DRL) methods have become an efficient way in solving decision-making problems. However, with the development of computing technology, graph reinforcement learning (GRL) methods have gradually demonstrated the large potential to further improve the decision-making performance of CAVs, especially in the area of accurately representing the mutual effects of vehicles and modeling dynamic traffic environments. To facilitate the development of GRL-based methods for autonomous driving, this paper proposes a review of GRL-based methods for the decision-making technologies of CAVs. Firstly, a generic GRL framework is proposed in the beginning to gain an overall understanding of the decision-making technology. Then, the GRL-based decision-making technologies are reviewed from the perspective of the construction methods of mixed autonomy traffic, methods for graph representation of the driving environment, and related works about graph neural networks (GNN) and DRL in the field of decision-making for autonomous driving. Moreover, validation methods are summarized to provide an efficient way to verify the performance of decision-making methods. Finally, challenges and future research directions of GRL-based decision-making methods are summarized.

摘要

联网和自动驾驶车辆(CAV)的正常运行对于未来智能交通系统的安全和效率至关重要。与此同时,向完全自动驾驶的过渡需要一段很长的混合自主交通时期,包括CAV和人类驾驶车辆。因此,CAV的协同决策技术对于产生适当的驾驶行为以提高混合自主交通的安全性和效率至关重要。近年来,深度强化学习(DRL)方法已成为解决决策问题的一种有效方式。然而,随着计算技术的发展,图强化学习(GRL)方法已逐渐显示出进一步提高CAV决策性能的巨大潜力,特别是在准确表示车辆相互影响和对动态交通环境建模方面。为了促进基于GRL的自动驾驶方法的发展,本文对基于GRL的CAV决策技术进行了综述。首先,在开头提出了一个通用的GRL框架,以全面了解决策技术。然后,从混合自主交通的构建方法、驾驶环境的图表示方法以及自动驾驶决策领域中关于图神经网络(GNN)和DRL的相关工作等角度对基于GRL的决策技术进行了综述。此外,总结了验证方法,以提供一种验证决策方法性能的有效途径。最后,总结了基于GRL的决策方法面临的挑战和未来研究方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/d1efbb5c1a8e/sensors-23-08229-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/190d88a3f4a6/sensors-23-08229-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/6762cfd7ebba/sensors-23-08229-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/fc45422ad615/sensors-23-08229-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/d3cfd5227eae/sensors-23-08229-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/166903fc3035/sensors-23-08229-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/c2e87bb5d834/sensors-23-08229-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/a11f081dc6b9/sensors-23-08229-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/d09023f4aaff/sensors-23-08229-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/6ad1e605c240/sensors-23-08229-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/e3686114a6e1/sensors-23-08229-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/0b935e5add66/sensors-23-08229-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/131445b4fee7/sensors-23-08229-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/cd47ed7fb176/sensors-23-08229-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/d1efbb5c1a8e/sensors-23-08229-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/190d88a3f4a6/sensors-23-08229-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/6762cfd7ebba/sensors-23-08229-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/fc45422ad615/sensors-23-08229-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/d3cfd5227eae/sensors-23-08229-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/166903fc3035/sensors-23-08229-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/c2e87bb5d834/sensors-23-08229-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/a11f081dc6b9/sensors-23-08229-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/d09023f4aaff/sensors-23-08229-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/6ad1e605c240/sensors-23-08229-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/e3686114a6e1/sensors-23-08229-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/0b935e5add66/sensors-23-08229-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/131445b4fee7/sensors-23-08229-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/cd47ed7fb176/sensors-23-08229-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/10575438/d1efbb5c1a8e/sensors-23-08229-g014.jpg

相似文献

1
Graph Reinforcement Learning-Based Decision-Making Technology for Connected and Autonomous Vehicles: Framework, Review, and Future Trends.基于图强化学习的联网和自动驾驶车辆决策技术:框架、综述与未来趋势
Sensors (Basel). 2023 Oct 3;23(19):8229. doi: 10.3390/s23198229.
2
Multi-Agent Decision-Making Modes in Uncertain Interactive Traffic Scenarios via Graph Convolution-Based Deep Reinforcement Learning.基于图卷积的深度强化学习在不确定交互式交通场景中的多智能体决策模式。
Sensors (Basel). 2022 Jun 17;22(12):4586. doi: 10.3390/s22124586.
3
How Do Autonomous Vehicles Decide?自动驾驶汽车如何决策?
Sensors (Basel). 2022 Dec 28;23(1):317. doi: 10.3390/s23010317.
4
Mixed traffic flow of human-driven vehicles and connected autonomous vehicles: String stability and fundamental diagram.人-车混合交通流中的车联网自动驾驶车辆:稳定性与基本图。
Math Biosci Eng. 2023 Jan;20(2):2280-2295. doi: 10.3934/mbe.2023107. Epub 2022 Nov 17.
5
A Multi-Task Fusion Strategy-Based Decision-Making and Planning Method for Autonomous Driving Vehicles.一种基于多任务融合策略的自动驾驶车辆决策与规划方法
Sensors (Basel). 2023 Aug 8;23(16):7021. doi: 10.3390/s23167021.
6
TAKEN: A Traffic Knowledge-Based Navigation System for Connected and Autonomous Vehicles.采用:面向互联和自动驾驶车辆的基于交通知识的导航系统。
Sensors (Basel). 2023 Jan 6;23(2):653. doi: 10.3390/s23020653.
7
Constraint-Guided Behavior Transformer for Centralized Coordination of Connected and Automated Vehicles at Intersections.用于交叉路口联网和自动驾驶车辆集中协调的约束引导行为变换器
Sensors (Basel). 2024 Aug 11;24(16):5187. doi: 10.3390/s24165187.
8
Law compliance decision making for autonomous vehicles on highways.高速公路自动驾驶汽车的法律合规决策。
Accid Anal Prev. 2024 Sep;204:107620. doi: 10.1016/j.aap.2024.107620. Epub 2024 May 31.
9
Risk perception and the warning strategy based on safety potential field theory.基于安全势场理论的风险感知与预警策略。
Accid Anal Prev. 2020 Dec;148:105805. doi: 10.1016/j.aap.2020.105805. Epub 2020 Oct 24.
10
Evaluating the safety impact of connected and autonomous vehicles on motorways.评估高速公路上互联和自动驾驶汽车的安全影响。
Accid Anal Prev. 2019 Mar;124:12-22. doi: 10.1016/j.aap.2018.12.019. Epub 2019 Jan 2.

本文引用的文献

1
Challenges and Opportunities in Deep Reinforcement Learning With Graph Neural Networks: A Comprehensive Review of Algorithms and Applications.基于图神经网络的深度强化学习中的挑战与机遇:算法与应用综述
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):15051-15071. doi: 10.1109/TNNLS.2023.3283523. Epub 2024 Oct 30.
2
Deep Reinforcement Learning: A Survey.深度强化学习综述
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5064-5078. doi: 10.1109/TNNLS.2022.3207346. Epub 2024 Apr 4.
3
Generalized Single-Vehicle-Based Graph Reinforcement Learning for Decision-Making in Autonomous Driving.
基于广义单车图的强化学习在自动驾驶决策中的应用。
Sensors (Basel). 2022 Jun 29;22(13):4935. doi: 10.3390/s22134935.
4
Multi-Agent Decision-Making Modes in Uncertain Interactive Traffic Scenarios via Graph Convolution-Based Deep Reinforcement Learning.基于图卷积的深度强化学习在不确定交互式交通场景中的多智能体决策模式。
Sensors (Basel). 2022 Jun 17;22(12):4586. doi: 10.3390/s22124586.
5
A Reinforcement Learning-Based Vehicle Platoon Control Strategy for Reducing Energy Consumption in Traffic Oscillations.一种基于强化学习的车辆编队控制策略,用于减少交通振荡中的能量消耗。
IEEE Trans Neural Netw Learn Syst. 2021 Dec;32(12):5309-5322. doi: 10.1109/TNNLS.2021.3071959. Epub 2021 Nov 30.
6
IHG-MA: Inductive heterogeneous graph multi-agent reinforcement learning for multi-intersection traffic signal control.IHG-MA:用于多交叉口交通信号控制的归纳异质图多智能体强化学习。
Neural Netw. 2021 Jul;139:265-277. doi: 10.1016/j.neunet.2021.03.015. Epub 2021 Mar 22.
7
Human-like driving behaviour emerges from a risk-based driver model.类人驾驶行为源于基于风险的驾驶员模型。
Nat Commun. 2020 Sep 29;11(1):4850. doi: 10.1038/s41467-020-18353-4.
8
A Comprehensive Survey on Graph Neural Networks.图神经网络综述。
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24. doi: 10.1109/TNNLS.2020.2978386. Epub 2021 Jan 4.
9
The graph neural network model.图神经网络模型。
IEEE Trans Neural Netw. 2009 Jan;20(1):61-80. doi: 10.1109/TNN.2008.2005605. Epub 2008 Dec 9.