• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个与平台无关的深度强化学习框架,用于实现向自动驾驶的有效模拟到真实迁移。

A platform-agnostic deep reinforcement learning framework for effective Sim2Real transfer towards autonomous driving.

作者信息

Li Dianzhao, Okhrin Ostap

机构信息

Chair of Econometrics and Statistics, esp. in the Transport Sector, Technische Universität Dresden, Dresden, Germany.

Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig, Dresden, Germany.

出版信息

Commun Eng. 2024 Oct 17;3(1):147. doi: 10.1038/s44172-024-00292-3.

DOI:10.1038/s44172-024-00292-3
PMID:39420147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11487131/
Abstract

Autonomous driving presents unique challenges, particularly in transferring agents trained in simulation to real-world environments due to the discrepancies between the two. To address this issue, here we propose a robust Deep Reinforcement Learning (DRL) framework that incorporates platform-dependent perception modules to extract task-relevant information, enabling the training of a lane-following and overtaking agent in simulation. This framework facilitates the efficient transfer of the DRL agent to new simulated environments and the real world with minimal adjustments. We assess the performance of the agent across various driving scenarios in both simulation and the real world, comparing it to human drivers and a proportional-integral-derivative (PID) baseline in simulation. Additionally, we contrast it with other DRL baselines to clarify the rationale behind choosing this framework. Our proposed approach helps bridge the gaps between different platforms and the Simulation to Reality (Sim2Real) gap, allowing the trained agent to perform consistently in both simulation and real-world scenarios, effectively driving the vehicle.

摘要

自动驾驶带来了独特的挑战,尤其是由于模拟环境和现实世界之间的差异,将在模拟环境中训练的智能体转移到现实世界环境时会面临这些挑战。为了解决这个问题,我们在此提出一个强大的深度强化学习(DRL)框架,该框架整合了依赖平台的感知模块来提取与任务相关的信息,从而能够在模拟环境中训练一个车道跟踪和超车智能体。这个框架有助于以最小的调整将DRL智能体高效地转移到新的模拟环境和现实世界中。我们在模拟环境和现实世界的各种驾驶场景中评估该智能体的性能,在模拟环境中将其与人类驾驶员和比例积分微分(PID)基线进行比较。此外,我们将其与其他DRL基线进行对比,以阐明选择这个框架的理由。我们提出的方法有助于弥合不同平台之间的差距以及模拟到现实(Sim2Real)的差距,使训练后的智能体在模拟和现实世界场景中都能持续表现良好,有效地驾驶车辆。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/c78c1f4db2f6/44172_2024_292_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/62baf1e68065/44172_2024_292_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/29eeb6320738/44172_2024_292_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/6ccc7b9fbf82/44172_2024_292_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/871732e55146/44172_2024_292_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/cd9e1c004ae4/44172_2024_292_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/b6422be9d23d/44172_2024_292_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/c78c1f4db2f6/44172_2024_292_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/62baf1e68065/44172_2024_292_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/29eeb6320738/44172_2024_292_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/6ccc7b9fbf82/44172_2024_292_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/871732e55146/44172_2024_292_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/cd9e1c004ae4/44172_2024_292_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/b6422be9d23d/44172_2024_292_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fd8/11487131/c78c1f4db2f6/44172_2024_292_Fig7_HTML.jpg

相似文献

1
A platform-agnostic deep reinforcement learning framework for effective Sim2Real transfer towards autonomous driving.一个与平台无关的深度强化学习框架,用于实现向自动驾驶的有效模拟到真实迁移。
Commun Eng. 2024 Oct 17;3(1):147. doi: 10.1038/s44172-024-00292-3.
2
A Multi-Task Fusion Strategy-Based Decision-Making and Planning Method for Autonomous Driving Vehicles.一种基于多任务融合策略的自动驾驶车辆决策与规划方法
Sensors (Basel). 2023 Aug 8;23(16):7021. doi: 10.3390/s23167021.
3
Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.基于辅助评论家网络的自动驾驶策略深度强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3680-3690. doi: 10.1109/TNNLS.2021.3116063. Epub 2023 Jul 6.
4
Flying Through a Narrow Gap Using End-to-End Deep Reinforcement Learning Augmented With Curriculum Learning and Sim2Real.利用课程学习和模拟到现实增强的端到端深度强化学习飞过狭窄间隙
IEEE Trans Neural Netw Learn Syst. 2023 May;34(5):2701-2708. doi: 10.1109/TNNLS.2021.3107742. Epub 2023 May 2.
5
Robust ASV Navigation Through Ground to Water Cross-Domain Deep Reinforcement Learning.通过地面到水面跨域深度强化学习实现稳健的自主水面航行器导航
Front Robot AI. 2021 Sep 20;8:739023. doi: 10.3389/frobt.2021.739023. eCollection 2021.
6
Efficient Deep Reinforcement Learning With Imitative Expert Priors for Autonomous Driving.基于模仿专家先验的高效深度强化学习用于自动驾驶
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):7391-7403. doi: 10.1109/TNNLS.2022.3142822. Epub 2023 Oct 5.
7
Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning.基于安全强化学习的稳健自主高速公路驾驶决策方法
Sensors (Basel). 2024 Jun 26;24(13):4140. doi: 10.3390/s24134140.
8
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving.基于恐惧神经的强化学习在自动驾驶中的安全应用。
IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):267-279. doi: 10.1109/TPAMI.2023.3322426. Epub 2023 Dec 5.
9
Addressing data imbalance in Sim2Real: ImbalSim2Real scheme and its application in finger joint stiffness self-sensing for soft robot-assisted rehabilitation.解决从仿真到现实中的数据不平衡问题:ImbalSim2Real方案及其在软机器人辅助康复手指关节刚度自感知中的应用
Front Bioeng Biotechnol. 2024 Jun 14;12:1334643. doi: 10.3389/fbioe.2024.1334643. eCollection 2024.
10
Multiple Event-Based Simulation Scenario Generation Approach for Autonomous Vehicle Smart Sensors and Devices.基于多事件的自动驾驶车辆智能传感器与设备仿真场景生成方法。
Sensors (Basel). 2019 Oct 14;19(20):4456. doi: 10.3390/s19204456.

本文引用的文献

1
Discovering faster matrix multiplication algorithms with reinforcement learning.用强化学习发现更快的矩阵乘法算法。
Nature. 2022 Oct;610(7930):47-53. doi: 10.1038/s41586-022-05172-4. Epub 2022 Oct 5.
2
Magnetic control of tokamak plasmas through deep reinforcement learning.通过深度强化学习控制托卡马克等离子体。
Nature. 2022 Feb;602(7897):414-419. doi: 10.1038/s41586-021-04301-9. Epub 2022 Feb 16.
3
Outracing champion Gran Turismo drivers with deep reinforcement learning.用深度强化学习赶超 Gran Turismo 冠军车手。
Nature. 2022 Feb;602(7896):223-228. doi: 10.1038/s41586-021-04357-7. Epub 2022 Feb 9.
4
Learning efficient navigation in vortical flow fields.学习在涡旋流场中的有效导航。
Nat Commun. 2021 Dec 8;12(1):7143. doi: 10.1038/s41467-021-27015-y.
5
Intelligent driving intelligence test for autonomous vehicles with naturalistic and adversarial environment.自动驾驶智能测试 自然与对抗环境下的自主车辆
Nat Commun. 2021 Feb 2;12(1):748. doi: 10.1038/s41467-021-21007-8.
6
Autonomous navigation of stratospheric balloons using reinforcement learning.平流层气球的自主导航使用强化学习。
Nature. 2020 Dec;588(7836):77-82. doi: 10.1038/s41586-020-2939-8. Epub 2020 Dec 2.
7
An adaptive deep reinforcement learning framework enables curling robots with human-like performance in real-world conditions.一种自适应的深度强化学习框架使机器人能够在真实环境中表现出类人的性能。
Sci Robot. 2020 Sep 23;5(46). doi: 10.1126/scirobotics.abb9764.
8
Grandmaster level in StarCraft II using multi-agent reinforcement learning.星际争霸 II 中的大师级水平使用多智能体强化学习。
Nature. 2019 Nov;575(7782):350-354. doi: 10.1038/s41586-019-1724-z. Epub 2019 Oct 30.
9
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
10
A computational approach to edge detection.一种基于计算的边缘检测方法。
IEEE Trans Pattern Anal Mach Intell. 1986 Jun;8(6):679-98.