• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的全自动泊车系统端到端泊车

Reinforcement Learning-Based End-to-End Parking for Automatic Parking System.

机构信息

School of Automotive Studies, Tongji University, Shanghai 201804, China.

SAIC Motor Corporation Limited, Shanghai 201800, China.

出版信息

Sensors (Basel). 2019 Sep 16;19(18):3996. doi: 10.3390/s19183996.

DOI:10.3390/s19183996
PMID:31527481
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6766814/
Abstract

According to the existing mainstream automatic parking system (APS), a parking path is first planned based on the parking slot detected by the sensors. Subsequently, the path tracking module guides the vehicle to track the planned parking path. However, since the vehicle is non-linear dynamic, path tracking error inevitably occurs, leading to inclination and deviation of the parking. Accordingly, in this paper, a reinforcement learning-based end-to-end parking algorithm is proposed to achieve automatic parking. The vehicle can continuously learn and accumulate experience from numerous parking attempts and then learn the command of the optimal steering wheel angle at different parking slots. Based on this end-to-end parking, errors caused by path tracking can be avoided. Moreover, to ensure that the parking slot can be obtained continuously in the process of learning, a parking slot tracking algorithm is proposed based on the combination of vision and vehicle chassis information. Furthermore, given that the learning network output is hard to converge, and it is easy to fall into local optimum during the parking process, several reinforcement learning training methods in terms of parking conditions are developed. Lastly, by the real vehicle test, it is proved that using the proposed method can achieve a better parking attitude than using the path planning and path tracking-based method.

摘要

根据现有的主流自动泊车系统 (APS),首先基于传感器检测到的车位规划泊车路径。随后,路径跟踪模块引导车辆跟踪规划的泊车路径。然而,由于车辆是非线性动态的,路径跟踪误差不可避免,导致泊车倾斜和偏离。因此,本文提出了一种基于强化学习的端到端泊车算法,以实现自动泊车。车辆可以从无数次泊车尝试中不断学习和积累经验,然后学习在不同车位最佳转向角度的命令。基于这种端到端泊车,可以避免路径跟踪产生的误差。此外,为了确保在学习过程中能够连续获得车位,提出了一种基于视觉和车辆底盘信息相结合的车位跟踪算法。此外,由于学习网络输出难以收敛,并且在泊车过程中容易陷入局部最优,因此针对泊车条件开发了几种强化学习训练方法。最后,通过实车测试,证明了使用所提出的方法可以实现比基于路径规划和路径跟踪的方法更好的泊车姿态。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b688/6766814/8edc1b868a73/sensors-19-03996-g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b688/6766814/a7d1904345bb/sensors-19-03996-g020a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b688/6766814/977f479b5245/sensors-19-03996-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b688/6766814/8edc1b868a73/sensors-19-03996-g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b688/6766814/a7d1904345bb/sensors-19-03996-g020a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b688/6766814/977f479b5245/sensors-19-03996-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b688/6766814/8edc1b868a73/sensors-19-03996-g022.jpg

相似文献

1
Reinforcement Learning-Based End-to-End Parking for Automatic Parking System.基于强化学习的全自动泊车系统端到端泊车
Sensors (Basel). 2019 Sep 16;19(18):3996. doi: 10.3390/s19183996.
2
Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System.数据高效强化学习在自动化泊车系统中的横向规划与控制集成。
Sensors (Basel). 2020 Dec 18;20(24):7297. doi: 10.3390/s20247297.
3
Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces.基于模型的预测控制与强化学习用于垂直停车位的车辆泊车轨迹规划
Sensors (Basel). 2023 Aug 11;23(16):7124. doi: 10.3390/s23167124.
4
A Universal Vacant Parking Slot Recognition System Using Sensors Mounted on Off-the-Shelf Vehicles.一种使用安装在现成车辆上的传感器的通用空停车位识别系统。
Sensors (Basel). 2018 Apr 16;18(4):1213. doi: 10.3390/s18041213.
5
Robust Parking Path Planning with Error-Adaptive Sampling under Perception Uncertainty.基于感知不确定性的鲁棒停车路径规划与误差自适应采样。
Sensors (Basel). 2020 Jun 23;20(12):3560. doi: 10.3390/s20123560.
6
Geometric Features-Based Parking Slot Detection.基于几何特征的停车位检测。
Sensors (Basel). 2018 Aug 27;18(9):2821. doi: 10.3390/s18092821.
7
Autonomous Rear Parking via Rapidly Exploring Random-Tree-Based Reinforcement Learning.基于快速探索随机树的强化学习的自主倒车入库。
Sensors (Basel). 2022 Sep 2;22(17):6655. doi: 10.3390/s22176655.
8
Review of Vision-Based Deep Learning Parking Slot Detection on Surround View Images.基于视觉的深度学习在环视图像上进行停车位检测的综述。
Sensors (Basel). 2023 Aug 2;23(15):6869. doi: 10.3390/s23156869.
9
Application improvement of A* algorithm in intelligent vehicle trajectory planning.A*算法在智能车辆轨迹规划中的应用改进
Math Biosci Eng. 2020 Nov 17;18(1):1-21. doi: 10.3934/mbe.2021001.
10
Vacant Parking Slot Detection in the Around View Image Based on Deep Learning.基于深度学习的全景图像中空闲停车空位检测
Sensors (Basel). 2020 Apr 10;20(7):2138. doi: 10.3390/s20072138.

引用本文的文献

1
An Overview of Autonomous Parking Systems: Strategies, Challenges, and Future Directions.自动泊车系统概述:策略、挑战与未来方向
Sensors (Basel). 2025 Jul 10;25(14):4328. doi: 10.3390/s25144328.
2
A Deep Reinforcement Learning and Graph Convolution Approach to On-Street Parking Search Navigation.一种用于路内停车搜索导航的深度强化学习与图卷积方法
Sensors (Basel). 2025 Apr 9;25(8):2389. doi: 10.3390/s25082389.
3
Automatic parking trajectory planning in narrow spaces based on Hybrid A* and NMPC.基于混合A*算法和非线性模型预测控制的狭窄空间自动泊车轨迹规划

本文引用的文献

1
Introduction to the special issue on deep reinforcement learning:An editorial.深度强化学习特刊引言:一篇社论。
Neural Netw. 2018 Nov;107:1-2. doi: 10.1016/j.neunet.2018.08.001. Epub 2018 Aug 3.
2
Control of Gene Regulatory Networks Using Bayesian Inverse Reinforcement Learning.使用贝叶斯逆强化学习控制基因调控网络。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1250-1261. doi: 10.1109/TCBB.2018.2830357. Epub 2018 Apr 26.
3
Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards.
Sci Rep. 2025 Jan 9;15(1):1384. doi: 10.1038/s41598-025-85541-x.
4
Improved Immune Moth-Flame Optimization Based on Gene Correction for Automatic Reverse Parking.基于基因校正的改进免疫蛾火优化算法在自动倒车中的应用
Sensors (Basel). 2024 Apr 2;24(7):2270. doi: 10.3390/s24072270.
5
End-to-End Autonomous Navigation Based on Deep Reinforcement Learning with a Survival Penalty Function.基于带有生存惩罚函数的深度强化学习的端到端自主导航
Sensors (Basel). 2023 Oct 23;23(20):8651. doi: 10.3390/s23208651.
6
Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces.基于模型的预测控制与强化学习用于垂直停车位的车辆泊车轨迹规划
Sensors (Basel). 2023 Aug 11;23(16):7124. doi: 10.3390/s23167124.
7
Hierarchical Trajectory Planning for Narrow-Space Automated Parking with Deep Reinforcement Learning: A Federated Learning Scheme.基于深度强化学习的狭窄空间自动泊车分层轨迹规划:联邦学习方案。
Sensors (Basel). 2023 Apr 18;23(8):4087. doi: 10.3390/s23084087.
8
Autonomous Rear Parking via Rapidly Exploring Random-Tree-Based Reinforcement Learning.基于快速探索随机树的强化学习的自主倒车入库。
Sensors (Basel). 2022 Sep 2;22(17):6655. doi: 10.3390/s22176655.
9
Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System.数据高效强化学习在自动化泊车系统中的横向规划与控制集成。
Sensors (Basel). 2020 Dec 18;20(24):7297. doi: 10.3390/s20247297.
10
Sensors and Sensing for Intelligent Vehicles.智能车辆的传感器与传感技术。
Sensors (Basel). 2020 Sep 8;20(18):5115. doi: 10.3390/s20185115.
使用人类生成的奖励训练用于手臂运动的 Actor-Critic 强化学习控制器。
IEEE Trans Neural Syst Rehabil Eng. 2017 Oct;25(10):1892-1905. doi: 10.1109/TNSRE.2017.2700395. Epub 2017 May 2.
4
Quantization-Based Adaptive Actor-Critic Tracking Control With Tracking Error Constraints.基于量化的具有跟踪误差约束的自适应动作-评论家跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2018 Apr;29(4):970-980. doi: 10.1109/TNNLS.2017.2651104. Epub 2017 Feb 1.
5
Actor-Critic Off-Policy Learning for Optimal Control of Multiple-Model Discrete-Time Systems.多模型离散时间系统最优控制的基于演员-评论家的离线策略学习。
IEEE Trans Cybern. 2018 Jan;48(1):29-40. doi: 10.1109/TCYB.2016.2618926. Epub 2016 Nov 2.