• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度强化学习的狭窄空间自动泊车分层轨迹规划:联邦学习方案。

Hierarchical Trajectory Planning for Narrow-Space Automated Parking with Deep Reinforcement Learning: A Federated Learning Scheme.

机构信息

School of Artificial Intelligence, Beijing University of Posts and Telecommunications, 10 Xitucheng Road, Haidian Distinct, Beijing 100876, China.

Centre for Telecommunications Research, King's College London, London WC2R 2LS, UK.

出版信息

Sensors (Basel). 2023 Apr 18;23(8):4087. doi: 10.3390/s23084087.

DOI:10.3390/s23084087
PMID:37112428
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10143055/
Abstract

Collision-free trajectory planning in narrow spaces has become one of the most challenging tasks in automated parking scenarios. Previous optimization-based approaches can generate accurate parking trajectories, but these methods cannot compute feasible solutions with extremely complex constraints in a limited time. Recent research uses neural-network-based approaches that can generate time-optimized parking trajectories in linear time. However, the generalization of these neural network models in different parking scenarios has not been considered thoroughly and the risk of privacy compromise exists in the case of centralized training. To address the above issues, this paper proposes a ierarchical trjectory panning method with deep reinfrcement learning in the fderated learning cheme (HALOES) to rapidly and accurately generate collision-free automated parking trajectories in multiple narrow spaces. HALOES is a federated learning based hierarchical trajectory planning method to fully exert high-level deep reinforcement learning and the low-level optimization-based approach. HALOES further fuse the deep reinforcement learning model parameters to improve the generalization capabilities with a decentralized training scheme. The federated learning scheme in HALOES aims to protect the privacy of the vehicle's data during model parameter aggregation. Simulation results show that the proposed method can achieve efficient automatic parking in multiple narrow spaces, improve planning time from 12.15% to 66.02% compared to other state-of-the-art methods (e.g., hybrid A*, OBCA) and maintain the same level of trajectory accuracy while having great model generalization.

摘要

在狭窄空间中进行无碰撞轨迹规划已成为自动泊车场景中最具挑战性的任务之一。基于优化的先前方法可以生成精确的停车轨迹,但这些方法无法在有限的时间内计算出具有极其复杂约束的可行解决方案。最近的研究使用基于神经网络的方法,可以在线性时间内生成时间优化的停车轨迹。然而,这些神经网络模型在不同的停车场景中的泛化尚未得到充分考虑,并且在集中式训练的情况下存在隐私泄露的风险。为了解决上述问题,本文提出了一种分层轨迹规划方法,该方法在联邦学习方案(HALOES)中结合了深度强化学习,以快速准确地生成多个狭窄空间中的无碰撞自动泊车轨迹。HALOES 是一种基于联邦学习的分层轨迹规划方法,可充分发挥高级深度强化学习和低级基于优化的方法的作用。HALOES 进一步融合了深度强化学习模型参数,通过分散式训练方案提高了泛化能力。HALOES 中的联邦学习方案旨在在模型参数聚合过程中保护车辆数据的隐私。仿真结果表明,与其他最先进的方法(例如,混合 A*,OBCA)相比,所提出的方法可以在多个狭窄空间中实现高效的自动泊车,规划时间从 12.15%提高到 66.02%,同时保持相同的轨迹精度水平,具有很强的模型泛化能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/d556c387e988/sensors-23-04087-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/902910c1d5a5/sensors-23-04087-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/8d19b38b6dbc/sensors-23-04087-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/443be0074d40/sensors-23-04087-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/a3b6fdacc07e/sensors-23-04087-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/a12d58318bb2/sensors-23-04087-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/b5e7255896f2/sensors-23-04087-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/ae2f223fc108/sensors-23-04087-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/d185945b5e3a/sensors-23-04087-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/d556c387e988/sensors-23-04087-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/902910c1d5a5/sensors-23-04087-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/8d19b38b6dbc/sensors-23-04087-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/443be0074d40/sensors-23-04087-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/a3b6fdacc07e/sensors-23-04087-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/a12d58318bb2/sensors-23-04087-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/b5e7255896f2/sensors-23-04087-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/ae2f223fc108/sensors-23-04087-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/d185945b5e3a/sensors-23-04087-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fd0/10143055/d556c387e988/sensors-23-04087-g009.jpg

相似文献

1
Hierarchical Trajectory Planning for Narrow-Space Automated Parking with Deep Reinforcement Learning: A Federated Learning Scheme.基于深度强化学习的狭窄空间自动泊车分层轨迹规划:联邦学习方案。
Sensors (Basel). 2023 Apr 18;23(8):4087. doi: 10.3390/s23084087.
2
Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces.基于模型的预测控制与强化学习用于垂直停车位的车辆泊车轨迹规划
Sensors (Basel). 2023 Aug 11;23(16):7124. doi: 10.3390/s23167124.
3
Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System.数据高效强化学习在自动化泊车系统中的横向规划与控制集成。
Sensors (Basel). 2020 Dec 18;20(24):7297. doi: 10.3390/s20247297.
4
An automatic driving trajectory planning approach in complex traffic scenarios based on integrated driver style inference and deep reinforcement learning.一种基于集成驾驶员风格推理和深度强化学习的复杂交通场景下自动驾驶轨迹规划方法。
PLoS One. 2024 Jan 25;19(1):e0297192. doi: 10.1371/journal.pone.0297192. eCollection 2024.
5
Autonomous Rear Parking via Rapidly Exploring Random-Tree-Based Reinforcement Learning.基于快速探索随机树的强化学习的自主倒车入库。
Sensors (Basel). 2022 Sep 2;22(17):6655. doi: 10.3390/s22176655.
6
Design of a reinforcement learning-based intelligent car transfer planning system for parking lots.基于强化学习的停车场智能车辆转移规划系统设计
Math Biosci Eng. 2024 Jan;21(1):1058-1081. doi: 10.3934/mbe.2024044. Epub 2022 Dec 22.
7
Visual Detection and Image Processing of Parking Space Based on Deep Learning.基于深度学习的停车位视觉检测与图像处理。
Sensors (Basel). 2022 Sep 3;22(17):6672. doi: 10.3390/s22176672.
8
Deep Reinforcement Learning Based Trajectory Planning Under Uncertain Constraints.基于深度强化学习的不确定约束下轨迹规划
Front Neurorobot. 2022 May 2;16:883562. doi: 10.3389/fnbot.2022.883562. eCollection 2022.
9
A Trajectory Planning Method for Autonomous Valet Parking via Solving an Optimal Control Problem.一种通过求解最优控制问题实现自动代客泊车的轨迹规划方法。
Sensors (Basel). 2020 Nov 11;20(22):6435. doi: 10.3390/s20226435.
10
Reinforcement Learning-Based End-to-End Parking for Automatic Parking System.基于强化学习的全自动泊车系统端到端泊车
Sensors (Basel). 2019 Sep 16;19(18):3996. doi: 10.3390/s19183996.

引用本文的文献

1
An Overview of Autonomous Parking Systems: Strategies, Challenges, and Future Directions.自动泊车系统概述:策略、挑战与未来方向
Sensors (Basel). 2025 Jul 10;25(14):4328. doi: 10.3390/s25144328.
2
GPU Rasterization-Based 3D LiDAR Simulation for Deep Learning.用于深度学习的基于GPU光栅化的3D激光雷达模拟
Sensors (Basel). 2023 Sep 28;23(19):8130. doi: 10.3390/s23198130.

本文引用的文献

1
Deep Reinforcement Learning Based Trajectory Planning Under Uncertain Constraints.基于深度强化学习的不确定约束下轨迹规划
Front Neurorobot. 2022 May 2;16:883562. doi: 10.3389/fnbot.2022.883562. eCollection 2022.
2
Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System.数据高效强化学习在自动化泊车系统中的横向规划与控制集成。
Sensors (Basel). 2020 Dec 18;20(24):7297. doi: 10.3390/s20247297.
3
Reinforcement Learning-Based End-to-End Parking for Automatic Parking System.基于强化学习的全自动泊车系统端到端泊车
Sensors (Basel). 2019 Sep 16;19(18):3996. doi: 10.3390/s19183996.