• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于高密度停车场空间调度的混合残差多专家强化学习

Hybrid Residual Multiexpert Reinforcement Learning for Spatial Scheduling of High-Density Parking Lots.

作者信息

Hou Jing, Chen Guang, Li Zhijun, He Wei, Gu Shangding, Knoll Alois, Jiang Changjun

出版信息

IEEE Trans Cybern. 2024 May;54(5):2771-2783. doi: 10.1109/TCYB.2023.3312647. Epub 2024 Apr 16.

DOI:10.1109/TCYB.2023.3312647
PMID:37871089
Abstract

Industries, such as manufacturing, are accelerating their embrace of the metaverse to achieve higher productivity, especially in complex industrial scheduling. In view of the growing parking challenges in large cities, high-density vehicle spatial scheduling is one of the potential solutions. Stack-based parking lots utilize parking robots to densely park vehicles in the vertical stacks like container stacking, which greatly reduces the aisle area in the parking lot, but requires complex scheduling algorithms to park and take out the vehicles. The existing high-density parking (HDP) scheduling algorithms are mainly heuristic methods, which only contain simple logic and are difficult to utilize information effectively. We propose a hybrid residual multiexpert (HIRE) reinforcement learning (RL) approach, a method for interactive learning in the digital industrial metaverse, which efficiently solves the HDP batch space scheduling problem. In our proposed framework, each heuristic scheduling method is considered as an expert. The neural network trained by RL assigns the expert strategy according to the current parking lot state. Furthermore, to avoid being limited by heuristic expert performance, the proposed hierarchical network framework also sets up a residual output channel. Experiments show that our proposed algorithm outperforms various advanced heuristic methods and the end-to-end RL method in the number of vehicle maneuvers, and has good robustness to the parking lot size and the estimation accuracy of vehicle exit time. We believe that the proposed HIRE RL method can be effectively and conveniently applied to practical application scenarios, which can be regarded as a key step for RL to enter the practical application stage of the industrial metaverse.

摘要

制造业等行业正在加速对元宇宙的应用,以提高生产力,尤其是在复杂的工业调度方面。鉴于大城市日益严峻的停车挑战,高密度车辆空间调度是一种潜在的解决方案。基于堆叠的停车场利用停车机器人将车辆像集装箱堆叠一样密集地停放在垂直堆叠中,这大大减少了停车场的过道面积,但需要复杂的调度算法来停车和取车。现有的高密度停车(HDP)调度算法主要是启发式方法,其逻辑简单,难以有效利用信息。我们提出了一种混合残差多专家(HIRE)强化学习(RL)方法,这是一种在数字工业元宇宙中进行交互式学习的方法,能有效解决HDP批量空间调度问题。在我们提出的框架中,每种启发式调度方法都被视为一个专家。通过强化学习训练的神经网络根据当前停车场状态分配专家策略。此外,为避免受启发式专家性能的限制,所提出的分层网络框架还设置了一个残差输出通道。实验表明,我们提出的算法在车辆操作次数方面优于各种先进的启发式方法和端到端强化学习方法,并且对停车场大小和车辆离开时间的估计精度具有良好的鲁棒性。我们相信,所提出的HIRE强化学习方法能够有效且方便地应用于实际应用场景,这可被视为强化学习进入工业元宇宙实际应用阶段的关键一步。

相似文献

1
Hybrid Residual Multiexpert Reinforcement Learning for Spatial Scheduling of High-Density Parking Lots.用于高密度停车场空间调度的混合残差多专家强化学习
IEEE Trans Cybern. 2024 May;54(5):2771-2783. doi: 10.1109/TCYB.2023.3312647. Epub 2024 Apr 16.
2
Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System.数据高效强化学习在自动化泊车系统中的横向规划与控制集成。
Sensors (Basel). 2020 Dec 18;20(24):7297. doi: 10.3390/s20247297.
3
A Cost-Effective Electric Vehicle Intelligent Charge Scheduling Method for Commercial Smart Parking Lots Using a Simplified Convex Relaxation Technique.一种使用简化凸松弛技术的面向商业智能停车场的电动汽车经济高效智能充电调度方法。
Sensors (Basel). 2020 Aug 27;20(17):4842. doi: 10.3390/s20174842.
4
A Heuristic-Based Adaptive Iterated Greedy Algorithm for Lot-Streaming Hybrid Flow Shop Scheduling Problem with Consistent and Intermingled Sub-Lots.基于启发式的自适应迭代贪婪算法求解一致且混合子批的批量流混合流水车间调度问题。
Sensors (Basel). 2023 Mar 3;23(5):2808. doi: 10.3390/s23052808.
5
Dynamic Space Allocation Based on Internal Demand for Optimizing Release of Shared Parking.基于内部需求的动态空间分配优化共享停车释放。
Sensors (Basel). 2021 Dec 29;22(1):235. doi: 10.3390/s22010235.
6
Design of a reinforcement learning-based intelligent car transfer planning system for parking lots.基于强化学习的停车场智能车辆转移规划系统设计
Math Biosci Eng. 2024 Jan;21(1):1058-1081. doi: 10.3934/mbe.2024044. Epub 2022 Dec 22.
7
Spatiotemporal Clustering of Parking Lots at the City Level for Efficiently Sharing Occupancy Forecasting Models.城市级别的停车场时空聚类,以实现高效共享车位占有率预测模型。
Sensors (Basel). 2023 May 31;23(11):5248. doi: 10.3390/s23115248.
8
Reinforcement Learning-Based End-to-End Parking for Automatic Parking System.基于强化学习的全自动泊车系统端到端泊车
Sensors (Basel). 2019 Sep 16;19(18):3996. doi: 10.3390/s19183996.
9
Expert system design for vacant parking space location using automatic learning and artificial vision.基于自动学习与人工视觉的空车位定位专家系统设计
Multimed Tools Appl. 2022;81(27):38661-38683. doi: 10.1007/s11042-022-12906-z. Epub 2022 Apr 26.
10
Hierarchical Trajectory Planning for Narrow-Space Automated Parking with Deep Reinforcement Learning: A Federated Learning Scheme.基于深度强化学习的狭窄空间自动泊车分层轨迹规划:联邦学习方案。
Sensors (Basel). 2023 Apr 18;23(8):4087. doi: 10.3390/s23084087.