• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度异步演员-评论家学习的最优电梯群控

Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning.

作者信息

Wei Qinglai, Wang Lingxiao, Liu Yu, Polycarpou Marios M

出版信息

IEEE Trans Neural Netw Learn Syst. 2020 Dec;31(12):5245-5256. doi: 10.1109/TNNLS.2020.2965208. Epub 2020 Nov 30.

DOI:10.1109/TNNLS.2020.2965208
PMID:32071000
Abstract

In this article, a new deep reinforcement learning (RL) method, called asynchronous advantage actor-critic (A3C) method, is developed to solve the optimal control problem of elevator group control systems (EGCSs). The main contribution of this article is that the optimal control law of EGCSs is designed via a new deep RL method, such that the elevator system sends passengers to the desired destination floors as soon as possible. Deep convolutional and recurrent neural networks, which can update themselves during applications, are designed to dispatch elevators. Then, the structure of the A3C method is developed, and the training phase for the learning optimal law is discussed. Finally, simulation results illustrate that the developed method effectively reduces the average waiting time in a complex building environment. Comparisons with traditional algorithms further verify the effectiveness of the developed method.

摘要

在本文中,开发了一种名为异步优势动作评论家(A3C)方法的新型深度强化学习(RL)方法,以解决电梯群控系统(EGCS)的最优控制问题。本文的主要贡献在于,通过一种新型深度强化学习方法设计了EGCS的最优控制律,使得电梯系统能够尽快将乘客送达期望的目的楼层。设计了能够在应用过程中自我更新的深度卷积和循环神经网络来调度电梯。然后,构建了A3C方法的结构,并讨论了学习最优律的训练阶段。最后,仿真结果表明,所开发的方法在复杂建筑环境中有效减少了平均等待时间。与传统算法的比较进一步验证了所开发方法的有效性。

相似文献

1
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning.基于深度异步演员-评论家学习的最优电梯群控
IEEE Trans Neural Netw Learn Syst. 2020 Dec;31(12):5245-5256. doi: 10.1109/TNNLS.2020.2965208. Epub 2020 Nov 30.
2
Asynchronous learning for actor-critic neural networks and synchronous triggering for multiplayer system.异步学习的演员-批评神经网络和同步触发的多人系统。
ISA Trans. 2022 Oct;129(Pt B):295-308. doi: 10.1016/j.isatra.2022.02.007. Epub 2022 Feb 10.
3
Navigation in Unknown Dynamic Environments Based on Deep Reinforcement Learning.基于深度强化学习的未知动态环境导航。
Sensors (Basel). 2019 Sep 5;19(18):3837. doi: 10.3390/s19183837.
4
Deep Reinforcement Learning Microgrid Optimization Strategy Considering Priority Flexible Demand Side.考虑优先级灵活需求侧的深度强化学习微电网优化策略
Sensors (Basel). 2022 Mar 14;22(6):2256. doi: 10.3390/s22062256.
5
Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.用于深度强化学习的随机集成演员-评论家算法
IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):6654-6666. doi: 10.1109/TNNLS.2022.3212273. Epub 2024 May 2.
6
Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.通过演员-评论家强化学习实现多人扑克的最优策略
Entropy (Basel). 2022 May 30;24(6):774. doi: 10.3390/e24060774.
7
An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems.一种基于深度强化学习的演员-评论家框架,用于解决柔性作业车间调度问题。
Math Biosci Eng. 2024 Jan;21(1):1445-1471. doi: 10.3934/mbe.2024062. Epub 2022 Dec 28.
8
RIS-Aided Proactive Mobile Network Downlink Interference Suppression: A Deep Reinforcement Learning Approach.基于RIS的主动式移动网络下行链路干扰抑制:一种深度强化学习方法。
Sensors (Basel). 2023 Jul 20;23(14):6550. doi: 10.3390/s23146550.
9
A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents.A3C-GS:具有锁的异步动作-评价器代理的自适应矩梯度共享。
IEEE Trans Neural Netw Learn Syst. 2021 Mar;32(3):1162-1176. doi: 10.1109/TNNLS.2020.2980743. Epub 2021 Mar 1.
10
Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning.通过深度强化学习的仿生水下航行器目标跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3741-3752. doi: 10.1109/TNNLS.2021.3054402. Epub 2022 Aug 3.

引用本文的文献

1
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique.一种基于多可解释改进近端策略优化算法和少样本技术的平面跟踪策略。
Sci Rep. 2024 Feb 16;14(1):3910. doi: 10.1038/s41598-024-54268-6.
2
Category learning in a recurrent neural network with reinforcement learning.基于强化学习的循环神经网络中的类别学习。
Front Psychiatry. 2022 Oct 25;13:1008011. doi: 10.3389/fpsyt.2022.1008011. eCollection 2022.
3
Magnetic resonance imaging image analysis of the therapeutic effect and neuroprotective effect of deep brain stimulation in Parkinson's disease based on a deep learning algorithm.
基于深度学习算法的帕金森病深部脑刺激治疗效果和神经保护作用的磁共振成像图像分析。
Int J Numer Method Biomed Eng. 2022 Nov;38(11):e3642. doi: 10.1002/cnm.3642. Epub 2022 Aug 30.