• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强化学习在基于风险的网络物理能源系统鲁棒控制中的作用。

Role of reinforcement learning for risk-based robust control of cyber-physical energy systems.

作者信息

Du Yan, Chatterjee Samrat, Bhattacharya Arnab, Dutta Ashutosh, Halappanavar Mahantesh

机构信息

Optimization and Control Group, Pacific Northwest National Laboratory, Richland, WA, USA.

Data Sciences and Machine Intelligence Group, Pacific Northwest National Laboratory, Richland, WA, USA.

出版信息

Risk Anal. 2023 Nov;43(11):2280-2297. doi: 10.1111/risa.14104. Epub 2023 Feb 6.

DOI:10.1111/risa.14104
PMID:36746175
Abstract

Critical infrastructures such as cyber-physical energy systems (CPS-E) integrate information flow and physical operations that are vulnerable to natural and targeted failures. Safe, secure, and reliable operation and control of CPS-E is critical to ensure societal well-being and economic prosperity. Automated control is key for real-time operations and may be mathematically cast as a sequential decision-making problem under uncertainty. Emergence of data-driven techniques for decision making under uncertainty, such as reinforcement learning (RL), have led to promising advances for addressing sequential decision-making problems for risk-based robust CPS-E control. However, existing research challenges include understanding the applicability of RL methods across diverse CPS-E applications, addressing the effect of risk preferences across multiple RL methods, and development of open-source domain-aware simulation environments for RL experimentation within a CPS-E context. This article systematically analyzes the applicability of four types of RL methods (model-free, model-based, hybrid model-free and model-based, and hierarchical) for risk-based robust CPS-E control. Problem features and solution stability for the RL methods are also discussed. We demonstrate and compare the performance of multiple RL methods under different risk specifications (risk-averse, risk-neutral, and risk-seeking) through the development and application of an open-source simulation environment. Motivating numerical simulation examples include representative single-zone and multizone building control use cases. Finally, six key insights for future research and broader adoption of RL methods are identified, with specific emphasis on problem features, algorithmic explainability, and solution stability.

摘要

诸如网络物理能源系统(CPS-E)之类的关键基础设施整合了信息流和物理操作,而这些都容易受到自然故障和针对性故障的影响。CPS-E的安全、可靠运行与控制对于确保社会福祉和经济繁荣至关重要。自动化控制是实时运行的关键,并且在数学上可以被视为不确定性下的顺序决策问题。用于不确定性下决策的数据驱动技术的出现,如强化学习(RL),为解决基于风险的稳健CPS-E控制的顺序决策问题带来了有前景的进展。然而,现有的研究挑战包括理解RL方法在不同CPS-E应用中的适用性、解决多种RL方法中风险偏好的影响,以及开发用于在CPS-E环境中进行RL实验的开源领域感知仿真环境。本文系统地分析了四种类型的RL方法(无模型、基于模型、混合无模型和基于模型以及分层)在基于风险的稳健CPS-E控制中的适用性。还讨论了RL方法的问题特征和解决方案稳定性。我们通过开发和应用开源仿真环境,展示并比较了多种RL方法在不同风险规范(风险规避、风险中性和风险寻求)下的性能。具有启发性的数值模拟示例包括具有代表性的单区域和多区域建筑控制用例。最后,确定了未来研究和更广泛采用RL方法的六个关键见解,特别强调问题特征、算法可解释性和解决方案稳定性。

相似文献

1
Role of reinforcement learning for risk-based robust control of cyber-physical energy systems.强化学习在基于风险的网络物理能源系统鲁棒控制中的作用。
Risk Anal. 2023 Nov;43(11):2280-2297. doi: 10.1111/risa.14104. Epub 2023 Feb 6.
2
QACDes: QoS-aware context-sensitive design of cyber-physical systems.QACDes:信息物理系统的QoS感知上下文敏感设计
Sci Rep. 2024 Aug 16;14(1):19032. doi: 10.1038/s41598-024-69371-x.
3
Defending the Defender: Adversarial Learning Based Defending Strategy for Learning Based Security Methods in Cyber-Physical Systems (CPS).捍卫防御者:基于对抗学习的防御策略,用于网络物理系统 (CPS) 中的基于学习的安全方法。
Sensors (Basel). 2023 Jun 9;23(12):5459. doi: 10.3390/s23125459.
4
Reinforcement Learning Methods in Public Health.公共卫生中的强化学习方法
Clin Ther. 2022 Jan;44(1):139-154. doi: 10.1016/j.clinthera.2021.11.002. Epub 2022 Jan 19.
5
Reinforcement learning for intensive care medicine: actionable clinical insights from novel approaches to reward shaping and off-policy model evaluation.重症医学中的强化学习:奖励塑造和离策略模型评估新方法带来的可操作临床见解。
Intensive Care Med Exp. 2024 Mar 25;12(1):32. doi: 10.1186/s40635-024-00614-x.
6
Multiple memory systems as substrates for multiple decision systems.多种记忆系统作为多种决策系统的基础。
Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.
7
A Holistic Review of Cyber-Physical-Social Systems: New Directions and Opportunities.网络物理社会系统的全面综述:新方向与机遇
Sensors (Basel). 2023 Aug 24;23(17):7391. doi: 10.3390/s23177391.
8
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling.我们进行个性化了吗?使用重采样通过在线强化学习算法评估个性化。
Mach Learn. 2024 Jul;113(7):3961-3997. doi: 10.1007/s10994-024-06526-x. Epub 2024 Apr 10.
9
Progress and summary of reinforcement learning on energy management of MPS-EV.多端口系统电动汽车能量管理强化学习的进展与综述
Heliyon. 2023 Nov 29;10(1):e23014. doi: 10.1016/j.heliyon.2023.e23014. eCollection 2024 Jan 15.
10
Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications.用于多智能体系统的深度强化学习:挑战、解决方案及应用综述
IEEE Trans Cybern. 2020 Sep;50(9):3826-3839. doi: 10.1109/TCYB.2020.2977374. Epub 2020 Mar 20.