Hierarchical deep reinforcement learning for self-adaptive economic dispatch.

Authors

Li Mengshi, Yang Dongyan, Xu Yuhan, Ji Tianyao

Affiliation

School of Electric Power Engineering, South China University of Technology, 510000, Guangzhou, China.

Publication

Heliyon. 2024 Jul 8;10(14):e33944. doi: 10.1016/j.heliyon.2024.e33944. eCollection 2024 Jul 30.

DOI:10.1016/j.heliyon.2024.e33944
PMID:39114005
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11303998/
Abstract

Accurately modeling the overall uncertainty of a power system is challenging when it is connected to large-scale intermittent generation sources such as wind and photovoltaic generation, owing to the inherent volatility, uncertainty, and indivisibility of renewable energy. Deep reinforcement learning (DRL) algorithms are introduced as a solution that avoids modeling the complex uncertainties and adapts to their fluctuations by interacting with the environment and using feedback to continuously improve their strategies. However, the large scale and uncertainty of the system lead to the sparse reward problem and the high-dimensional space problem in DRL. A hierarchical deep reinforcement learning (HDRL) scheme is designed to decompose the solution process into two stages, using a reinforcement learning (RL) agent in the global stage and a heuristic algorithm in the local stage to find optimal dispatching decisions for power systems under uncertainty. Simulation studies show that the proposed HDRL scheme solves power system economic dispatch problems efficiently under both deterministic and uncertain scenarios, thanks to its adaptation to system uncertainty and its ability to cope with the volatility of uncertain factors, while significantly improving the speed of online decision-making.
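For readers unfamiliar with the underlying problem, the sketch below shows the classical deterministic economic dispatch that schemes like the one above build on. It is not the HDRL method from the paper, whose global and local stages the abstract does not detail. It assumes the textbook formulation: each unit has a quadratic fuel cost a + b*P + c*P^2 with c > 0, renewable output is simply subtracted from demand to give a net load, and the cheapest split is found by lambda iteration (bisection on the common marginal cost). The function name `dispatch` and the unit data are hypothetical.

```python
from typing import List, Tuple

# Hypothetical unit data: (a, b, c, p_min, p_max) for cost a + b*P + c*P^2, c > 0.
Unit = Tuple[float, float, float, float, float]


def dispatch(units: List[Unit], net_load: float, tol: float = 1e-6) -> List[float]:
    """Lambda iteration: bisect on marginal cost until total output meets net load."""
    lo_cap = sum(u[3] for u in units)
    hi_cap = sum(u[4] for u in units)
    if not lo_cap <= net_load <= hi_cap:
        raise ValueError("net load outside the units' combined capacity range")

    def outputs(lam: float) -> List[float]:
        # Each unit runs where its marginal cost b + 2cP equals lam,
        # clipped to its generation limits.
        return [min(max((lam - b) / (2 * c), p_min), p_max)
                for _, b, c, p_min, p_max in units]

    # Bracket lam between the lowest and highest marginal cost of any unit.
    lam_lo = min(b + 2 * c * p_min for _, b, c, p_min, _ in units)
    lam_hi = max(b + 2 * c * p_max for _, b, c, _, p_max in units)
    while lam_hi - lam_lo > tol:
        lam = 0.5 * (lam_lo + lam_hi)
        if sum(outputs(lam)) < net_load:
            lam_lo = lam  # too little generation: raise the marginal cost
        else:
            lam_hi = lam  # enough generation: lower the marginal cost
    return outputs(lam_hi)


if __name__ == "__main__":
    # Illustrative numbers only: 150 MW demand, 30 MW wind/PV output.
    units = [(100.0, 20.0, 0.05, 10.0, 100.0), (120.0, 25.0, 0.10, 10.0, 80.0)]
    print(dispatch(units, net_load=150.0 - 30.0))
```

A solver like this needs the renewable output as a known input, which is the modeling burden the abstract says DRL is meant to avoid.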




Cited by

1
Multi-objective evolutionary method for multi-area dynamic emission/economic dispatch considering energy storage and renewable energy units.
Heliyon. 2024 Sep 6;10(18):e37476. doi: 10.1016/j.heliyon.2024.e37476. eCollection 2024 Sep 30.