Dai Bangwu, Chang Yuqing, Xu Sheng, Wang Fuli
Taizhou Electric Power Conversion and Control Engineering Technology Research Center, Taizhou University, Taizhou 225300, China.
State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang, Liaoning 110819, China.
ACS Omega. 2025 Apr 9;10(15):14844-14857. doi: 10.1021/acsomega.4c10011. eCollection 2025 Apr 22.
For the coal-fired power unit, traditional centralized optimization control frameworks face challenges in achieving fast load response due to heavy computation and long online calculation times, especially for devices with limited computing power. This paper proposes a process knowledge-guided distributed optimization control framework for once-through boiler-turbine unit using multiagent deep reinforcement learning. In this framework, a centralized training distributed execution multiagent deep reinforcement learning algorithm is employed to obtain the optimal controllers of the once-through boiler-turbine unit, by dividing the coordinated control system into three subsystems and modeling the control process as a fully cooperative multiagent Markov decision process. Moreover, the process knowledge represented by a low-precision process model is used to guide and improve the training of multiagent deep reinforcement learning by distributed model predictive control algorithm generating the initial control actions and the designed action fusion strategy. Finally, the effectiveness of the process knowledge-guided optimization control framework is verified by the simulation platform, and the results show that the proposed algorithm has a faster speed and better control effect than the compared algorithms.
对于燃煤发电机组,传统的集中优化控制框架由于计算量大和在线计算时间长,在实现快速负荷响应方面面临挑战,特别是对于计算能力有限的设备。本文提出了一种基于多智能体深度强化学习的直流锅炉-汽轮机单元过程知识引导分布式优化控制框架。在该框架中,采用集中训练分布式执行的多智能体深度强化学习算法,通过将协调控制系统划分为三个子系统,并将控制过程建模为完全协作的多智能体马尔可夫决策过程,来获得直流锅炉-汽轮机单元的最优控制器。此外,由低精度过程模型表示的过程知识用于通过分布式模型预测控制算法生成初始控制动作和设计的动作融合策略来指导和改进多智能体深度强化学习的训练。最后,通过仿真平台验证了过程知识引导优化控制框架的有效性,结果表明所提算法比对比算法具有更快的速度和更好的控制效果。