Phan Bao Chau, Lai Ying-Chih, Lin Chin E
Department of Aeronautics and Aeronautics, National Cheng Kung University, Tainan 701, Taiwan.
UAV Center, Chang Jung Christian University, Tainan 701, Taiwan.
Sensors (Basel). 2020 May 27;20(11):3039. doi: 10.3390/s20113039.
On the issues of global environment protection, the renewable energy systems have been widely considered. The photovoltaic (PV) system converts solar power into electricity and significantly reduces the consumption of fossil fuels from environment pollution. Besides introducing new materials for the solar cells to improve the energy conversion efficiency, the maximum power point tracking (MPPT) algorithms have been developed to ensure the efficient operation of PV systems at the maximum power point (MPP) under various weather conditions. The integration of reinforcement learning and deep learning, named deep reinforcement learning (DRL), is proposed in this paper as a future tool to deal with the optimization control problems. Following the success of deep reinforcement learning (DRL) in several fields, the deep Q network (DQN) and deep deterministic policy gradient (DDPG) are proposed to harvest the MPP in PV systems, especially under a partial shading condition (PSC). Different from the reinforcement learning (RL)-based method, which is only operated with discrete state and action spaces, the methods adopted in this paper are used to deal with continuous state spaces. In this study,DQN solves the problem with discrete action spaces, while DDPG handles the continuous action spaces. The proposed methods are simulated in MATLAB/Simulink for feasibility analysis. Further tests under various input conditions with comparisons to the classical Perturb and observe (P&O) MPPT method are carried out for validation. Based on the simulation results in this study, the performance of the proposed methods is outstanding and efficient, showing its potential for further applications.
在全球环境保护问题上,可再生能源系统已得到广泛关注。光伏(PV)系统将太阳能转化为电能,显著减少了化石燃料消耗对环境污染的影响。除了引入新型材料以提高太阳能电池的能量转换效率外,还开发了最大功率点跟踪(MPPT)算法,以确保光伏系统在各种天气条件下都能在最大功率点(MPP)高效运行。本文提出将强化学习和深度学习相结合,即深度强化学习(DRL),作为未来解决优化控制问题的工具。继深度强化学习(DRL)在多个领域取得成功之后,提出了深度Q网络(DQN)和深度确定性策略梯度(DDPG)来获取光伏系统中的最大功率点,特别是在部分阴影条件(PSC)下。与仅在离散状态和动作空间中运行的基于强化学习(RL)的方法不同,本文采用的方法用于处理连续状态空间。在本研究中,DQN解决离散动作空间的问题,而DDPG处理连续动作空间的问题。所提出的方法在MATLAB/Simulink中进行了仿真以进行可行性分析。在各种输入条件下进行了进一步测试,并与经典的扰动观察(P&O)MPPT方法进行比较以进行验证。基于本研究的仿真结果,所提出方法的性能出色且高效,显示出其进一步应用的潜力。