Chen Wei, Wan Haiying, Luan Xiaoli, Liu Fei
Key Laboratory of Advanced Process Control for Light Industry, Institute of Automation, Jiangnan University, Wuxi 214122, China.
Sensors (Basel). 2024 Mar 20;24(6):1986. doi: 10.3390/s24061986.
This paper introduces a novel data-driven self-triggered control approach based on a hierarchical reinforcement learning framework in networked motor control systems. This approach divides the self-triggered control policy into higher and lower layers, with the higher-level policy guiding the lower-level policy in decision-making, thereby reducing the exploration space of the lower-level policy and improving the efficiency of the learning process. The data-driven framework integrates with the dual-actor critic algorithm, using two interconnected neural networks to approximate the hierarchical policies. In this framework, we use recurrent neural networks as the network architecture for the critic, utilizing the temporal dynamics of recurrent neural networks to better capture the dependencies between costs, thus enhancing the critic network's efficiency and accuracy in approximating the multi-time cumulative cost function. Additionally, we have developed a pre-training method for the control policy networks to further improve learning efficiency. The effectiveness of our proposed method is validated through a series of numerical simulations.
本文介绍了一种基于分层强化学习框架的新型数据驱动自触发控制方法,应用于网络化电机控制系统。该方法将自触发控制策略分为高层和低层,高层策略在决策中指导低层策略,从而减少低层策略的探索空间,提高学习过程的效率。数据驱动框架与双智能体评判算法相结合,使用两个相互连接的神经网络来近似分层策略。在此框架中,我们使用循环神经网络作为评判器的网络架构,利用循环神经网络的时间动态特性更好地捕捉成本之间的依赖关系,从而提高评判器网络在近似多时刻累积成本函数时的效率和准确性。此外,我们还开发了一种控制策略网络的预训练方法,以进一步提高学习效率。通过一系列数值模拟验证了我们所提方法的有效性。