Zhao Haoli, Wu Jiqiang, Li Zhenni, Chen Wuhui, Zheng Zibin
IEEE Trans Cybern. 2023 Feb;53(2):765-778. doi: 10.1109/TCYB.2022.3157892. Epub 2023 Jan 13.
Deep reinforcement learning (DRL), whose performance depends heavily on the learned data representation, has shown its potential in many practical decision-making problems. However, representation learning in DRL is easily affected by model interference and, moreover, retains unnecessary parameters, which degrades control performance. In this article, we propose a double-sparse DRL method based on multilayer sparse coding and nonconvex regularized pruning. To alleviate interference in DRL, we design a multilayer sparse-coding-structured network that yields deep sparse representations for reinforcement-learning control. Furthermore, we employ a nonconvex log regularizer to promote strong sparsity, efficiently removing unnecessary weights through a regularizer-based pruning scheme. The resulting double-sparse DRL algorithm not only learns deep sparse representations that reduce interference but also removes redundant weights while maintaining robust performance. Experimental results in five benchmark environments under the deep Q-network (DQN) architecture demonstrate that the proposed method, with deep sparse representations from the multilayer sparse-coding structure, outperforms existing sparse-coding-based DRL in control: for example, it completes Mountain Car in 140.81 steps, a nearly 10% reward increase over the single-layer sparse-coding DRL algorithm, and scores 286.08 in Catcher, more than twice the rewards of the other algorithms. Moreover, the proposed algorithm removes over 80% of the parameters while retaining the performance gains from deep sparse representations.
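To make the two mechanisms in the abstract concrete, the following is a minimal PyTorch sketch, not the authors' implementation: an ISTA-style multilayer sparse-coding encoder (unrolled soft-thresholding layers) feeding a linear Q-head, a nonconvex log penalty of the form sum(log(1 + |w|/eps)) added to the TD loss, and a magnitude-pruning pass over the penalized weights. All names, layer sizes, and hyperparameters (`SparseCodingLayer`, `eps`, `threshold`, the 1e-4 penalty weight) are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseCodingLayer(nn.Module):
    """One unrolled ISTA step: linear dictionary projection + soft thresholding."""
    def __init__(self, in_dim, code_dim, theta=0.1):
        super().__init__()
        self.W = nn.Linear(in_dim, code_dim, bias=False)
        self.theta = theta  # soft-threshold level controlling code sparsity

    def forward(self, x):
        z = self.W(x)
        # soft thresholding: sign(z) * max(|z| - theta, 0) yields a sparse code
        return torch.sign(z) * F.relu(z.abs() - self.theta)

class DoubleSparseQNet(nn.Module):
    """Stacked sparse-coding layers (deep sparse representation) + linear Q-head."""
    def __init__(self, obs_dim, n_actions, hidden=(64, 64)):
        super().__init__()
        dims = (obs_dim,) + hidden
        self.encoder = nn.Sequential(
            *[SparseCodingLayer(dims[i], dims[i + 1]) for i in range(len(hidden))]
        )
        self.q_head = nn.Linear(hidden[-1], n_actions)

    def forward(self, obs):
        return self.q_head(self.encoder(obs))

def log_regularizer(model, eps=0.01):
    """Nonconvex log penalty sum(log(1 + |w|/eps)); stronger sparsity than L1."""
    return sum(torch.log1p(p.abs() / eps).sum() for p in model.parameters())

def prune_small_weights(model, threshold=1e-3):
    """Regularizer-based pruning: zero out weights driven near zero by the penalty."""
    with torch.no_grad():
        for p in model.parameters():
            p.mul_((p.abs() > threshold).float())

# Usage inside a standard DQN update (TD-target computation omitted for brevity):
net = DoubleSparseQNet(obs_dim=4, n_actions=2)
obs = torch.randn(32, 4)
q_values = net(obs)
td_loss = q_values.mean()  # placeholder for the usual Huber TD loss
loss = td_loss + 1e-4 * log_regularizer(net)
loss.backward()
prune_small_weights(net)  # applied after (or periodically during) training
```

In this reading, "double sparse" refers to sparsity at two levels: the activations (the codes produced by the thresholding layers) and the weights (driven toward zero by the log penalty and then pruned).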