Suppr超能文献

基于强化学习信号反馈的广义学习:理论与应用

Broad Learning With Reinforcement Learning Signal Feedback: Theory and Applications.

作者信息

Mao Ruiqi, Cui Rongxin, Chen C L Philip

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):2952-2964. doi: 10.1109/TNNLS.2020.3047941. Epub 2022 Jul 6.

Abstract

Broad learning systems (BLSs) have attracted considerable attention due to their powerful ability in efficient discriminative learning. In this article, a modified BLS with reinforcement learning signal feedback (BLRLF) is proposed as an efficient method for improving the performance of standard BLS. The main differences between our research and BLS are as follows. First, we add weight optimization after adding additional nodes or new training samples. Motivated by the weight iterative optimization in the convolution neural network (CNN), we use the output of the network as feedback while employing value iteration (VI)-based adaptive dynamic programming (ADP) to facilitate calculation of near-optimal increments of connection weights. Second, different from the homogeneous incremental algorithms in standard BLS, we integrate those broad expansion methods, and the heuristic search method is used to enable the proposed BLRLF to optimize the network structure autonomously. Although the training time is affected to a certain extent compared with BLS, the newly proposed BLRLF still retains a fast computational nature. Finally, the proposed BLRLF is evaluated using popular benchmarks from the UC Irvine Machine Learning Repository and many other challenging data sets. These results show that BLRLF outperforms many state-of-the-art deep learning algorithms and shallow networks proposed in recent years.

摘要

广义学习系统(BLSs)因其在高效判别学习方面的强大能力而备受关注。本文提出了一种带有强化学习信号反馈的改进型BLS(BLRLF),作为提高标准BLS性能的有效方法。我们的研究与BLS的主要区别如下。首先,在添加额外节点或新训练样本后,我们增加了权重优化。受卷积神经网络(CNN)中权重迭代优化的启发,我们在使用基于值迭代(VI)的自适应动态规划(ADP)来促进连接权重近最优增量的计算时,将网络输出用作反馈。其次,与标准BLS中的同类增量算法不同,我们整合了那些广义扩展方法,并使用启发式搜索方法使所提出的BLRLF能够自主优化网络结构。尽管与BLS相比,训练时间在一定程度上受到影响,但新提出的BLRLF仍然保持快速计算的特性。最后,使用来自加州大学欧文分校机器学习库的流行基准测试以及许多其他具有挑战性的数据集对所提出的BLRLF进行评估。这些结果表明,BLRLF优于近年来提出的许多先进深度学习算法和浅层网络。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验