Achieving Online Regression Performance of LSTMs With Simple RNNs.

Authors

Vural N Mert, Ilhan Fatih, Yilmaz Selim F, Ergut Salih, Kozat Suleyman Serdar

Publication

IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7632-7643. doi: 10.1109/TNNLS.2021.3086029. Epub 2022 Nov 30.

Abstract

Recurrent neural networks (RNNs) are widely used for online regression due to their ability to generalize nonlinear temporal dependencies. As an RNN model, long short-term memory networks (LSTMs) are commonly preferred in practice, as these networks are capable of learning long-term dependencies while avoiding the vanishing gradient problem. However, due to their large number of parameters, LSTMs require considerably longer training time than simple RNNs (SRNNs). In this article, we achieve the online regression performance of LSTMs with SRNNs efficiently. To this end, we introduce a first-order training algorithm with linear time complexity in the number of parameters. We show that when SRNNs are trained with our algorithm, they provide regression performance very similar to that of LSTMs in two to three times shorter training time. We support our experimental results with a strong theoretical analysis, providing regret bounds on the convergence rate of our algorithm. Through an extensive set of experiments, we verify our theoretical work and demonstrate significant performance improvements of our algorithm with respect to LSTMs and other state-of-the-art learning models.
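The abstract describes training SRNNs for online regression with a first-order algorithm whose per-step cost is linear in the number of parameters. As a rough illustration of that setting only, the sketch below implements a generic online SRNN regressor with a plain SGD-style update truncated to the current step; the network sizes, learning rate, and the `step` helper are hypothetical choices for this example and do not reproduce the paper's specific algorithm or its regret guarantees.

```python
# Minimal sketch of online regression with a simple RNN (SRNN) trained by a
# first-order (SGD-style) update. This only illustrates the general setting
# described in the abstract; it is NOT the paper's algorithm, and the one-step
# gradient truncation below is an assumption made for brevity.
import numpy as np

rng = np.random.default_rng(0)

n_in, n_hidden = 4, 16           # input and hidden dimensions (arbitrary)
lr = 0.01                        # learning rate (hypothetical value)

# SRNN parameters: h_t = tanh(W_h h_{t-1} + W_x x_t), y_hat_t = w^T h_t
W_h = rng.normal(scale=0.1, size=(n_hidden, n_hidden))
W_x = rng.normal(scale=0.1, size=(n_hidden, n_in))
w = rng.normal(scale=0.1, size=n_hidden)

h_prev = np.zeros(n_hidden)

def step(x_t, y_t):
    """Process one (x_t, y_t) pair online: predict, observe the loss, update."""
    global W_h, W_x, w, h_prev
    h = np.tanh(W_h @ h_prev + W_x @ x_t)
    y_hat = w @ h                          # prediction before y_t is revealed
    err = y_hat - y_t                      # squared-error loss: 0.5 * err**2

    # First-order gradients, truncated to the current step (assumption:
    # h_prev is treated as a constant, so the cost per update stays
    # linear in the number of parameters).
    dh = err * w * (1.0 - h**2)            # dL/d(pre-activation) through tanh
    w -= lr * err * h
    W_h -= lr * np.outer(dh, h_prev)
    W_x -= lr * np.outer(dh, x_t)

    h_prev = h
    return y_hat

# Toy online stream: regress a noisy moving sum of the inputs.
for t in range(1000):
    x_t = rng.normal(size=n_in)
    y_t = 0.5 * x_t.sum() + 0.1 * rng.normal()
    step(x_t, y_t)
```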
