Tsungnan Lin, Bill G. Horne, C. Lee Giles
EPSON Palo Alto Laboratory, Palo Alto, USA
Neural Netw. 1998 Jul;11(5):861-868. doi: 10.1016/s0893-6080(98)00018-5.
How embedded memory in recurrent neural network architectures helps learning long-term temporal dependencies
Learning long-term temporal dependencies with recurrent neural networks can be a difficult problem. It has recently been shown that a class of recurrent neural networks called NARX networks performs much better than conventional recurrent neural networks at learning certain simple long-term dependency problems. The intuitive explanation for this behavior is that the output memories of a NARX network manifest as jump-ahead connections in the time-unfolded network. These jump-ahead connections propagate gradient information more efficiently, thus reducing the sensitivity of the network to the problem of long-term dependencies. This work gives empirical support to our hypothesis that similar improvements in learning long-term dependencies can be achieved in other classes of recurrent neural network architectures simply by increasing the order of the embedded memory. In particular, we explore how the order of embedded memory affects the ability of three classes of recurrent neural network architectures to learn simple long-term dependency problems: globally recurrent networks, locally recurrent networks, and NARX (output feedback) networks. Comparing the performance of these architectures with different orders of embedded memory on two simple long-term dependency problems shows that all three classes of network architectures improve significantly at learning long-term dependencies as the order of embedded memory is increased. These results can be important to a user committed to a specific recurrent neural network architecture, because simply increasing the embedded memory order of that architecture will make it more robust to the problem of learning long-term dependencies.
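To make the jump-ahead mechanism concrete, below is a minimal NumPy sketch of a NARX-style network whose output-memory order D can be varied. This is not the authors' code: the network size, initialization, and the names narx_forward, W_in, W_fb, w_out, and D are illustrative assumptions. The model computes y(t) = f(u(t), y(t-1), ..., y(t-D)), so each output tap becomes a jump-ahead connection once the network is unfolded in time.

```python
import numpy as np

rng = np.random.default_rng(0)

def narx_forward(u, params, D):
    """Run a one-hidden-layer NARX-style network over a scalar input sequence.

    The hidden layer sees the current input plus a tapped delay line of the
    last D network *outputs* (the embedded output memory).  In the
    time-unfolded graph, each tap y[t-d] is a jump-ahead connection: a
    gradient can cross d time steps through a single edge, which is the
    mechanism the abstract credits for easier long-term learning.
    """
    W_in, W_fb, w_out = params            # shapes: (H,), (H, D), (H,)
    T = len(u)
    y = np.zeros(T)
    for t in range(T):
        # D most recent outputs, zero-padded before the start of the sequence
        taps = np.array([y[t - d] if t >= d else 0.0 for d in range(1, D + 1)])
        h = np.tanh(W_in * u[t] + W_fb @ taps)   # hidden activations, shape (H,)
        y[t] = w_out @ h                         # scalar readout
    return y

# Illustrative usage only: with memory order D, a gradient needs roughly
# T / D jump-ahead hops (instead of T one-step hops) to link time steps
# that are T apart, which is why larger D eases long-term credit assignment.
H, D = 8, 5
params = (rng.normal(size=H), 0.1 * rng.normal(size=(H, D)), rng.normal(size=H))
print(narx_forward(rng.normal(size=30), params, D))
```

The same tapped-delay-line idea carries over to the other two architecture classes the paper studies: for globally or locally recurrent networks, the delay line holds past hidden states rather than past outputs, but increasing its order shortens gradient paths in the same way.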