Department of Electrical Engineering, Neurosciences Program, Stanford University, Stanford, CA 94305-9505, USA.
Neural Comput. 2013 Mar;25(3):626-49. doi: 10.1162/NECO_a_00409. Epub 2012 Dec 28.
Recurrent neural networks (RNNs) are useful tools for learning nonlinear relationships between time-varying inputs and outputs with complex temporal dependencies. Recently developed algorithms have been successful at training RNNs to perform a wide variety of tasks, but the resulting networks have been treated as black boxes: their mechanism of operation remains unknown. Here we explore the hypothesis that fixed points, both stable and unstable, and the linearized dynamics around them, can reveal crucial aspects of how RNNs implement their computations. Further, we explore the utility of linearization in areas of phase space that are not true fixed points but merely points of very slow movement. We present a simple optimization technique that is applied to trained RNNs to find the fixed and slow points of their dynamics. Linearization around these slow regions can be used to explore, or reverse-engineer, the behavior of the RNN. We describe the technique, illustrate it using simple examples, and finally showcase it on three high-dimensional RNN examples: a 3-bit flip-flop device, an input-dependent sine wave generator, and a two-point moving average. In all cases, the mechanisms of trained networks could be inferred from the sets of fixed and slow points and the linearized dynamics around them.
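The abstract's optimization technique can be made concrete with a short sketch. The following is a minimal illustration, not the paper's own code: it assumes a toy continuous-time RNN with dynamics dx/dt = F(x) = -x + tanh(Jx + b), defines the scalar speed q(x) = ½|F(x)|², minimizes q to locate fixed and slow points, and inspects the eigenvalues of the Jacobian at the minimum to characterize the linearized dynamics there. All names (`F`, `q`, `find_slow_point`) and parameter values are illustrative assumptions.

```python
# Sketch of the fixed/slow-point search described in the abstract, on a
# toy continuous-time RNN dx/dt = F(x) = -x + tanh(J x + b).
# Illustrative only; not the authors' implementation.
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
N = 50                                        # number of units (assumed)
J = rng.normal(0.0, 1.2 / np.sqrt(N), (N, N))  # random recurrent weights
b = np.zeros(N)                               # bias

def F(x):
    """RNN dynamics: dx/dt = F(x)."""
    return -x + np.tanh(J @ x + b)

def jacobian(x):
    """Jacobian dF/dx = -I + diag(1 - tanh^2(Jx + b)) J."""
    d = 1.0 - np.tanh(J @ x + b) ** 2
    return -np.eye(N) + d[:, None] * J

def q(x):
    """Scalar speed q(x) = 1/2 |F(x)|^2; zero exactly at fixed points."""
    f = F(x)
    return 0.5 * f @ f

def grad_q(x):
    """Analytic gradient of q: J_F(x)^T F(x)."""
    return jacobian(x).T @ F(x)

def find_slow_point(x0, tol=1e-12):
    """Minimize q from x0; small nonzero minima are slow points."""
    res = minimize(q, x0, jac=grad_q, method="L-BFGS-B", tol=tol)
    return res.x, res.fun

# Seed the search near states the network actually visits.
x_star, q_star = find_slow_point(rng.normal(0.0, 0.5, N))
eigs = np.linalg.eigvals(jacobian(x_star))
print(f"q(x*) = {q_star:.2e}, max Re(eig) = {eigs.real.max():+.3f}")
# q(x*) ~ 0 indicates a true fixed point; eigenvalues of the Jacobian with
# positive real part mark unstable directions of the linearized dynamics.
```

In practice the search is run from many initial conditions sampled along trained-network trajectories, and the resulting set of fixed and slow points, together with their local linearizations, is what supports the reverse-engineering described above.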