
Training recurrent networks by Evolino.

Author Information

Schmidhuber Jürgen, Wierstra Daan, Gagliolo Matteo, Gomez Faustino

Affiliation

IDSIA, 6928 Manno (Lugano), Switzerland.

Publication Information

Neural Comput. 2007 Mar;19(3):757-79. doi: 10.1162/neco.2007.19.3.757.

Abstract

In recent years, gradient-based LSTM recurrent neural networks (RNNs) solved many previously RNN-unlearnable tasks. Sometimes, however, gradient information is of little use for training RNNs, due to numerous local minima. For such cases, we present a novel method: EVOlution of systems with LINear Outputs (Evolino). Evolino evolves weights to the nonlinear, hidden nodes of RNNs while computing optimal linear mappings from hidden state to output, using methods such as pseudo-inverse-based linear regression. If we instead use quadratic programming to maximize the margin, we obtain the first evolutionary recurrent support vector machines. We show that Evolino-based LSTM can solve tasks that Echo State nets (Jaeger, 2004a) cannot and achieves higher accuracy in certain continuous function generation tasks than conventional gradient descent RNNs, including gradient-based LSTM.
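The abstract's key computational step is the readout: given the hidden-state trajectory of an evolved RNN, the optimal linear mapping to the targets is found in closed form rather than by gradient descent. A minimal sketch of that pseudo-inverse regression step (the function name, toy data, and dimensions are illustrative assumptions, not taken from the paper):

```python
import numpy as np

def linear_readout(hidden_states, targets):
    """Least-squares output weights W minimizing
    ||hidden_states @ W - targets||^2, computed via the
    Moore-Penrose pseudo-inverse as in Evolino's readout step."""
    return np.linalg.pinv(hidden_states) @ targets

# Toy usage: stand-in "hidden states" for a hypothetical evolved RNN.
rng = np.random.default_rng(0)
H = rng.standard_normal((100, 8))    # 100 timesteps, 8 hidden units
Y = H @ rng.standard_normal((8, 1))  # targets exactly linear in H
W = linear_readout(H, Y)
mse = float(np.mean((H @ W - Y) ** 2))
```

Because the readout is solved exactly at every fitness evaluation, evolution only has to search the space of hidden (nonlinear) weights; swapping this regression for margin-maximizing quadratic programming yields the recurrent SVM variant mentioned in the abstract.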

