Optimal node perturbation in linear perceptrons with uncertain eligibility trace.

Affiliation

Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan.

Publication information

Neural Netw. 2010 Mar;23(2):219-25. doi: 10.1016/j.neunet.2009.11.013. Epub 2009 Dec 2.

DOI:10.1016/j.neunet.2009.11.013
PMID:20005670
Abstract

Node perturbation learning has been receiving much attention as a method for achieving stochastic gradient descent. As it does not require direct gradient calculations, it can be applied to a reinforcement learning framework. However, in conventional node perturbation learning, the residual error due to perturbation is not eliminated even after convergence. Using infinitesimal perturbations suppresses the residual error, but such perturbations are less robust against uncertainty and noise in an eligibility trace, which is a memory of perturbation and input. We derive an optimal parameter schedule for node perturbation learning used with linear perceptrons with uncertainty in the eligibility trace. Our adaptive learning rule resolves the trade-off between robustness against the uncertainty and residual error reduction. The results obtained will be useful in designing learning rules and interpreting related biological knowledge.
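As a rough illustration of the conventional rule the abstract describes, the following sketch trains a single-output linear perceptron by node perturbation in a teacher-student setting. It is not the optimal parameter schedule derived in the paper, and it does not model uncertainty in the eligibility trace. The function name `node_perturbation`, the parameters `sigma` (perturbation size) and `eta` (learning rate), the Gaussian inputs, and the noiseless baseline error are all assumptions made for this example.

```python
import random


def node_perturbation(n_inputs=5, sigma=0.1, eta=0.02, steps=5000, seed=0):
    """Train a linear perceptron with conventional node perturbation.

    Returns the squared distance between the learned weights and the
    (hypothetical) teacher weights after the final step.
    """
    rng = random.Random(seed)
    w_true = [rng.gauss(0, 1) for _ in range(n_inputs)]  # teacher weights
    w = [0.0] * n_inputs                                 # student weights

    for _ in range(steps):
        x = [rng.gauss(0, 1) for _ in range(n_inputs)]
        target = sum(a * b for a, b in zip(w_true, x))
        y = sum(a * b for a, b in zip(w, x))

        xi = rng.gauss(0, 1)                              # output-node perturbation
        e_base = 0.5 * (y - target) ** 2                  # unperturbed error
        e_pert = 0.5 * (y + sigma * xi - target) ** 2     # perturbed error

        # Eligibility trace: a memory of the perturbation and the input.
        trace = [xi * xj for xj in x]

        # The error change, scaled by 1/sigma, times the trace estimates the
        # gradient without computing it directly.
        scale = eta * (e_pert - e_base) / sigma
        w = [wi - scale * t for wi, t in zip(w, trace)]

    return sum((a - b) ** 2 for a, b in zip(w, w_true))


if __name__ == "__main__":
    for s in (0.5, 0.1, 0.01):
        print(f"sigma={s}: squared weight error = {node_perturbation(sigma=s):.3e}")
```

In this sketch the update does not vanish at the teacher weights, because the perturbed error still contains a term proportional to `sigma ** 2`. Running it with a smaller `sigma` leaves a smaller final weight error, which matches the abstract's remark that small perturbations suppress the residual error. The cost of small perturbations under a noisy eligibility trace, which is the trade-off the paper addresses, is outside the scope of this example.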

