• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system.

作者信息

Porr B, Wörgötter P

机构信息

Department of Psychology, University of Stirling, Stirling, UK.

出版信息

Biosystems. 2002 Oct-Dec;67(1-3):195-202. doi: 10.1016/s0303-2647(02)00077-1.

DOI:10.1016/s0303-2647(02)00077-1
PMID:12459299
Abstract

In this article, we present an isotropic algorithm for sequence order learning. Its central goal is to learn the causal relation between two (or more) inputs in order to react to the earliest incoming signal after successful learning (like in typical classical conditioning situations). We implement this algorithm in a behaving system (a robot) thereby creating a closed loop situation where the learner's actions influence its own sensor inputs to the end of creating an autonomous agent. Autonomous behaviour implies that learning goals are internally defined within the organism's capabilities. Standard learning models for sequence learning (e.g. temporal difference (TD)-learning) need an externally defined reward. This, however, is in conflict with the requirement of an implicitly defined internal goal in autonomous behaviour. Therefore, in this study we present a system in which the external reward is replaced by a reflex loop. This loop explicitly includes the environment. Every reflex loop has the inherent disadvantage, which is that its re-actions occur each time just after a reflex-eliciting sensor event and thus 'too late'. However, a reflex can serve as the internal reference for sequence order learning, which has the task of eliminating this disadvantage by creating earlier anticipatory actions. In our system learning is achieved by modifying synaptic weights of a linear neuron with a correlation based learning rule which involves the derivative of the neuron's output. All input lines are entirely isotropic. The synaptic weight change curve of this rule is strongly related to the temporal Hebb learning rule, which was found in spike timing experiments. We find that after learning the reflex loop is replaced in functional terms with an earlier anticipatory action (and pathway). In addition, we observed that the synaptic weights stabilise as soon as the reflex remains silent.

摘要

相似文献

1
Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system.
Biosystems. 2002 Oct-Dec;67(1-3):195-202. doi: 10.1016/s0303-2647(02)00077-1.
2
Isotropic-sequence-order learning in a closed-loop behavioural system.闭环行为系统中的各向同性序列顺序学习
Philos Trans A Math Phys Eng Sci. 2003 Oct 15;361(1811):2225-44. doi: 10.1098/rsta.2003.1273.
3
Isotropic sequence order learning.各向同性序列顺序学习。
Neural Comput. 2003 Apr;15(4):831-64. doi: 10.1162/08997660360581921.
4
ISO learning approximates a solution to the inverse-controller problem in an unsupervised behavioral paradigm.
Neural Comput. 2003 Apr;15(4):865-84. doi: 10.1162/08997660360581930.
5
Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only.仅通过使用输入相关性就显著提高了时间序列学习的稳定性并加快了收敛速度。
Neural Comput. 2006 Jun;18(6):1380-412. doi: 10.1162/neco.2006.18.6.1380.
6
Chained learning architectures in a simple closed-loop behavioural context.简单闭环行为情境中的链式学习架构
Biol Cybern. 2007 Dec;97(5-6):363-78. doi: 10.1007/s00422-007-0176-y. Epub 2007 Oct 3.
7
Behavioral analysis of differential Hebbian learning in closed-loop systems.闭环系统中差异赫布学习的行为分析
Biol Cybern. 2010 Oct;103(4):255-71. doi: 10.1007/s00422-010-0396-4. Epub 2010 Jun 17.
8
Mathematical properties of neuronal TD-rules and differential Hebbian learning: a comparison.神经元时间差分规则和微分赫布学习的数学性质:比较
Biol Cybern. 2008 Mar;98(3):259-72. doi: 10.1007/s00422-007-0209-6. Epub 2008 Jan 15.
9
Autonomous learning based on cost assumptions: theoretical studies and experiments in robot control.基于成本假设的自主学习:机器人控制中的理论研究与实验
Int J Neural Syst. 1999 Jun;9(3):243-9. doi: 10.1142/s0129065799000241.
10
A forecast-based STDP rule suitable for neuromorphic implementation.一种适用于神经形态实现的基于预测的 STDP 规则。
Neural Netw. 2012 Aug;32:3-14. doi: 10.1016/j.neunet.2012.02.018. Epub 2012 Feb 14.