• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于机器人单试次学习的不断发展的基于尖峰时间依赖性的可塑性。

Evolving spike-timing-dependent plasticity for single-trial learning in robots.

作者信息

Di Paolo Ezequiel A

机构信息

School of Cognitive and Computing Sciences, University of Sussex, Brighton BN1 9QH, UK.

出版信息

Philos Trans A Math Phys Eng Sci. 2003 Oct 15;361(1811):2299-319. doi: 10.1098/rsta.2003.1256.

DOI:10.1098/rsta.2003.1256
PMID:14599321
Abstract

Single-trial learning is studied in an evolved robot model of synaptic spike-timing-dependent plasticity (STDP). Robots must perform positive phototaxis but must learn to perform negative phototaxis in the presence of a short-lived aversive sound stimulus. STDP acts at the millisecond range and depends asymmetrically on the relative timing of pre- and post-synaptic spikes. Although it has been involved in learning models of input prediction, these models require the iterated presentation of the input pattern, and it is hard to see how this mechanism could sustain single-trial learning over a time-scale of tens of seconds. An incremental evolutionary approach is used to answer this question. The evolved robots succeed in learning the appropriate behaviour, but learning does not depend on achieving the right synaptic configuration but rather the right pattern of neural activity. Robot performance during positive phototaxis is quite robust to loss of spike-timing information, but in contrast, this loss is catastrophic for learning negative phototaxis where entrained firing is common. Tests show that the final weight configuration carries no information about whether a robot is performing one behaviour or the other. Fixing weights, however, has the effect of degrading performance, thus demonstrating that plasticity is used to sustain the neural activity corresponding both to the normal phototaxis condition and to the learned behaviour. The implications and limitations of this result are discussed.

摘要

在一个基于突触脉冲时间依赖可塑性(STDP)的进化机器人模型中研究了单试学习。机器人必须执行正向趋光性,但在存在短暂厌恶声音刺激的情况下必须学会执行负向趋光性。STDP作用于毫秒级范围,并且不对称地依赖于突触前和突触后脉冲的相对时间。尽管它已被用于输入预测的学习模型中,但这些模型需要重复呈现输入模式,并且很难看出这种机制如何能在数十秒的时间尺度上维持单试学习。采用了一种增量进化方法来回答这个问题。进化后的机器人成功学会了适当的行为,但学习并不依赖于实现正确的突触配置,而是依赖于正确的神经活动模式。在正向趋光性过程中,机器人的表现对脉冲时间信息的丢失相当稳健,但相比之下,这种丢失对于学习负向趋光性来说是灾难性的,因为在负向趋光性中同步放电很常见。测试表明,最终的权重配置不携带关于机器人是在执行一种行为还是另一种行为的信息。然而,固定权重会导致性能下降,从而表明可塑性被用于维持与正常趋光性条件和学习行为相对应的神经活动。讨论了这一结果所具有的意义和局限性。

相似文献

1
Evolving spike-timing-dependent plasticity for single-trial learning in robots.用于机器人单试次学习的不断发展的基于尖峰时间依赖性的可塑性。
Philos Trans A Math Phys Eng Sci. 2003 Oct 15;361(1811):2299-319. doi: 10.1098/rsta.2003.1256.
2
Emergence of network structure due to spike-timing-dependent plasticity in recurrent neuronal networks V: self-organization schemes and weight dependence.循环神经网络中基于峰电位时间依赖性可塑性的网络结构出现V:自组织方案与权重依赖性
Biol Cybern. 2010 Nov;103(5):365-86. doi: 10.1007/s00422-010-0405-7. Epub 2010 Sep 29.
3
Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity.通过调节尖峰时间依赖性突触可塑性进行强化学习。
Neural Comput. 2007 Jun;19(6):1468-502. doi: 10.1162/neco.2007.19.6.1468.
4
Dynamic evolving spiking neural networks for on-line spatio- and spectro-temporal pattern recognition.用于在线时空谱模式识别的动态进化尖峰神经网络。
Neural Netw. 2013 May;41:188-201. doi: 10.1016/j.neunet.2012.11.014. Epub 2012 Dec 20.
5
Emergence of network structure due to spike-timing-dependent plasticity in recurrent neuronal networks. II. Input selectivity--symmetry breaking.由于递归神经元网络中尖峰时间依赖性可塑性导致的网络结构出现。II. 输入选择性——对称性破缺。
Biol Cybern. 2009 Aug;101(2):103-14. doi: 10.1007/s00422-009-0320-y. Epub 2009 Jun 18.
6
A forecast-based STDP rule suitable for neuromorphic implementation.一种适用于神经形态实现的基于预测的 STDP 规则。
Neural Netw. 2012 Aug;32:3-14. doi: 10.1016/j.neunet.2012.02.018. Epub 2012 Feb 14.
7
Representation of input structure in synaptic weights by spike-timing-dependent plasticity.通过尖峰时间依赖可塑性在突触权重中表征输入结构。
Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Aug;82(2 Pt 1):021912. doi: 10.1103/PhysRevE.82.021912. Epub 2010 Aug 13.
8
What can a neuron learn with spike-timing-dependent plasticity?神经元通过尖峰时间依赖性可塑性能够学习什么?
Neural Comput. 2005 Nov;17(11):2337-82. doi: 10.1162/0899766054796888.
9
Emergence of network structure due to spike-timing-dependent plasticity in recurrent neuronal networks. I. Input selectivity--strengthening correlated input pathways.由于递归神经网络中尖峰时间依赖性可塑性导致的网络结构出现。I. 输入选择性——强化相关输入通路。
Biol Cybern. 2009 Aug;101(2):81-102. doi: 10.1007/s00422-009-0319-4. Epub 2009 Jun 18.
10
Adaptive synchronization of activities in a recurrent network.循环网络中活动的自适应同步
Neural Comput. 2009 Jun;21(6):1749-75. doi: 10.1162/neco.2009.02-08-708.

引用本文的文献

1
Evolution of adaptive behaviour in robots by means of Darwinian selection.通过达尔文选择实现机器人适应性行为的进化。
PLoS Biol. 2010 Jan 26;8(1):e1000292. doi: 10.1371/journal.pbio.1000292.
2
Morphological communication: exploiting coupled dynamics in a complex mechanical structure to achieve locomotion.形态通讯:利用复杂机械结构中的耦合动力学实现运动。
J R Soc Interface. 2010 Apr 6;7(45):613-21. doi: 10.1098/rsif.2009.0240. Epub 2009 Sep 23.