• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种具有生物学合理性的动作发现体现模型。

A biologically plausible embodied model of action discovery.

机构信息

Department of Psychology, Adaptive Behaviour Research Group, University of Sheffield Sheffield, UK.

出版信息

Front Neurorobot. 2013 Mar 12;7:4. doi: 10.3389/fnbot.2013.00004. eCollection 2013.

DOI:10.3389/fnbot.2013.00004
PMID:23487577
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3594743/
Abstract

During development, animals can spontaneously discover action-outcome pairings enabling subsequent achievement of their goals. We present a biologically plausible embodied model addressing key aspects of this process. The biomimetic model core comprises the basal ganglia and its loops through cortex and thalamus. We incorporate reinforcement learning (RL) with phasic dopamine supplying a sensory prediction error, signalling "surprising" outcomes. Phasic dopamine is used in a cortico-striatal learning rule which is consistent with recent data. We also hypothesized that objects associated with surprising outcomes acquire "novelty salience" contingent on the predicability of the outcome. To test this idea we used a simple model of prediction governing the dynamics of novelty salience and phasic dopamine. The task of the virtual robotic agent mimicked an in vivo counterpart (Gancarz et al., 2011) and involved interaction with a target object which caused a light flash, or a control object which did not. Learning took place according to two schedules. In one, the phasic outcome was delivered after interaction with the target in an unpredictable way which emulated the in vivo protocol. Without novelty salience, the model was unable to account for the experimental data. In the other schedule, the phasic outcome was reliably delivered and the agent showed a rapid increase in the number of interactions with the target which then decreased over subsequent sessions. We argue this is precisely the kind of change in behavior required to repeatedly present representations of context, action and outcome, to neural networks responsible for learning action-outcome contingency. The model also showed cortico-striatal plasticity consistent with learning a new action in basal ganglia. We conclude that action learning is underpinned by a complex interplay of plasticity and stimulus salience, and that our model contains many of the elements for biological action discovery to take place.

摘要

在发育过程中,动物可以自发地发现动作-结果配对,从而实现后续目标。我们提出了一个具有生物合理性的体现模型,解决了这个过程的关键方面。仿生模型的核心包括基底神经节及其通过皮层和丘脑的循环。我们将强化学习(RL)与相位多巴胺结合使用,相位多巴胺提供感官预测误差,表明“意外”的结果。相位多巴胺用于皮质-纹状体学习规则,该规则与最近的数据一致。我们还假设,与意外结果相关的物体根据结果的可预测性获得“新颖性显着性”。为了检验这个想法,我们使用了一个简单的预测模型来控制新颖性显着性和相位多巴胺的动力学。虚拟机器人代理的任务模拟了体内对应物(Gancarz 等人,2011 年),并涉及与目标物体的交互作用,该目标物体导致光闪烁,或者与控制物体交互不产生光闪烁。学习是根据两个时间表进行的。在一个时间表中,相位结果在与目标以不可预测的方式交互后交付,这模拟了体内方案。没有新颖性显着性,该模型无法解释实验数据。在另一个时间表中,相位结果可靠地交付,代理与目标的交互次数迅速增加,然后在随后的会话中减少。我们认为,正是这种行为变化需要反复呈现上下文、动作和结果的表示,以便为负责学习动作-结果相关性的神经网络提供。该模型还显示出与基底神经节中学习新动作一致的皮质-纹状体可塑性。我们得出结论,动作学习是由可塑性和刺激显着性的复杂相互作用支撑的,并且我们的模型包含了许多发生生物动作发现的元素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/8ba8edc1bb23/fnbot-07-00004-g0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/3d312d3136fe/fnbot-07-00004-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/5d5e02b41ab2/fnbot-07-00004-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/bb4479694ab9/fnbot-07-00004-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/385a4ac952dc/fnbot-07-00004-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/27a547ffdd26/fnbot-07-00004-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/03cab5ceef2e/fnbot-07-00004-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/02810257f9ee/fnbot-07-00004-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/9a7c6eaf6831/fnbot-07-00004-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/16e8e006c9a9/fnbot-07-00004-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/60535b92c2f1/fnbot-07-00004-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/8f331b09bc64/fnbot-07-00004-g0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/cc9733eafe31/fnbot-07-00004-g0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/f3e9a73fc12c/fnbot-07-00004-g0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/8ba8edc1bb23/fnbot-07-00004-g0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/3d312d3136fe/fnbot-07-00004-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/5d5e02b41ab2/fnbot-07-00004-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/bb4479694ab9/fnbot-07-00004-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/385a4ac952dc/fnbot-07-00004-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/27a547ffdd26/fnbot-07-00004-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/03cab5ceef2e/fnbot-07-00004-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/02810257f9ee/fnbot-07-00004-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/9a7c6eaf6831/fnbot-07-00004-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/16e8e006c9a9/fnbot-07-00004-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/60535b92c2f1/fnbot-07-00004-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/8f331b09bc64/fnbot-07-00004-g0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/cc9733eafe31/fnbot-07-00004-g0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/f3e9a73fc12c/fnbot-07-00004-g0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29d8/3594743/8ba8edc1bb23/fnbot-07-00004-g0014.jpg

相似文献

1
A biologically plausible embodied model of action discovery.一种具有生物学合理性的动作发现体现模型。
Front Neurorobot. 2013 Mar 12;7:4. doi: 10.3389/fnbot.2013.00004. eCollection 2013.
2
A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface.一种用于皮质-纹状体可塑性的新框架:行为理论在强化-行动界面与体外数据相遇。
PLoS Biol. 2015 Jan 6;13(1):e1002034. doi: 10.1371/journal.pbio.1002034. eCollection 2015 Jan.
3
A neurocomputational model of dopamine and prefrontal-striatal interactions during multicue category learning by Parkinson patients.帕金森病患者在多线索类别学习过程中多巴胺和前额叶-纹状体相互作用的神经计算模型。
J Cogn Neurosci. 2011 Jan;23(1):151-67. doi: 10.1162/jocn.2010.21420.
4
Striatal action-learning based on dopamine concentration.基于多巴胺浓度的纹状体动作学习。
Exp Brain Res. 2010 Jan;200(3-4):307-17. doi: 10.1007/s00221-009-2060-6. Epub 2009 Nov 11.
5
A Dual Role Hypothesis of the Cortico-Basal-Ganglia Pathways: Opponency and Temporal Difference Through Dopamine and Adenosine.皮质-基底神经节通路的双重作用假说:多巴胺和腺苷介导的对立和时间差分。
Front Neural Circuits. 2019 Jan 7;12:111. doi: 10.3389/fncir.2018.00111. eCollection 2018.
6
Keep focussing: striatal dopamine multiple functions resolved in a single mechanism tested in a simulated humanoid robot.保持专注:纹状体多巴胺的多种功能在一个模拟人形机器人中得到验证的单一机制中得到解决。
Front Psychol. 2014 Feb 21;5:124. doi: 10.3389/fpsyg.2014.00124. eCollection 2014.
7
The role of prediction and outcomes in adaptive cognitive control.预测与结果在适应性认知控制中的作用。
J Physiol Paris. 2015 Feb-Jun;109(1-3):38-52. doi: 10.1016/j.jphysparis.2015.02.001. Epub 2015 Feb 17.
8
Phasic dopamine as a prediction error of intrinsic and extrinsic reinforcements driving both action acquisition and reward maximization: a simulated robotic study.相位多巴胺作为内在和外在强化的预测误差,驱动着动作获取和奖励最大化:一项模拟机器人研究。
Neural Netw. 2013 Mar;39:40-51. doi: 10.1016/j.neunet.2012.12.012. Epub 2013 Jan 14.
9
The credit assignment problem in cortico-basal ganglia-thalamic networks: A review, a problem and a possible solution.皮质基底神经节丘脑网络中的信用分配问题:综述、问题及可能的解决方案。
Eur J Neurosci. 2021 Apr;53(7):2234-2253. doi: 10.1111/ejn.14745. Epub 2020 May 4.
10
Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior.基底神经节和眶额皮质在目标导向行为中的参与。
Prog Brain Res. 2000;126:193-215. doi: 10.1016/S0079-6123(00)26015-9.

引用本文的文献

1
Feasibility of dopamine as a vector-valued feedback signal in the basal ganglia.多巴胺作为基底神经节中向量值反馈信号的可行性。
Proc Natl Acad Sci U S A. 2023 Aug 8;120(32):e2221994120. doi: 10.1073/pnas.2221994120. Epub 2023 Aug 1.
2
Hierarchical intrinsically motivated agent planning behavior with dreaming in grid environments.网格环境中具有梦境的分层内在动机智能体规划行为
Brain Inform. 2022 Apr 2;9(1):8. doi: 10.1186/s40708-022-00156-6.
3
Neurorobotics Workshop for High School Students Promotes Competence and Confidence in Computational Neuroscience.

本文引用的文献

1
Intrinsically motivated action-outcome learning and goal-based action recall: a system-level bio-constrained computational model.内在动机驱动的动作-结果学习和基于目标的动作回忆:一种系统级的生物约束计算模型。
Neural Netw. 2013 May;41:168-87. doi: 10.1016/j.neunet.2012.09.015. Epub 2012 Oct 4.
2
Habituation and the reinforcing effectiveness of visual stimuli.习惯化与视觉刺激的强化效力
Behav Processes. 2012 Oct;91(2):184-91. doi: 10.1016/j.beproc.2012.07.007. Epub 2012 Jul 31.
3
Functional properties of the basal ganglia's re-entrant loop architecture: selection and reinforcement.
面向高中生的神经机器人技术工作坊提升了计算神经科学方面的能力和信心。
Front Neurorobot. 2020 Feb 13;14:6. doi: 10.3389/fnbot.2020.00006. eCollection 2020.
4
Integrating Brain and Biomechanical Models-A New Paradigm for Understanding Neuro-muscular Control.整合大脑与生物力学模型——理解神经肌肉控制的新范式
Front Neurosci. 2018 Feb 6;12:39. doi: 10.3389/fnins.2018.00039. eCollection 2018.
5
The basal ganglia select the expected sensory input used for predictive coding.基底神经节选择用于预测编码的预期感觉输入。
Front Comput Neurosci. 2015 Sep 23;9:119. doi: 10.3389/fncom.2015.00119. eCollection 2015.
6
A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface.一种用于皮质-纹状体可塑性的新框架:行为理论在强化-行动界面与体外数据相遇。
PLoS Biol. 2015 Jan 6;13(1):e1002034. doi: 10.1371/journal.pbio.1002034. eCollection 2015 Jan.
7
Finding minimal action sequences with a simple evaluation of actions.通过对动作进行简单评估来寻找最小动作序列。
Front Comput Neurosci. 2014 Nov 28;8:151. doi: 10.3389/fncom.2014.00151. eCollection 2014.
8
Nicotine and methamphetamine disrupt habituation of sensory reinforcer effectiveness in male rats.尼古丁和甲基苯丙胺会破坏雄性大鼠感知强化物效力的习惯化。
Exp Clin Psychopharmacol. 2014 Apr;22(2):166-75. doi: 10.1037/a0034741.
9
Habituation of reinforcer effectiveness.强化物效力的习惯化。
Front Integr Neurosci. 2014 Jan 9;7:107. doi: 10.3389/fnint.2013.00107.
10
A dynamic, embodied paradigm to investigate the role of serotonin in decision-making.一种用于研究决策中血清素作用的动态具身范例。
Front Integr Neurosci. 2013 Nov 21;7:78. doi: 10.3389/fnint.2013.00078.
基底神经节的折返环结构的功能特性:选择和强化。
Neuroscience. 2011 Dec 15;198:138-51. doi: 10.1016/j.neuroscience.2011.07.060. Epub 2011 Jul 29.
4
Robot cognitive control with a neurophysiologically inspired reinforcement learning model.基于神经生理学启发的强化学习模型的机器人认知控制。
Front Neurorobot. 2011 Jul 12;5:1. doi: 10.3389/fnbot.2011.00001. eCollection 2011.
5
Computational models of reinforcement learning: the role of dopamine as a reward signal.强化学习的计算模型:多巴胺作为奖励信号的作用。
Cogn Neurodyn. 2010 Jun;4(2):91-105. doi: 10.1007/s11571-010-9109-x. Epub 2010 Mar 21.
6
Computational models of motivated action selection in corticostriatal circuits.皮质纹状体电路中动机驱动的动作选择的计算模型。
Curr Opin Neurobiol. 2011 Jun;21(3):381-6. doi: 10.1016/j.conb.2011.02.013. Epub 2011 Apr 15.
7
Locomotor activity in a novel environment predicts both responding for a visual stimulus and self-administration of a low dose of methamphetamine in rats.在新环境中的运动活动可预测大鼠对视觉刺激的反应以及低剂量甲基苯丙胺的自我给药情况。
Behav Processes. 2011 Feb;86(2):295-304. doi: 10.1016/j.beproc.2010.12.013. Epub 2011 Jan 5.
8
Dopamine in motivational control: rewarding, aversive, and alerting.多巴胺在动机控制中的作用:奖赏、厌恶和警觉。
Neuron. 2010 Dec 9;68(5):815-34. doi: 10.1016/j.neuron.2010.11.022.
9
Rewarding feedback after correct visual discriminations has both general and specific influences on visual cortex.正确的视觉辨别后得到的奖励反馈对视觉皮层既有一般影响,也有特定影响。
J Neurophysiol. 2010 Sep;104(3):1746-57. doi: 10.1152/jn.00870.2009. Epub 2010 Jul 21.
10
Prediction, cognition and the brain.预测、认知与大脑。
Front Hum Neurosci. 2010 Mar 22;4:25. doi: 10.3389/fnhum.2010.00025. eCollection 2010.