• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在感觉运动学习过程中分离基于误差和基于强化的损失函数。

Dissociating error-based and reinforcement-based loss functions during sensorimotor learning.

作者信息

Cashaback Joshua G A, McGregor Heather R, Mohatarem Ayman, Gribble Paul L

机构信息

Brain and Mind Institute, Department of Psychology, Western University, London, ON, Canada.

Graduate Program in Neuroscience, Western University, London, ON, Canada.

出版信息

PLoS Comput Biol. 2017 Jul 28;13(7):e1005623. doi: 10.1371/journal.pcbi.1005623. eCollection 2017 Jul.

DOI:10.1371/journal.pcbi.1005623
PMID:28753634
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5550011/
Abstract

It has been proposed that the sensorimotor system uses a loss (cost) function to evaluate potential movements in the presence of random noise. Here we test this idea in the context of both error-based and reinforcement-based learning. In a reaching task, we laterally shifted a cursor relative to true hand position using a skewed probability distribution. This skewed probability distribution had its mean and mode separated, allowing us to dissociate the optimal predictions of an error-based loss function (corresponding to the mean of the lateral shifts) and a reinforcement-based loss function (corresponding to the mode). We then examined how the sensorimotor system uses error feedback and reinforcement feedback, in isolation and combination, when deciding where to aim the hand during a reach. We found that participants compensated differently to the same skewed lateral shift distribution depending on the form of feedback they received. When provided with error feedback, participants compensated based on the mean of the skewed noise. When provided with reinforcement feedback, participants compensated based on the mode. Participants receiving both error and reinforcement feedback continued to compensate based on the mean while repeatedly missing the target, despite receiving auditory, visual and monetary reinforcement feedback that rewarded hitting the target. Our work shows that reinforcement-based and error-based learning are separable and can occur independently. Further, when error and reinforcement feedback are in conflict, the sensorimotor system heavily weights error feedback over reinforcement feedback.

摘要

有人提出,感觉运动系统使用一种损失(成本)函数来评估存在随机噪声时的潜在运动。在此,我们在基于误差和基于强化的学习背景下测试这一观点。在一项伸手够物任务中,我们使用偏态概率分布使光标相对于真实手部位置横向移动。这种偏态概率分布的均值和众数分离,使我们能够区分基于误差的损失函数(对应于横向移动的均值)和基于强化的损失函数(对应于众数)的最优预测。然后,我们研究了感觉运动系统在伸手够物过程中决定手部瞄准位置时如何单独和组合使用误差反馈和强化反馈。我们发现,参与者根据所接收反馈的形式,对相同的偏态横向移动分布做出不同的补偿。当提供误差反馈时,参与者根据偏态噪声的均值进行补偿。当提供强化反馈时,参与者根据众数进行补偿。尽管收到了奖励击中目标的听觉、视觉和金钱强化反馈,但同时收到误差和强化反馈的参与者在反复未击中目标的情况下仍继续根据均值进行补偿。我们的研究表明,基于强化的学习和基于误差的学习是可分离的,并且可以独立发生。此外,当误差反馈和强化反馈发生冲突时,感觉运动系统对误差反馈的权重远高于强化反馈。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/37b2a568c550/pcbi.1005623.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/2bf16752663f/pcbi.1005623.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/e534addaa261/pcbi.1005623.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/9adc1d834450/pcbi.1005623.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/79f8d68b3a68/pcbi.1005623.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/b3d912b7febe/pcbi.1005623.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/37b2a568c550/pcbi.1005623.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/2bf16752663f/pcbi.1005623.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/e534addaa261/pcbi.1005623.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/9adc1d834450/pcbi.1005623.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/79f8d68b3a68/pcbi.1005623.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/b3d912b7febe/pcbi.1005623.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1664/5550011/37b2a568c550/pcbi.1005623.g006.jpg

相似文献

1
Dissociating error-based and reinforcement-based loss functions during sensorimotor learning.在感觉运动学习过程中分离基于误差和基于强化的损失函数。
PLoS Comput Biol. 2017 Jul 28;13(7):e1005623. doi: 10.1371/journal.pcbi.1005623. eCollection 2017 Jul.
2
Reach adaption to a visuomotor gain with terminal error feedback involves reinforcement learning.通过终端误差反馈适应视觉运动增益涉及强化学习。
PLoS One. 2022 Jun 1;17(6):e0269297. doi: 10.1371/journal.pone.0269297. eCollection 2022.
3
Decomposition of a sensory prediction error signal for visuomotor adaptation.用于视觉运动适应的感觉预测误差信号的分解
J Exp Psychol Hum Percept Perform. 2018 Feb;44(2):176-194. doi: 10.1037/xhp0000440. Epub 2017 May 15.
4
Reward abundance interferes with error-based learning in a visuomotor adaptation task.奖励丰富度会干扰视动适应任务中的基于错误的学习。
PLoS One. 2018 Mar 7;13(3):e0193002. doi: 10.1371/journal.pone.0193002. eCollection 2018.
5
Structure Learning in Bayesian Sensorimotor Integration.贝叶斯感觉运动整合中的结构学习
PLoS Comput Biol. 2015 Aug 25;11(8):e1004369. doi: 10.1371/journal.pcbi.1004369. eCollection 2015 Aug.
6
Performing a reaching task with one arm while adapting to a visuomotor rotation with the other can lead to complete transfer of motor learning across the arms.用一只手臂执行够物任务,同时用另一只手臂适应视觉运动旋转,可能会导致运动学习在双臂之间完全迁移。
J Neurophysiol. 2015 Apr 1;113(7):2302-8. doi: 10.1152/jn.00974.2014. Epub 2015 Jan 28.
7
Relative errors can cue absolute visuomotor mappings.相对误差可以提示绝对视觉运动映射。
Exp Brain Res. 2015 Dec;233(12):3367-77. doi: 10.1007/s00221-015-4403-9. Epub 2015 Aug 18.
8
Impact of online visual feedback on motor acquisition and retention when learning to reach in a force field.在线视觉反馈对在力场中学习伸手动作时运动技能习得与保持的影响。
Neuroscience. 2016 Nov 19;337:267-275. doi: 10.1016/j.neuroscience.2016.09.020. Epub 2016 Sep 17.
9
Somatic and Reinforcement-Based Plasticity in the Initial Stages of Human Motor Learning.人类运动学习初期基于躯体和强化的可塑性
J Neurosci. 2016 Nov 16;36(46):11682-11692. doi: 10.1523/JNEUROSCI.1767-16.2016.
10
Dorsomedial frontal cortex damage impairs error-based, but not reinforcement-based motor learning in humans.背内侧额前皮质损伤损害人类基于错误的,但不损害基于强化的运动学习。
Cereb Cortex. 2024 Jan 14;34(1). doi: 10.1093/cercor/bhad424.

引用本文的文献

1
Online Movements Reflect Ongoing Deliberation.线上运动反映了持续的审议过程。
J Neurosci. 2025 Jul 30;45(31):e1913242025. doi: 10.1523/JNEUROSCI.1913-24.2025.
2
Success-efficient/failure-safe strategy for hierarchical reinforcement motor learning.分层强化运动学习的成功高效/失败安全策略。
PLoS Comput Biol. 2025 May 9;21(5):e1013089. doi: 10.1371/journal.pcbi.1013089. eCollection 2025 May.
3
Touchscreen Response Precision Is Sensitive to the Explore/Exploit Trade-off.触摸屏响应精度对探索/利用权衡敏感。

本文引用的文献

1
Saccadic suppression as a perceptual consequence of efficient sensorimotor estimation.扫视抑制作为有效感觉运动估计的一种感知结果。
Elife. 2017 May 2;6:e25073. doi: 10.7554/eLife.25073.
2
Does the sensorimotor system minimize prediction error or select the most likely prediction during object lifting?在物体提起过程中,感觉运动系统是将预测误差最小化还是选择最可能的预测?
J Neurophysiol. 2017 Jan 1;117(1):260-274. doi: 10.1152/jn.00609.2016. Epub 2016 Oct 19.
3
The Size-Weight Illusion is not anti-Bayesian after all: a unifying Bayesian account.
eNeuro. 2025 May 8;12(5). doi: 10.1523/ENEURO.0538-24.2025. Print 2025 May.
4
Exploration-based learning of a stabilizing controller predicts locomotor adaptation.基于探索的稳定控制器学习预测运动适应。
Nat Commun. 2024 Nov 3;15(1):9498. doi: 10.1038/s41467-024-53416-w.
5
Touchscreen response precision is sensitive to the explore/exploit tradeoff.触摸屏响应精度对探索/利用权衡很敏感。
bioRxiv. 2024 Oct 23:2024.10.23.619903. doi: 10.1101/2024.10.23.619903.
6
Human motor learning dynamics in high-dimensional tasks.人类在高维任务中的运动学习动力学。
PLoS Comput Biol. 2024 Oct 14;20(10):e1012455. doi: 10.1371/journal.pcbi.1012455. eCollection 2024 Oct.
7
Roles and interplay of reinforcement-based and error-based processes during reaching and gait in neurotypical adults and individuals with Parkinson's disease.在神经正常的成年人和帕金森病患者的伸手和步态中,基于强化和基于错误的过程的作用和相互作用。
PLoS Comput Biol. 2024 Oct 14;20(10):e1012474. doi: 10.1371/journal.pcbi.1012474. eCollection 2024 Oct.
8
Non-invasive stimulation of the human striatum disrupts reinforcement learning of motor skills.非侵入性刺激人类纹状体可破坏运动技能的强化学习。
Nat Hum Behav. 2024 Aug;8(8):1581-1598. doi: 10.1038/s41562-024-01901-z. Epub 2024 May 29.
9
Oculomotor behavior can be adjusted on the basis of artificial feedback signals indicating externally caused errors.眼球运动行为可以根据指示外部引起的误差的人工反馈信号进行调整。
PLoS One. 2024 May 20;19(5):e0302872. doi: 10.1371/journal.pone.0302872. eCollection 2024.
10
Reinforcement Learning during Locomotion.运动过程中的强化学习。
eNeuro. 2024 Mar 15;11(3). doi: 10.1523/ENEURO.0383-23.2024. Print 2024 Mar.
大小-重量错觉终究并非反贝叶斯的:一种统一的贝叶斯解释。
PeerJ. 2016 Jun 16;4:e2124. doi: 10.7717/peerj.2124. eCollection 2016.
4
Somatosensory Contribution to the Initial Stages of Human Motor Learning.体感对人类运动学习初始阶段的贡献。
J Neurosci. 2015 Oct 21;35(42):14316-26. doi: 10.1523/JNEUROSCI.1344-15.2015.
5
Do Cost Functions for Tracking Error Generalize across Tasks with Different Noise Levels?用于跟踪误差的成本函数能否在不同噪声水平的任务中通用?
PLoS One. 2015 Aug 27;10(8):e0136251. doi: 10.1371/journal.pone.0136251. eCollection 2015.
6
Human representation of visuo-motor uncertainty as mixtures of orthogonal basis distributions.人类将视觉运动不确定性表征为正交基分布的混合。
Nat Neurosci. 2015 Aug;18(8):1152-8. doi: 10.1038/nn.4055. Epub 2015 Jun 29.
7
Persistent residual errors in motor adaptation tasks: reversion to baseline and exploratory escape.运动适应任务中的持续性残余误差:恢复至基线水平与探索性逃避。
J Neurosci. 2015 Apr 29;35(17):6969-77. doi: 10.1523/JNEUROSCI.2656-14.2015.
8
Probabilistic information on object weight shapes force dynamics in a grip-lift task.关于物体重量的概率信息塑造了抓握-提起任务中的力动态。
Exp Brain Res. 2015 Jun;233(6):1711-20. doi: 10.1007/s00221-015-4244-6. Epub 2015 Mar 12.
9
Reward-dependent modulation of movement variability.基于奖励的运动变异性调节。
J Neurosci. 2015 Mar 4;35(9):4015-24. doi: 10.1523/JNEUROSCI.3244-14.2015.
10
The dissociable effects of punishment and reward on motor learning.惩罚和奖励对运动学习的可分离影响。
Nat Neurosci. 2015 Apr;18(4):597-602. doi: 10.1038/nn.3956. Epub 2015 Feb 23.