• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强化景观的梯度影响感觉运动学习。

The gradient of the reinforcement landscape influences sensorimotor learning.

机构信息

Human Performance Laboratory, University of Calgary, Calgary, Alberta, Canada.

Hotchkiss Brain Institute, University of Calgary, Calgary, Alberta, Canada.

出版信息

PLoS Comput Biol. 2019 Mar 4;15(3):e1006839. doi: 10.1371/journal.pcbi.1006839. eCollection 2019 Mar.

DOI:10.1371/journal.pcbi.1006839
PMID:30830902
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6417747/
Abstract

Consideration of previous successes and failures is essential to mastering a motor skill. Much of what we know about how humans and animals learn from such reinforcement feedback comes from experiments that involve sampling from a small number of discrete actions. Yet, it is less understood how we learn through reinforcement feedback when sampling from a continuous set of possible actions. Navigating a continuous set of possible actions likely requires using gradient information to maximize success. Here we addressed how humans adapt the aim of their hand when experiencing reinforcement feedback that was associated with a continuous set of possible actions. Specifically, we manipulated the change in the probability of reward given a change in motor action-the reinforcement gradient-to study its influence on learning. We found that participants learned faster when exposed to a steep gradient compared to a shallow gradient. Further, when initially positioned between a steep and a shallow gradient that rose in opposite directions, participants were more likely to ascend the steep gradient. We introduce a model that captures our results and several features of motor learning. Taken together, our work suggests that the sensorimotor system relies on temporally recent and spatially local gradient information to drive learning.

摘要

考虑以前的成功和失败对于掌握运动技能至关重要。我们对人类和动物如何从这种强化反馈中学习的了解很大程度上来自于涉及从少数离散动作中采样的实验。然而,当从连续的可能动作集中采样时,我们通过强化反馈来学习的方式还不太清楚。在连续的可能动作集中导航可能需要使用梯度信息来最大化成功。在这里,我们研究了当人类经历与连续的可能动作集相关的强化反馈时,他们如何调整手部的目标。具体来说,我们操纵了给定运动动作变化时奖励概率的变化(强化梯度),以研究其对学习的影响。我们发现,与浅梯度相比,参与者在暴露于陡峭梯度时学习速度更快。此外,当最初处于一个陡峭的和一个浅的梯度之间,它们的斜率是相反的,参与者更有可能上升到陡峭的梯度。我们提出了一个模型来捕获我们的结果和运动学习的几个特征。总的来说,我们的工作表明,感觉运动系统依赖于时间上最近和空间上局部的梯度信息来驱动学习。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/b086f73ba961/pcbi.1006839.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/9f8c9bf06927/pcbi.1006839.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/e4a160323987/pcbi.1006839.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/cc3911529770/pcbi.1006839.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/109e94612cf1/pcbi.1006839.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/80967fbaed04/pcbi.1006839.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/ca80fe54fdfe/pcbi.1006839.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/cf37ec305b6f/pcbi.1006839.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/57247b64d7c8/pcbi.1006839.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/b086f73ba961/pcbi.1006839.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/9f8c9bf06927/pcbi.1006839.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/e4a160323987/pcbi.1006839.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/cc3911529770/pcbi.1006839.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/109e94612cf1/pcbi.1006839.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/80967fbaed04/pcbi.1006839.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/ca80fe54fdfe/pcbi.1006839.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/cf37ec305b6f/pcbi.1006839.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/57247b64d7c8/pcbi.1006839.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33a0/6417747/b086f73ba961/pcbi.1006839.g009.jpg

相似文献

1
The gradient of the reinforcement landscape influences sensorimotor learning.强化景观的梯度影响感觉运动学习。
PLoS Comput Biol. 2019 Mar 4;15(3):e1006839. doi: 10.1371/journal.pcbi.1006839. eCollection 2019 Mar.
2
Electroencephalographic evidence for a reinforcement learning advantage during motor skill acquisition.脑电图证据表明,在运动技能习得过程中强化学习具有优势。
Biol Psychol. 2020 Mar;151:107849. doi: 10.1016/j.biopsycho.2020.107849. Epub 2020 Jan 22.
3
Reinforcement-based processes actively regulate motor exploration along redundant solution manifolds.基于强化的过程沿冗余解流形主动调节运动探索。
Proc Biol Sci. 2023 Oct 25;290(2009):20231475. doi: 10.1098/rspb.2023.1475. Epub 2023 Oct 18.
4
Punishment Leads to Greater Sensorimotor Learning But Less Movement Variability Compared to Reward.与奖励相比,惩罚会导致更大的感觉运动学习,但运动变化性更小。
Neuroscience. 2024 Mar 5;540:12-26. doi: 10.1016/j.neuroscience.2024.01.004. Epub 2024 Jan 12.
5
Interactions between motor exploration and reinforcement learning.运动探索与强化学习的相互作用。
J Neurophysiol. 2019 Aug 1;122(2):797-808. doi: 10.1152/jn.00390.2018. Epub 2019 Jun 26.
6
Skill acquisition as a function of age, hand and task difficulty: Interactions between cognition and action.技能获取与年龄、手和任务难度的关系:认知与动作的相互作用。
PLoS One. 2019 Feb 7;14(2):e0211706. doi: 10.1371/journal.pone.0211706. eCollection 2019.
7
Somatic and Reinforcement-Based Plasticity in the Initial Stages of Human Motor Learning.人类运动学习初期基于躯体和强化的可塑性
J Neurosci. 2016 Nov 16;36(46):11682-11692. doi: 10.1523/JNEUROSCI.1767-16.2016.
8
Reinforcement Learning during Locomotion.运动过程中的强化学习。
eNeuro. 2024 Mar 15;11(3). doi: 10.1523/ENEURO.0383-23.2024. Print 2024 Mar.
9
The Role of Variability in Motor Learning.变异性在运动学习中的作用。
Annu Rev Neurosci. 2017 Jul 25;40:479-498. doi: 10.1146/annurev-neuro-072116-031548. Epub 2017 May 10.
10
A Potential Role for Reinforcement Learning in Speech Production.强化学习在言语产生中的潜在作用。
J Cogn Neurosci. 2021 Jul 1;33(8):1470-1486. doi: 10.1162/jocn_a_01742.

引用本文的文献

1
Age-dependent predictors of effective reinforcement motor learning across childhood.儿童期有效强化运动学习的年龄依赖性预测因素。
Elife. 2025 Aug 28;13:RP101036. doi: 10.7554/eLife.101036.
2
Online Movements Reflect Ongoing Deliberation.线上运动反映了持续的审议过程。
J Neurosci. 2025 Jul 30;45(31):e1913242025. doi: 10.1523/JNEUROSCI.1913-24.2025.
3
The sign of exploration during reward-based motor learning is not independent from trial to trial.基于奖励的运动学习过程中的探索信号在每次试验之间并非相互独立。

本文引用的文献

1
Neural signatures of reward and sensory error feedback processing in motor learning.运动学习中奖励和感觉错误反馈处理的神经特征。
J Neurophysiol. 2019 Apr 1;121(4):1561-1574. doi: 10.1152/jn.00792.2018. Epub 2019 Feb 27.
2
Increasing Motor Noise Impairs Reinforcement Learning in Healthy Individuals.电机噪声增大损害健康个体的强化学习能力。
eNeuro. 2018 Aug 13;5(3). doi: 10.1523/ENEURO.0050-18.2018. eCollection 2018 May-Jun.
3
The relationship between reinforcement and explicit control during visuomotor adaptation.在视动适应过程中强化与显式控制之间的关系。
Exp Brain Res. 2025 Apr 15;243(5):117. doi: 10.1007/s00221-025-07074-z.
4
Roles and interplay of reinforcement-based and error-based processes during reaching and gait in neurotypical adults and individuals with Parkinson's disease.在神经正常的成年人和帕金森病患者的伸手和步态中,基于强化和基于错误的过程的作用和相互作用。
PLoS Comput Biol. 2024 Oct 14;20(10):e1012474. doi: 10.1371/journal.pcbi.1012474. eCollection 2024 Oct.
5
Fundamental processes in sensorimotor learning: Reasoning, refinement, and retrieval.感觉运动学习的基本过程:推理、优化与检索。
Elife. 2024 Aug 1;13:e91839. doi: 10.7554/eLife.91839.
6
Age-dependent predictors of effective reinforcement motor learning across childhood.儿童期有效强化运动学习的年龄依赖性预测因素。
bioRxiv. 2025 Mar 15:2024.07.09.602665. doi: 10.1101/2024.07.09.602665.
7
Reinforcement feedback impairs locomotor adaptation and retention.强化反馈会损害运动适应性和记忆保持。
Front Behav Neurosci. 2024 Apr 24;18:1388495. doi: 10.3389/fnbeh.2024.1388495. eCollection 2024.
8
Reinforcement Learning during Locomotion.运动过程中的强化学习。
eNeuro. 2024 Mar 15;11(3). doi: 10.1523/ENEURO.0383-23.2024. Print 2024 Mar.
9
Punishment Leads to Greater Sensorimotor Learning But Less Movement Variability Compared to Reward.与奖励相比,惩罚会导致更大的感觉运动学习,但运动变化性更小。
Neuroscience. 2024 Mar 5;540:12-26. doi: 10.1016/j.neuroscience.2024.01.004. Epub 2024 Jan 12.
10
Reinforcement-based processes actively regulate motor exploration along redundant solution manifolds.基于强化的过程沿冗余解流形主动调节运动探索。
Proc Biol Sci. 2023 Oct 25;290(2009):20231475. doi: 10.1098/rspb.2023.1475. Epub 2023 Oct 18.
Sci Rep. 2018 Jun 14;8(1):9121. doi: 10.1038/s41598-018-27378-1.
4
Reward-based motor adaptation can generalize across actions.基于奖励的运动适应可以在不同动作之间进行泛化。
J Exp Psychol Learn Mem Cogn. 2019 Jan;45(1):71-81. doi: 10.1037/xlm0000573. Epub 2018 Apr 26.
5
Contribution of explicit processes to reinforcement-based motor learning.显性过程对基于强化的运动学习的贡献。
J Neurophysiol. 2018 Jun 1;119(6):2241-2255. doi: 10.1152/jn.00901.2017. Epub 2018 Mar 14.
6
High variability impairs motor learning regardless of whether it affects task performance.高度变异性会损害运动学习,无论它是否影响任务表现。
J Neurophysiol. 2018 Jan 1;119(1):39-48. doi: 10.1152/jn.00158.2017. Epub 2017 Sep 27.
7
Active lead-in variability affects motor memory formation and slows motor learning.主动导联变化会影响运动记忆的形成,从而减缓运动学习。
Sci Rep. 2017 Aug 10;7(1):7806. doi: 10.1038/s41598-017-05697-z.
8
Dissociating error-based and reinforcement-based loss functions during sensorimotor learning.在感觉运动学习过程中分离基于误差和基于强化的损失函数。
PLoS Comput Biol. 2017 Jul 28;13(7):e1005623. doi: 10.1371/journal.pcbi.1005623. eCollection 2017 Jul.
9
The Role of Variability in Motor Learning.变异性在运动学习中的作用。
Annu Rev Neurosci. 2017 Jul 25;40:479-498. doi: 10.1146/annurev-neuro-072116-031548. Epub 2017 May 10.
10
Predicting explorative motor learning using decision-making and motor noise.利用决策和运动噪声预测探索性运动学习
PLoS Comput Biol. 2017 Apr 24;13(4):e1005503. doi: 10.1371/journal.pcbi.1005503. eCollection 2017 Apr.