The sign of exploration during reward-based motor learning is not independent from trial to trial.

Authors

Katinka van der Kooij, Jeroen B. J. Smeets, Nina M. van Mastrigt, Bernadette C. M. van Wijk

Affiliations

Department of Human Movement Sciences, Vrije Universiteit Amsterdam, van der Boechorststraat 9, 1081BT, Amsterdam, The Netherlands.

Department of Psychology, Justus-Liebig-Universität Gießen, Gießen, Germany.

Publication information

Exp Brain Res. 2025 Apr 15;243(5):117. doi: 10.1007/s00221-025-07074-z.

DOI:10.1007/s00221-025-07074-z

PMID:40232309

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12000264/

Abstract

Humans can learn various motor tasks based on binary reward feedback on whether a movement attempt was successful or not. Such 'reward-based motor learning' relies on exploiting successful motor commands and exploring different motor commands following failure. Most computational models of reward-based motor learning have formalized exploration as a random process, in which on each trial a random draw is taken from a normal distribution centred on zero. Whether human motor exploration is indeed random from trial to trial has not yet been tested. Here we tested in a force production task whether human motor exploration is random. To this end, we compared the proportion of trial-to-trial force changes in the behavioural data that have the same sign to the proportion expected under random exploration. One group of participants practiced with an adaptive reward criterion, which keeps rewarded performance close to current performance, and the other group practiced with a fixed reward criterion, under which current performance can be far from rewarded performance. In both groups, we found a proportion of same-sign changes larger than predicted. In the Adaptive group, both the learning and the proportion of same-sign changes were consistent with model simulations for low values of random exploration, whereas in the Fixed group both the learning and the proportion of same-sign changes were inconsistent with model simulations based on random exploration. This suggests that some form of non-random motor exploration contributes to reward-based motor learning.
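The same-sign measure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' model or analysis code: the function name `same_sign_proportion`, the baseline force of 5.0, the exploration standard deviation `sigma`, and the trial count are all hypothetical, and neither simulated case includes the reward-dependent updating that the paper's model simulations rely on. The sketch only shows how the proportion of consecutive same-sign changes can be computed and what it looks like in two simple reference cases: independent zero-mean Gaussian draws around a fixed intended force, and a random walk in which the changes themselves are independent.

```python
import numpy as np


def same_sign_proportion(forces):
    """Fraction of consecutive trial-to-trial changes that share a sign.

    Pairs in which either change is exactly zero are ignored, because a
    zero change has no sign.
    """
    changes = np.diff(np.asarray(forces, dtype=float))
    signs = np.sign(changes)
    prev, curr = signs[:-1], signs[1:]
    valid = (prev != 0) & (curr != 0)
    if not valid.any():
        return float("nan")
    return float(np.mean(prev[valid] == curr[valid]))


rng = np.random.default_rng(0)
n_trials = 100_000  # hypothetical; large so the estimates are stable
sigma = 1.0         # hypothetical exploration standard deviation

# Case 1 (assumption): a fixed intended force plus a fresh zero-mean
# Gaussian draw on every trial, with no learning.
iid_forces = 5.0 + rng.normal(0.0, sigma, n_trials)

# Case 2 (assumption): every trial adds a fresh draw to the previous
# force, so the changes themselves are independent (a random walk).
walk_forces = 5.0 + np.cumsum(rng.normal(0.0, sigma, n_trials))

p_iid = same_sign_proportion(iid_forces)
p_walk = same_sign_proportion(walk_forces)
print(f"independent draws: {p_iid:.3f}, random walk: {p_walk:.3f}")
```

With independent draws around a fixed value, three consecutive forces are monotonic in 2 of their 6 equally likely orderings, so the proportion approaches 1/3 rather than 1/2. With independent changes it approaches 1/2. The baseline against which behavioural data are compared therefore depends on the assumed model, which is presumably why the study compares against model simulations instead of a single fixed number.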

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce26/12000264/1acbe2b889cf/221_2025_7074_Fig1_HTML.jpg

Similar articles

Quantifying exploration in reward-based motor learning.

PLoS One. 2020 Apr 2;15(4):e0226789. doi: 10.1371/journal.pone.0226789. eCollection 2020.

Clustering analysis of movement kinematics in reinforcement learning.

J Neurophysiol. 2022 Feb 1;127(2):341-353. doi: 10.1152/jn.00229.2021. Epub 2021 Dec 22.

Alterations in the amplitude and burst rate of beta oscillations impair reward-dependent motor learning in anxiety.

Elife. 2020 May 19;9:e50654. doi: 10.7554/eLife.50654.

Interactions between motor exploration and reinforcement learning.

J Neurophysiol. 2019 Aug 1;122(2):797-808. doi: 10.1152/jn.00390.2018. Epub 2019 Jun 26.

Null effects of levodopa on reward- and error-based motor adaptation, savings, and anterograde interference.

J Neurophysiol. 2021 Jul 1;126(1):47-67. doi: 10.1152/jn.00696.2020. Epub 2021 May 26.

Somatic and Reinforcement-Based Plasticity in the Initial Stages of Human Motor Learning.

J Neurosci. 2016 Nov 16;36(46):11682-11692. doi: 10.1523/JNEUROSCI.1767-16.2016.

Domain-Specific Working Memory, But Not Dopamine-Related Genetic Variability, Shapes Reward-Based Motor Learning.

J Neurosci. 2019 Nov 20;39(47):9383-9396. doi: 10.1523/JNEUROSCI.0583-19.2019. Epub 2019 Oct 11.

Modulation of neural activity in frontopolar cortex drives reward-based motor learning.

Sci Rep. 2021 Oct 13;11(1):20303. doi: 10.1038/s41598-021-98571-y.

Pitfalls in quantifying exploration in reward-based motor learning and how to avoid them.

Biol Cybern. 2021 Aug;115(4):365-382. doi: 10.1007/s00422-021-00884-8. Epub 2021 Aug 2.
