• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

动作时间和抑制错误通过调整基础决策过程中的不同机制促进学习。

Errors in Action Timing and Inhibition Facilitate Learning by Tuning Distinct Mechanisms in the Underlying Decision Process.

机构信息

Department of Psychology, and.

Center for the Neural Basis of Cognition, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213.

出版信息

J Neurosci. 2019 Mar 20;39(12):2251-2264. doi: 10.1523/JNEUROSCI.1924-18.2019. Epub 2019 Jan 17.

DOI:10.1523/JNEUROSCI.1924-18.2019
PMID:30655353
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6433756/
Abstract

Goal-directed behavior requires integrating action selection processes with learning systems that adapt control using environmental feedback. These functions are known to intersect at a common neural substrate with multiple known targets of plasticity (the cortico-basal ganglia-thalamic network), suggesting that feedback signals have a multifaceted impact on future decisions. Using a hybrid of accumulation-to-bound decision models and reinforcement learning, we modeled the performance of humans in a stop signal task where participants (N 75: 37 males, 38 females) learned the prior distribution of the timing of a stop signal through trial-and-error feedback. Changes in the drift rate of the action execution process were driven by errors in action timing, whereas adaptation in the boundary height served to increase caution following failed stops. These findings highlight two interactive learning mechanisms for adapting the control of goal-directed actions based on dissociable dimensions of feedback error. Many complex behavioral goals rely on the ability to regulate the timing of action execution while also maintaining enough control to cancel actions in response to "Stop" cues in the environment. Here we examined how these fundamental components of behavior become tuned to the control demands of the environment by combining principles of reinforcement learning with accumulation-to-bound models. Model fits to behavioral data in an adaptive stop signal task revealed two adaptive mechanisms: (1) timing error-related changes in the rate of the execution signal; and (2) an increase in the execution boundary after failed stops. These findings demonstrate unique effects of timing and control errors on the underlying mechanisms of control, the rate and threshold of accumulating action signals.

摘要

目标导向行为需要将动作选择过程与学习系统相结合,学习系统使用环境反馈来适应控制。这些功能已知在一个共同的神经基质中相交,该基质具有多个已知的可塑性靶点(皮质基底节丘脑网络),这表明反馈信号对未来的决策有多种影响。我们使用积累到边界的决策模型和强化学习的混合模型,对人类在停止信号任务中的表现进行了建模,在该任务中,参与者(N = 75:37 名男性,38 名女性)通过试错反馈学习停止信号时间的先验分布。动作执行过程的漂移率的变化是由动作定时的误差驱动的,而边界高度的适应则有助于在失败的停止后增加谨慎性。这些发现强调了两种交互式学习机制,用于根据反馈误差的可分离维度来调整目标导向动作的控制。许多复杂的行为目标依赖于调节动作执行时间的能力,同时也需要保持足够的控制能力,以便在环境中的“停止”提示下取消动作。在这里,我们通过将强化学习原则与积累到边界模型相结合,研究了这些行为的基本组成部分如何根据环境的控制要求进行调整。适应性停止信号任务中的行为数据的模型拟合揭示了两种自适应机制:(1)执行信号速率的与时间误差相关的变化;(2)失败停止后执行边界的增加。这些发现证明了时间和控制误差对控制机制、积累动作信号的速率和阈值的潜在机制有独特的影响。

相似文献

1
Errors in Action Timing and Inhibition Facilitate Learning by Tuning Distinct Mechanisms in the Underlying Decision Process.动作时间和抑制错误通过调整基础决策过程中的不同机制促进学习。
J Neurosci. 2019 Mar 20;39(12):2251-2264. doi: 10.1523/JNEUROSCI.1924-18.2019. Epub 2019 Jan 17.
2
Credit Assignment in a Motor Decision Making Task Is Influenced by Agency and Not Sensory Prediction Errors.在一项运动决策任务中,信用分配受机构影响,而不受感官预测误差影响。
J Neurosci. 2018 May 9;38(19):4521-4530. doi: 10.1523/JNEUROSCI.3601-17.2018. Epub 2018 Apr 12.
3
Cross-Task Contributions of Frontobasal Ganglia Circuitry in Response Inhibition and Conflict-Induced Slowing.额顶眶额皮层-基底神经节回路在反应抑制和冲突诱发减速中的跨任务贡献。
Cereb Cortex. 2019 May 1;29(5):1969-1983. doi: 10.1093/cercor/bhy076.
4
Reward-driven changes in striatal pathway competition shape evidence evaluation in decision-making.奖赏驱动的纹状体通路竞争变化塑造了决策中的证据评估。
PLoS Comput Biol. 2019 May 6;15(5):e1006998. doi: 10.1371/journal.pcbi.1006998. eCollection 2019 May.
5
Neural Signatures of Prediction Errors in a Decision-Making Task Are Modulated by Action Execution Failures.在一项决策任务中,预测错误的神经特征受动作执行失败的调节。
Curr Biol. 2019 May 20;29(10):1606-1613.e5. doi: 10.1016/j.cub.2019.04.011. Epub 2019 May 2.
6
Neural signatures of experience-based improvements in deterministic decision-making.基于经验的确定性决策改进的神经特征。
Behav Brain Res. 2016 Dec 15;315:51-65. doi: 10.1016/j.bbr.2016.08.023. Epub 2016 Aug 11.
7
How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.我们如何学习做决策:强化学习预测错误在人类中的快速传播。
J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.
8
Reward-Based Improvements in Motor Control Are Driven by Multiple Error-Reducing Mechanisms.基于奖励的运动控制改善是由多种错误减少机制驱动的。
J Neurosci. 2020 Apr 29;40(18):3604-3620. doi: 10.1523/JNEUROSCI.2646-19.2020. Epub 2020 Mar 31.
9
Learning Similar Actions by Reinforcement or Sensory-Prediction Errors Rely on Distinct Physiological Mechanisms.通过强化或感觉预测误差来学习相似动作依赖于不同的生理机制。
Cereb Cortex. 2018 Oct 1;28(10):3478-3490. doi: 10.1093/cercor/bhx214.
10
Predicting psychosis across diagnostic boundaries: Behavioral and computational modeling evidence for impaired reinforcement learning in schizophrenia and bipolar disorder with a history of psychosis.跨越诊断界限预测精神病:精神分裂症和有精神病病史的双相情感障碍中强化学习受损的行为和计算建模证据。
J Abnorm Psychol. 2015 Aug;124(3):697-708. doi: 10.1037/abn0000039.

引用本文的文献

1
Rhythm Facilitates Auditory Working Memory via Beta-Band Encoding and Theta-Band Maintenance.节律通过β波段编码和θ波段维持促进听觉工作记忆。
Neurosci Bull. 2025 Feb;41(2):195-210. doi: 10.1007/s12264-024-01289-w. Epub 2024 Aug 31.
2
Competing neural representations of choice shape evidence accumulation in humans.竞争的神经选择表征塑造了人类的证据积累。
Elife. 2023 Oct 11;12:e85223. doi: 10.7554/eLife.85223.
3
Identifying control ensembles for information processing within the cortico-basal ganglia-thalamic circuit.识别皮质基底神经节 - 丘脑回路内信息处理的控制集合。
PLoS Comput Biol. 2022 Jun 23;18(6):e1010255. doi: 10.1371/journal.pcbi.1010255. eCollection 2022 Jun.
4
Effects of beta-band and gamma-band rhythmic stimulation on motor inhibition.β波段和γ波段节律性刺激对运动抑制的影响。
iScience. 2022 Apr 30;25(5):104338. doi: 10.1016/j.isci.2022.104338. eCollection 2022 May 20.
5
Dynamic decision policy reconfiguration under outcome uncertainty.在结果不确定的情况下进行动态决策策略的重新配置。
Elife. 2021 Dec 24;10:e65540. doi: 10.7554/eLife.65540.
6
Adiposity covaries with signatures of asymmetric feedback learning during adaptive decisions.在适应性决策过程中,肥胖与不对称反馈学习的特征共同变化。
Soc Cogn Affect Neurosci. 2020 Nov 10;15(10):1145-1156. doi: 10.1093/scan/nsaa088.

本文引用的文献

1
Cortical beta power reflects decision dynamics and uncovers multiple facets of post-error adaptation.皮质β功率反映决策动态,揭示了错误后适应的多个方面。
Nat Commun. 2018 Nov 28;9(1):5038. doi: 10.1038/s41467-018-07456-8.
2
A competitive model for striatal action selection.纹状体动作选择的竞争模型。
Brain Res. 2019 Jun 15;1713:70-79. doi: 10.1016/j.brainres.2018.10.009. Epub 2018 Oct 6.
3
Striatal activity during reactive inhibition is related to the expectation of stop-signals.纹状体在反应性抑制期间的活动与对停止信号的预期有关。
Neuroscience. 2017 Oct 11;361:192-198. doi: 10.1016/j.neuroscience.2017.08.037. Epub 2017 Aug 24.
4
Distinct Sources of Deterministic and Stochastic Components of Action Timing Decisions in Rodent Frontal Cortex.在啮齿动物前额皮质中,动作时间决策的确定性和随机性成分的不同来源。
Neuron. 2017 May 17;94(4):908-919.e7. doi: 10.1016/j.neuron.2017.04.040.
5
Parameter recovery, bias and standard errors in the linear ballistic accumulator model.线性弹道累加器模型中的参数恢复、偏差和标准误差
Br J Math Stat Psychol. 2017 May;70(2):280-296. doi: 10.1111/bmsp.12100.
6
Testing the validity of conflict drift-diffusion models for use in estimating cognitive processes: A parameter-recovery study.检验用于估计认知过程的冲突漂移扩散模型的有效性:一项参数恢复研究。
Psychon Bull Rev. 2018 Feb;25(1):286-301. doi: 10.3758/s13423-017-1271-2.
7
Models of inhibitory control.抑制控制模型。
Philos Trans R Soc Lond B Biol Sci. 2017 Apr 19;372(1718). doi: 10.1098/rstb.2016.0193.
8
Distinct mechanisms mediate speed-accuracy adjustments in cortico-subthalamic networks.不同机制介导皮质-丘脑底核网络中的速度-准确性调整。
Elife. 2017 Jan 31;6:e21481. doi: 10.7554/eLife.21481.
9
The human subthalamic nucleus and globus pallidus internus differentially encode reward during action control.人类丘脑底核和苍白球内侧部在动作控制过程中对奖励进行不同编码。
Hum Brain Mapp. 2017 Apr;38(4):1952-1964. doi: 10.1002/hbm.23496. Epub 2017 Jan 28.
10
On the Globality of Motor Suppression: Unexpected Events and Their Influence on Behavior and Cognition.论运动抑制的全局性:意外事件及其对行为和认知的影响。
Neuron. 2017 Jan 18;93(2):259-280. doi: 10.1016/j.neuron.2016.12.013.