• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种关于对基于强化的决策进行认知控制的新计算解释:概率学习任务的建模

A new computational account of cognitive control over reinforcement-based decision-making: Modeling of a probabilistic learning task.

作者信息

Zendehrouh Sareh

机构信息

School of Cognitive Sciences, Institute for Research in Fundamental Sciences (IPM), P.O. Box 19395-5746, Tehran, Iran.

出版信息

Neural Netw. 2015 Nov;71:112-23. doi: 10.1016/j.neunet.2015.08.006. Epub 2015 Aug 20.

DOI:10.1016/j.neunet.2015.08.006
PMID:26339919
Abstract

Recent work on decision-making field offers an account of dual-system theory for decision-making process. This theory holds that this process is conducted by two main controllers: a goal-directed system and a habitual system. In the reinforcement learning (RL) domain, the habitual behaviors are connected with model-free methods, in which appropriate actions are learned through trial-and-error experiences. However, goal-directed behaviors are associated with model-based methods of RL, in which actions are selected using a model of the environment. Studies on cognitive control also suggest that during processes like decision-making, some cortical and subcortical structures work in concert to monitor the consequences of decisions and to adjust control according to current task demands. Here a computational model is presented based on dual system theory and cognitive control perspective of decision-making. The proposed model is used to simulate human performance on a variant of probabilistic learning task. The basic proposal is that the brain implements a dual controller, while an accompanying monitoring system detects some kinds of conflict including a hypothetical cost-conflict one. The simulation results address existing theories about two event-related potentials, namely error related negativity (ERN) and feedback related negativity (FRN), and explore the best account of them. Based on the results, some testable predictions are also presented.

摘要

近期在决策领域的研究提出了一种关于决策过程的双系统理论。该理论认为,这一过程由两个主要控制器主导:目标导向系统和习惯系统。在强化学习(RL)领域,习惯行为与无模型方法相关联,在这种方法中,通过试错经验来学习适当的行为。然而,目标导向行为与基于模型的强化学习方法相关联,在这种方法中,使用环境模型来选择行为。对认知控制的研究还表明,在决策等过程中,一些皮层和皮层下结构协同工作,以监测决策的后果,并根据当前任务需求调整控制。在此,基于双系统理论和决策的认知控制视角提出了一个计算模型。所提出的模型用于模拟人类在概率学习任务变体上的表现。基本观点是,大脑实现了一个双控制器,同时一个伴随的监测系统检测包括假设的成本冲突在内的某些类型的冲突。模拟结果阐述了关于两种事件相关电位的现有理论,即错误相关负波(ERN)和反馈相关负波(FRN),并探索了对它们的最佳解释。基于这些结果,还提出了一些可检验的预测。

相似文献

1
A new computational account of cognitive control over reinforcement-based decision-making: Modeling of a probabilistic learning task.一种关于对基于强化的决策进行认知控制的新计算解释:概率学习任务的建模
Neural Netw. 2015 Nov;71:112-23. doi: 10.1016/j.neunet.2015.08.006. Epub 2015 Aug 20.
2
The role of time in conflict-triggered control: Extending the theory of response-conflict monitoring.时间在冲突引发控制中的作用:扩展反应冲突监测理论
Neurosci Lett. 2016 Apr 8;618:110-114. doi: 10.1016/j.neulet.2016.02.062. Epub 2016 Mar 2.
3
Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates.目标导向决策作为概率推理:计算框架和潜在的神经关联。
Psychol Rev. 2012 Jan;119(1):120-54. doi: 10.1037/a0026435.
4
Feedback for reinforcement learning based brain-machine interfaces using confidence metrics.基于置信度指标的用于脑机接口的强化学习反馈
J Neural Eng. 2017 Jun;14(3):036016. doi: 10.1088/1741-2552/aa6317. Epub 2017 Feb 27.
5
How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.我们如何学习做决策:强化学习预测错误在人类中的快速传播。
J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.
6
Predicting psychosis across diagnostic boundaries: Behavioral and computational modeling evidence for impaired reinforcement learning in schizophrenia and bipolar disorder with a history of psychosis.跨越诊断界限预测精神病:精神分裂症和有精神病病史的双相情感障碍中强化学习受损的行为和计算建模证据。
J Abnorm Psychol. 2015 Aug;124(3):697-708. doi: 10.1037/abn0000039.
7
Reinforcement learning signals predict future decisions.强化学习信号预测未来决策。
J Neurosci. 2007 Jan 10;27(2):371-8. doi: 10.1523/JNEUROSCI.4421-06.2007.
8
Electrophysiological correlates reflect the integration of model-based and model-free decision information.电生理相关性反映了基于模型和无模型决策信息的整合。
Cogn Affect Behav Neurosci. 2017 Apr;17(2):406-421. doi: 10.3758/s13415-016-0487-3.
9
Dorsal anterior cingulate cortex integrates reinforcement history to guide voluntary behavior.背侧前扣带回皮层整合强化历史以指导自愿行为。
Cortex. 2008 May;44(5):548-59. doi: 10.1016/j.cortex.2007.08.013. Epub 2007 Dec 23.
10
Error-related negativity predicts reinforcement learning and conflict biases.错误相关负波可预测强化学习和冲突偏向。
Neuron. 2005 Aug 18;47(4):495-501. doi: 10.1016/j.neuron.2005.06.020.

引用本文的文献

1
Common and Distinct Neural Mechanisms Underlying Risk Seeking and Risk Aversion: Evidence From the Neuroimaging Meta-Analysis.寻求风险和规避风险背后的共同和不同神经机制:来自神经影像学荟萃分析的证据
Hum Brain Mapp. 2025 Aug 1;46(11):e70295. doi: 10.1002/hbm.70295.
2
Do Individuals With Obsessive-Compulsive Disorder and Obsessive-Compulsive Personality Disorder Share Similar Neural Mechanisms of Decision-Making Under Ambiguous Circumstances?患有强迫症和强迫型人格障碍的个体在模糊情境下的决策神经机制是否相似?
Front Hum Neurosci. 2020 Oct 22;14:585086. doi: 10.3389/fnhum.2020.585086. eCollection 2020.
3
Development, Validation, and Implementation of a Medical Judgment Metric.
医学判断指标的开发、验证与实施
MDM Policy Pract. 2017 Jun 19;2(1):2381468317715262. doi: 10.1177/2381468317715262. eCollection 2017 Jan-Jun.
4
Medical judgement analogue studies with applications to spaceflight crew medical officer.应用于航天机组医务人员的医学判断模拟研究。
BMJ Simul Technol Enhanc Learn. 2017 Oct;3(4):163-168. doi: 10.1136/bmjstel-2017-000210. Epub 2017 Jun 29.