一种关于对基于强化的决策进行认知控制的新计算解释：概率学习任务的建模

A new computational account of cognitive control over reinforcement-based decision-making: Modeling of a probabilistic learning task.

作者信息

Zendehrouh Sareh

机构信息

School of Cognitive Sciences, Institute for Research in Fundamental Sciences (IPM), P.O. Box 19395-5746, Tehran, Iran.

出版信息

Neural Netw. 2015 Nov;71:112-23. doi: 10.1016/j.neunet.2015.08.006. Epub 2015 Aug 20.

DOI:10.1016/j.neunet.2015.08.006

PMID:26339919

Abstract

Recent work on decision-making field offers an account of dual-system theory for decision-making process. This theory holds that this process is conducted by two main controllers: a goal-directed system and a habitual system. In the reinforcement learning (RL) domain, the habitual behaviors are connected with model-free methods, in which appropriate actions are learned through trial-and-error experiences. However, goal-directed behaviors are associated with model-based methods of RL, in which actions are selected using a model of the environment. Studies on cognitive control also suggest that during processes like decision-making, some cortical and subcortical structures work in concert to monitor the consequences of decisions and to adjust control according to current task demands. Here a computational model is presented based on dual system theory and cognitive control perspective of decision-making. The proposed model is used to simulate human performance on a variant of probabilistic learning task. The basic proposal is that the brain implements a dual controller, while an accompanying monitoring system detects some kinds of conflict including a hypothetical cost-conflict one. The simulation results address existing theories about two event-related potentials, namely error related negativity (ERN) and feedback related negativity (FRN), and explore the best account of them. Based on the results, some testable predictions are also presented.

摘要

近期在决策领域的研究提出了一种关于决策过程的双系统理论。该理论认为，这一过程由两个主要控制器主导：目标导向系统和习惯系统。在强化学习（RL）领域，习惯行为与无模型方法相关联，在这种方法中，通过试错经验来学习适当的行为。然而，目标导向行为与基于模型的强化学习方法相关联，在这种方法中，使用环境模型来选择行为。对认知控制的研究还表明，在决策等过程中，一些皮层和皮层下结构协同工作，以监测决策的后果，并根据当前任务需求调整控制。在此，基于双系统理论和决策的认知控制视角提出了一个计算模型。所提出的模型用于模拟人类在概率学习任务变体上的表现。基本观点是，大脑实现了一个双控制器，同时一个伴随的监测系统检测包括假设的成本冲突在内的某些类型的冲突。模拟结果阐述了关于两种事件相关电位的现有理论，即错误相关负波（ERN）和反馈相关负波（FRN），并探索了对它们的最佳解释。基于这些结果，还提出了一些可检验的预测。

相似文献

A new computational account of cognitive control over reinforcement-based decision-making: Modeling of a probabilistic learning task.

Neural Netw. 2015 Nov;71:112-23. doi: 10.1016/j.neunet.2015.08.006. Epub 2015 Aug 20.

The role of time in conflict-triggered control: Extending the theory of response-conflict monitoring.

Neurosci Lett. 2016 Apr 8;618:110-114. doi: 10.1016/j.neulet.2016.02.062. Epub 2016 Mar 2.

Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates.

Psychol Rev. 2012 Jan;119(1):120-54. doi: 10.1037/a0026435.

Feedback for reinforcement learning based brain-machine interfaces using confidence metrics.

J Neural Eng. 2017 Jun;14(3):036016. doi: 10.1088/1741-2552/aa6317. Epub 2017 Feb 27.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Predicting psychosis across diagnostic boundaries: Behavioral and computational modeling evidence for impaired reinforcement learning in schizophrenia and bipolar disorder with a history of psychosis.

J Abnorm Psychol. 2015 Aug;124(3):697-708. doi: 10.1037/abn0000039.

Reinforcement learning signals predict future decisions.

J Neurosci. 2007 Jan 10;27(2):371-8. doi: 10.1523/JNEUROSCI.4421-06.2007.

Electrophysiological correlates reflect the integration of model-based and model-free decision information.

Cogn Affect Behav Neurosci. 2017 Apr;17(2):406-421. doi: 10.3758/s13415-016-0487-3.

Dorsal anterior cingulate cortex integrates reinforcement history to guide voluntary behavior.

Cortex. 2008 May;44(5):548-59. doi: 10.1016/j.cortex.2007.08.013. Epub 2007 Dec 23.

Neuron. 2005 Aug 18;47(4):495-501. doi: 10.1016/j.neuron.2005.06.020.

引用本文的文献

Common and Distinct Neural Mechanisms Underlying Risk Seeking and Risk Aversion: Evidence From the Neuroimaging Meta-Analysis.

Hum Brain Mapp. 2025 Aug 1;46(11):e70295. doi: 10.1002/hbm.70295.

Do Individuals With Obsessive-Compulsive Disorder and Obsessive-Compulsive Personality Disorder Share Similar Neural Mechanisms of Decision-Making Under Ambiguous Circumstances?

Front Hum Neurosci. 2020 Oct 22;14:585086. doi: 10.3389/fnhum.2020.585086. eCollection 2020.

Development, Validation, and Implementation of a Medical Judgment Metric.

MDM Policy Pract. 2017 Jun 19;2(1):2381468317715262. doi: 10.1177/2381468317715262. eCollection 2017 Jan-Jun.

Medical judgement analogue studies with applications to spaceflight crew medical officer.

BMJ Simul Technol Enhanc Learn. 2017 Oct;3(4):163-168. doi: 10.1136/bmjstel-2017-000210. Epub 2017 Jun 29.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种关于对基于强化的决策进行认知控制的新计算解释：概率学习任务的建模

A new computational account of cognitive control over reinforcement-based decision-making: Modeling of a probabilistic learning task.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献