主动推断与两步任务。

Active inference and the two-step task.

机构信息

Neurocomputation and Neuroimaging Unit, Freie Universität Berlin, 14195, Berlin, Germany.

Berlin School of Mind and Brain, Humboldt-Universität zu Berlin, 10117, Berlin, Germany.

出版信息

Sci Rep. 2022 Oct 21;12(1):17682. doi: 10.1038/s41598-022-21766-4.

DOI:10.1038/s41598-022-21766-4

PMID:36271279

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9586964/

Abstract

Sequential decision problems distill important challenges frequently faced by humans. Through repeated interactions with an uncertain world, unknown statistics need to be learned while balancing exploration and exploitation. Reinforcement learning is a prominent method for modeling such behaviour, with a prevalent application being the two-step task. However, recent studies indicate that the standard reinforcement learning model sometimes describes features of human task behaviour inaccurately and incompletely. We investigated whether active inference, a framework proposing a trade-off to the exploration-exploitation dilemma, could better describe human behaviour. Therefore, we re-analysed four publicly available datasets of the two-step task, performed Bayesian model selection, and compared behavioural model predictions. Two datasets, which revealed more model-based inference and behaviour indicative of directed exploration, were better described by active inference, while the models scored similarly for the remaining datasets. Learning using probability distributions appears to contribute to the improved model fits. Further, approximately half of all participants showed sensitivity to information gain as formulated under active inference, although behavioural exploration effects were not fully captured. These results contribute to the empirical validation of active inference as a model of human behaviour and the study of alternative models for the influential two-step task.

摘要

序贯决策问题提炼了人类经常面临的重要挑战。通过与不确定的世界反复交互，需要学习未知的统计数据，同时平衡探索和利用。强化学习是建模这种行为的一种突出方法，其一个常见的应用是两步任务。然而，最近的研究表明，标准的强化学习模型有时不能准确和完整地描述人类任务行为的特征。我们研究了主动推理（一种提出探索-利用困境权衡的框架）是否可以更好地描述人类行为。因此，我们重新分析了两步任务的四个公开可用数据集，进行了贝叶斯模型选择，并比较了行为模型预测。两个数据集揭示了更多基于模型的推理和有指导探索的行为，主动推理可以更好地描述这些数据集，而对于其余数据集，模型的得分相似。使用概率分布进行学习似乎有助于提高模型拟合度。此外，大约一半的参与者对主动推理中所表述的信息增益表现出敏感性，尽管行为探索效应并未完全捕捉到。这些结果有助于对主动推理作为人类行为模型的实证验证，并对有影响力的两步任务的替代模型进行研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7443/9586964/cabde263ce4d/41598_2022_21766_Fig1_HTML.jpg

相似文献

Active inference and the two-step task.

Sci Rep. 2022 Oct 21;12(1):17682. doi: 10.1038/s41598-022-21766-4.

An empirical evaluation of active inference in multi-armed bandits.

Neural Netw. 2021 Dec;144:229-246. doi: 10.1016/j.neunet.2021.08.018. Epub 2021 Aug 26.

Computational mechanisms of curiosity and goal-directed exploration.

Elife. 2019 May 10;8:e41703. doi: 10.7554/eLife.41703.

The empirical status of predictive coding and active inference.

Neurosci Biobehav Rev. 2024 Feb;157:105473. doi: 10.1016/j.neubiorev.2023.105473. Epub 2023 Nov 28.

Attenuated Directed Exploration during Reinforcement Learning in Gambling Disorder.

J Neurosci. 2021 Mar 17;41(11):2512-2522. doi: 10.1523/JNEUROSCI.1607-20.2021. Epub 2021 Feb 2.

Uncertainty-driven regulation of learning and exploration in adolescents: A computational account.

PLoS Comput Biol. 2020 Sep 30;16(9):e1008276. doi: 10.1371/journal.pcbi.1008276. eCollection 2020 Sep.

A reinforcement learning diffusion decision model for value-based decisions.

Psychon Bull Rev. 2019 Aug;26(4):1099-1121. doi: 10.3758/s13423-018-1554-2.

Variability in Action Selection Relates to Striatal Dopamine 2/3 Receptor Availability in Humans: A PET Neuroimaging Study Using Reinforcement Learning and Active Inference Models.

Cereb Cortex. 2020 May 18;30(6):3573-3589. doi: 10.1093/cercor/bhz327.

Bayesian reinforcement learning: A basic overview.

Neurobiol Learn Mem. 2024 May;211:107924. doi: 10.1016/j.nlm.2024.107924. Epub 2024 Apr 3.

Regulation of reinforcement learning parameters captures long-term changes in rat behaviour.

Eur J Neurosci. 2024 Aug;60(4):4469-4490. doi: 10.1111/ejn.16449. Epub 2024 Jun 24.

引用本文的文献

The role of affective states in computational psychiatry.

Int J Neuropsychopharmacol. 2025 Aug 1;28(8). doi: 10.1093/ijnp/pyaf049.

Signatures of Perseveration and Heuristic-Based Directed Exploration in Two-Step Sequential Decision Task Behaviour.

Comput Psychiatr. 2025 Feb 11;9(1):39-62. doi: 10.5334/cpsy.101. eCollection 2025.

本文引用的文献

Slower Learning Rates from Negative Outcomes in Substance Use Disorder over a 1-Year Period and Their Potential Predictive Utility.

Comput Psychiatr. 2022 Jun 8;6(1):117-141. doi: 10.5334/cpsy.85. eCollection 2022.

Explicit knowledge of task structure is a primary determinant of human model-based action.

Nat Hum Behav. 2022 Aug;6(8):1126-1141. doi: 10.1038/s41562-022-01346-2. Epub 2022 May 19.

A step-by-step tutorial on active inference and its application to empirical data.

J Math Psychol. 2022 Apr;107. doi: 10.1016/j.jmp.2021.102632. Epub 2022 Feb 4.

An empirical evaluation of active inference in multi-armed bandits.

Neural Netw. 2021 Dec;144:229-246. doi: 10.1016/j.neunet.2021.08.018. Epub 2021 Aug 26.

Human Belief State-Based Exploration and Exploitation in an Information-Selective Symmetric Reversal Bandit Task.

Comput Brain Behav. 2021;4(4):442-462. doi: 10.1007/s42113-021-00112-3. Epub 2021 Aug 2.

Long-term stability of computational parameters during approach-avoidance conflict in a transdiagnostic psychiatric patient sample.

Sci Rep. 2021 Jun 3;11(1):11783. doi: 10.1038/s41598-021-91308-x.

Neural surprise in somatosensory Bayesian learning.

PLoS Comput Biol. 2021 Feb 2;17(2):e1008068. doi: 10.1371/journal.pcbi.1008068. eCollection 2021 Feb.

Active Inference: Demystified and Compared.

Neural Comput. 2021 Mar;33(3):674-712. doi: 10.1162/neco_a_01357. Epub 2021 Jan 5.

Learning in Volatile Environments With the Bayes Factor Surprise.

Neural Comput. 2021 Feb;33(2):269-340. doi: 10.1162/neco_a_01352. Epub 2021 Jan 5.

A Bayesian computational model reveals a failure to adapt interoceptive precision estimates across depression, anxiety, eating, and substance use disorders.

PLoS Comput Biol. 2020 Dec 14;16(12):e1008484. doi: 10.1371/journal.pcbi.1008484. eCollection 2020 Dec.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

主动推断与两步任务。

Active inference and the two-step task.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献