• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

决策与强化学习计算参数的可靠性

Reliability of Decision-Making and Reinforcement Learning Computational Parameters.

作者信息

Mkrtchian Anahit, Valton Vincent, Roiser Jonathan P

机构信息

Neuroscience and Mental Health Group, Institute of Cognitive Neuroscience, University College London, London, United Kingdom.

Applied Computational Psychiatry Lab, Mental Health Neuroscience Department, Division of Psychiatry and Max Planck Centre for Computational Psychiatry and Ageing Research, Queen Square Institute of Neurology, University College London, London, United Kingdom.

出版信息

Comput Psychiatr. 2023 Feb 8;7(1):30-46. doi: 10.5334/cpsy.86. eCollection 2023.

DOI:10.5334/cpsy.86
PMID:38774643
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11104400/
Abstract

Computational models can offer mechanistic insight into cognition and therefore have the potential to transform our understanding of psychiatric disorders and their treatment. For translational efforts to be successful, it is imperative that computational measures capture individual characteristics reliably. Here we examine the reliability of reinforcement learning and economic models derived from two commonly used tasks. Healthy individuals (N = 50) completed a restless four-armed bandit and a calibrated gambling task twice, two weeks apart. Reward and punishment learning rates from the reinforcement learning model showed good reliability and reward and punishment sensitivity from the same model had fair reliability; while risk aversion and loss aversion parameters from a prospect theory model exhibited good and excellent reliability, respectively. Both models were further able to predict future behaviour above chance within individuals. This prediction was better when based on participants' own model parameters than other participants' parameter estimates. These results suggest that reinforcement learning, and particularly prospect theory parameters, as derived from a restless four-armed bandit and a calibrated gambling task, can be measured reliably to assess learning and decision-making mechanisms. Overall, these findings indicate the translational potential of clinically-relevant computational parameters for precision psychiatry.

摘要

计算模型能够为认知提供机制性见解,因此有潜力改变我们对精神疾病及其治疗的理解。为使转化研究取得成功,至关重要的是计算指标能够可靠地捕捉个体特征。在此,我们检验了源自两项常用任务的强化学习模型和经济模型的可靠性。50名健康个体间隔两周完成了两次不安分的四臂赌博机任务和一次校准赌博任务。强化学习模型中的奖励和惩罚学习率显示出良好的可靠性,同一模型中的奖励和惩罚敏感性具有尚可的可靠性;而前景理论模型中的风险厌恶和损失厌恶参数分别表现出良好和出色的可靠性。两个模型都能够在个体层面上对未来行为做出高于机遇水平的预测。基于参与者自身的模型参数进行预测比基于其他参与者的参数估计效果更好。这些结果表明,源自不安分的四臂赌博机任务和校准赌博任务的强化学习,尤其是前景理论参数,可以被可靠地测量以评估学习和决策机制。总体而言,这些发现表明了临床相关计算参数在精准精神病学中的转化潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/43015cf6bfc6/cpsy-7-1-86-g7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/e84b7fa224ea/cpsy-7-1-86-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/1b5db1f9e814/cpsy-7-1-86-g2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/c3876955734b/cpsy-7-1-86-g3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/8f44a14da0fe/cpsy-7-1-86-g4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/693b13cada56/cpsy-7-1-86-g5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/8713a5065431/cpsy-7-1-86-g6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/43015cf6bfc6/cpsy-7-1-86-g7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/e84b7fa224ea/cpsy-7-1-86-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/1b5db1f9e814/cpsy-7-1-86-g2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/c3876955734b/cpsy-7-1-86-g3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/8f44a14da0fe/cpsy-7-1-86-g4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/693b13cada56/cpsy-7-1-86-g5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/8713a5065431/cpsy-7-1-86-g6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6bc/11104400/43015cf6bfc6/cpsy-7-1-86-g7.jpg

相似文献

1
Reliability of Decision-Making and Reinforcement Learning Computational Parameters.决策与强化学习计算参数的可靠性
Comput Psychiatr. 2023 Feb 8;7(1):30-46. doi: 10.5334/cpsy.86. eCollection 2023.
2
Test-retest reliability of reinforcement learning parameters.再测试学习参数的可靠性。
Behav Res Methods. 2024 Aug;56(5):4582-4599. doi: 10.3758/s13428-023-02203-4. Epub 2023 Sep 8.
3
Improving the Reliability of Computational Analyses: Model-Based Planning and Its Relationship With Compulsivity.提高计算分析的可靠性:基于模型的规划及其与强迫行为的关系。
Biol Psychiatry Cogn Neurosci Neuroimaging. 2020 Jun;5(6):601-609. doi: 10.1016/j.bpsc.2019.12.019. Epub 2020 Jan 13.
4
Test-retest reliability of the play-or-pass version of the Iowa Gambling Task.玩或不玩版本的爱荷华赌博任务的重测信度。
Cogn Affect Behav Neurosci. 2024 Aug;24(4):740-754. doi: 10.3758/s13415-024-01197-6. Epub 2024 Jun 7.
5
Task Learnability Modulates Surprise but Not Valence Processing for Reinforcement Learning in Probabilistic Choice Tasks.任务可学习性调节概率选择任务中强化学习的惊奇感,但不调节效价处理。
J Cogn Neurosci. 2021 Dec 6;34(1):34-53. doi: 10.1162/jocn_a_01777.
6
Approach-avoidance reinforcement learning as a translational and computational model of anxiety-related avoidance.趋近-回避强化学习作为焦虑相关回避的转化和计算模型。
Elife. 2023 Nov 14;12:RP87720. doi: 10.7554/eLife.87720.
7
Differential effects of reward and punishment in decision making under uncertainty: a computational study.不确定性下决策中奖惩的差异效应:一项计算研究。
Front Neurosci. 2014 Feb 21;8:30. doi: 10.3389/fnins.2014.00030. eCollection 2014.
8
Attenuated Directed Exploration during Reinforcement Learning in Gambling Disorder.赌博障碍的强化学习中减弱的有导向探索。
J Neurosci. 2021 Mar 17;41(11):2512-2522. doi: 10.1523/JNEUROSCI.1607-20.2021. Epub 2021 Feb 2.
9
Driven by Pain, Not Gain: Computational Approaches to Aversion-Related Decision Making in Psychiatry.受痛苦而非收益驱动:精神病学中与厌恶相关的决策的计算方法。
Biol Psychiatry. 2020 Feb 15;87(4):359-367. doi: 10.1016/j.biopsych.2019.08.025. Epub 2019 Sep 9.
10
Reinforcement Learning in Patients With Mood and Anxiety Disorders vs Control Individuals: A Systematic Review and Meta-analysis.心境和焦虑障碍患者与对照个体的强化学习:系统评价和荟萃分析。
JAMA Psychiatry. 2022 Apr 1;79(4):313-322. doi: 10.1001/jamapsychiatry.2022.0051.

引用本文的文献

1
Model-based exploration is measurable across tasks but not linked to personality and psychiatric assessments.基于模型的探索在各项任务中是可测量的,但与人格和精神评估无关。
Sci Rep. 2025 Jul 28;15(1):27479. doi: 10.1038/s41598-025-09152-2.
2
A common alteration in effort-based decision-making in apathy, anhedonia, and late circadian rhythm.冷漠、快感缺乏和昼夜节律晚期基于努力的决策中的一种常见改变。
Elife. 2025 Jun 16;13:RP96803. doi: 10.7554/eLife.96803.
3
Test-retest reliability of computational parameters versus manifest behavior for decisional flexibility in psychosis.

本文引用的文献

1
Test-Retest Reliability of Two Computationally-Characterised Affective Bias Tasks.两项经计算表征的情感偏差任务的重测信度
Comput Psychiatr. 2024 Dec 18;8(1):217-232. doi: 10.5334/cpsy.92. eCollection 2024.
2
The interpretation of computational model parameters depends on the context.计算模型参数的解释取决于上下文。
Elife. 2022 Nov 4;11:e75474. doi: 10.7554/eLife.75474.
3
Sufficient reliability of the behavioral and computational readouts of a probabilistic reversal learning task.概率反转学习任务的行为和计算读数具有足够的可靠性。
精神病患者决策灵活性的计算参数与明显行为的重测信度
Psychol Assess. 2025 Jun-Jul;37(6-7):273-287. doi: 10.1037/pas0001383. Epub 2025 Apr 7.
4
Test-Retest Reliability of Two Computationally-Characterised Affective Bias Tasks.两项经计算表征的情感偏差任务的重测信度
Comput Psychiatr. 2024 Dec 18;8(1):217-232. doi: 10.5334/cpsy.92. eCollection 2024.
5
Test-retest reliability of behavioral and computational measures of advice taking under volatility.在波动性下,采取建议的行为和计算测量的重测信度。
PLoS One. 2024 Nov 18;19(11):e0312255. doi: 10.1371/journal.pone.0312255. eCollection 2024.
6
A computational approach to understanding effort-based decision-making in depression.一种理解抑郁症中基于努力的决策的计算方法。
bioRxiv. 2025 Apr 2:2024.06.17.599286. doi: 10.1101/2024.06.17.599286.
7
Revisiting the role of computational neuroimaging in the era of integrative neuroscience.重新审视计算神经影像学在整合神经科学时代的作用。
Neuropsychopharmacology. 2024 Nov;50(1):103-113. doi: 10.1038/s41386-024-01946-8. Epub 2024 Sep 6.
8
Does the reliability of computational models truly improve with hierarchical modeling? Some recommendations and considerations for the assessment of model parameter reliability : Reliability of computational model parameters.计算模型的可靠性真的会通过分层建模得到提高吗?关于评估模型参数可靠性的一些建议和考量:计算模型参数的可靠性。
Psychon Bull Rev. 2024 Dec;31(6):2465-2486. doi: 10.3758/s13423-024-02490-8. Epub 2024 May 8.
9
Timing along the cardiac cycle modulates neural signals of reward-based learning.心脏周期中的时间调制奖励学习的神经信号。
Nat Commun. 2024 Apr 6;15(1):2976. doi: 10.1038/s41467-024-46921-5.
10
Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts.主动强化学习与动作偏差和滞后的比较:混合专家与非专家的控制。
PLoS Comput Biol. 2024 Mar 29;20(3):e1011950. doi: 10.1371/journal.pcbi.1011950. eCollection 2024 Mar.
Behav Res Methods. 2022 Dec;54(6):2993-3014. doi: 10.3758/s13428-021-01739-7. Epub 2022 Feb 15.
4
Loss Aversion and Risk Aversion in Non-Clinical Negative Symptoms and Hypomania.非临床阴性症状和轻躁狂中的损失厌恶与风险厌恶
Front Psychiatry. 2020 Sep 23;11:574131. doi: 10.3389/fpsyt.2020.574131. eCollection 2020.
5
Advances in the computational understanding of mental illness.精神疾病计算理解方面的进展。
Neuropsychopharmacology. 2021 Jan;46(1):3-19. doi: 10.1038/s41386-020-0746-4. Epub 2020 Jul 3.
6
What Is the Test-Retest Reliability of Common Task-Functional MRI Measures? New Empirical Evidence and a Meta-Analysis.常见任务态功能磁共振测量的重测信度如何?新的实证证据和荟萃分析。
Psychol Sci. 2020 Jul;31(7):792-806. doi: 10.1177/0956797620916786. Epub 2020 Jun 3.
7
Replicating patterns of prospect theory for decision under risk.复制风险决策下前景理论的模式。
Nat Hum Behav. 2020 Jun;4(6):622-633. doi: 10.1038/s41562-020-0886-x. Epub 2020 May 18.
8
Under the Hood: Using Computational Psychiatry to Make Psychological Therapies More Mechanism-Focused.深入探究:运用计算精神病学使心理治疗更聚焦机制
Front Psychiatry. 2020 Mar 18;11:140. doi: 10.3389/fpsyt.2020.00140. eCollection 2020.
9
Improving the Reliability of Computational Analyses: Model-Based Planning and Its Relationship With Compulsivity.提高计算分析的可靠性:基于模型的规划及其与强迫行为的关系。
Biol Psychiatry Cogn Neurosci Neuroimaging. 2020 Jun;5(6):601-609. doi: 10.1016/j.bpsc.2019.12.019. Epub 2020 Jan 13.
10
Realizing the Clinical Potential of Computational Psychiatry: Report From the Banbury Center Meeting, February 2019.挖掘计算精神病学的临床潜力:2019年2月班伯里中心会议报告
Biol Psychiatry. 2020 Jul 15;88(2):e5-e10. doi: 10.1016/j.biopsych.2019.12.026. Epub 2020 Feb 26.