• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

再测试学习参数的可靠性。

Test-retest reliability of reinforcement learning parameters.

机构信息

Department of Psychology, University of Amsterdam, Amsterdam, the Netherlands.

Cognitive Neuroscience Department, Radboud University Medical Centre, Nijmegen, the Netherlands.

出版信息

Behav Res Methods. 2024 Aug;56(5):4582-4599. doi: 10.3758/s13428-023-02203-4. Epub 2023 Sep 8.

DOI:10.3758/s13428-023-02203-4
PMID:37684495
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11289054/
Abstract

It has recently been suggested that parameter estimates of computational models can be used to understand individual differences at the process level. One area of research in which this approach, called computational phenotyping, has taken hold is computational psychiatry. One requirement for successful computational phenotyping is that behavior and parameters are stable over time. Surprisingly, the test-retest reliability of behavior and model parameters remains unknown for most experimental tasks and models. The present study seeks to close this gap by investigating the test-retest reliability of canonical reinforcement learning models in the context of two often-used learning paradigms: a two-armed bandit and a reversal learning task. We tested independent cohorts for the two tasks (N = 69 and N = 47) via an online testing platform with a between-test interval of five weeks. Whereas reliability was high for personality and cognitive measures (with ICCs ranging from .67 to .93), it was generally poor for the parameter estimates of the reinforcement learning models (with ICCs ranging from .02 to .52 for the bandit task and from .01 to .71 for the reversal learning task). Given that simulations indicated that our procedures could detect high test-retest reliability, this suggests that a significant proportion of the variability must be ascribed to the participants themselves. In support of that hypothesis, we show that mood (stress and happiness) can partly explain within-participant variability. Taken together, these results are critical for current practices in computational phenotyping and suggest that individual variability should be taken into account in the future development of the field.

摘要

最近有人提出,计算模型的参数估计可用于了解个体差异的过程水平。这种方法,即计算表型,在计算精神病学领域得到了广泛应用。成功进行计算表型分析的一个要求是,行为和参数在时间上是稳定的。令人惊讶的是,对于大多数实验任务和模型,行为和模型参数的重测信度仍然未知。本研究旨在通过在两个常用的学习范式(双臂赌博和反转学习任务)中研究经典强化学习模型的重测信度来弥补这一空白。我们通过一个在线测试平台,对两个任务的独立队列进行了测试(N=69 和 N=47),两次测试之间的间隔为五周。虽然人格和认知测量的可靠性较高(ICC 范围从.67 到.93),但强化学习模型的参数估计的可靠性通常较差(对于赌博任务,ICC 范围从.02 到.52,对于反转学习任务,ICC 范围从.01 到.71)。鉴于模拟表明我们的程序可以检测到高重测信度,这表明很大一部分变异性必须归因于参与者本身。为了支持这一假设,我们表明情绪(压力和幸福感)可以部分解释个体内的变异性。总而言之,这些结果对于当前的计算表型分析实践至关重要,并表明在该领域的未来发展中应考虑个体变异性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/563b/11289054/5b9c21506d23/13428_2023_2203_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/563b/11289054/b53fca0d7933/13428_2023_2203_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/563b/11289054/5b9c21506d23/13428_2023_2203_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/563b/11289054/b53fca0d7933/13428_2023_2203_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/563b/11289054/5b9c21506d23/13428_2023_2203_Fig2_HTML.jpg

相似文献

1
Test-retest reliability of reinforcement learning parameters.再测试学习参数的可靠性。
Behav Res Methods. 2024 Aug;56(5):4582-4599. doi: 10.3758/s13428-023-02203-4. Epub 2023 Sep 8.
2
Reliability of Decision-Making and Reinforcement Learning Computational Parameters.决策与强化学习计算参数的可靠性
Comput Psychiatr. 2023 Feb 8;7(1):30-46. doi: 10.5334/cpsy.86. eCollection 2023.
3
Improving the Reliability of Computational Analyses: Model-Based Planning and Its Relationship With Compulsivity.提高计算分析的可靠性:基于模型的规划及其与强迫行为的关系。
Biol Psychiatry Cogn Neurosci Neuroimaging. 2020 Jun;5(6):601-609. doi: 10.1016/j.bpsc.2019.12.019. Epub 2020 Jan 13.
4
Sufficient reliability of the behavioral and computational readouts of a probabilistic reversal learning task.概率反转学习任务的行为和计算读数具有足够的可靠性。
Behav Res Methods. 2022 Dec;54(6):2993-3014. doi: 10.3758/s13428-021-01739-7. Epub 2022 Feb 15.
5
Does the reliability of computational models truly improve with hierarchical modeling? Some recommendations and considerations for the assessment of model parameter reliability : Reliability of computational model parameters.计算模型的可靠性真的会通过分层建模得到提高吗?关于评估模型参数可靠性的一些建议和考量:计算模型参数的可靠性。
Psychon Bull Rev. 2024 Dec;31(6):2465-2486. doi: 10.3758/s13423-024-02490-8. Epub 2024 May 8.
6
Reinforcement Learning during Adolescence in Rats.大鼠青春期的强化学习。
J Neurosci. 2020 Jul 22;40(30):5857-5870. doi: 10.1523/JNEUROSCI.0910-20.2020. Epub 2020 Jun 29.
7
Association of Neural and Emotional Impacts of Reward Prediction Errors With Major Depression.奖励预测误差的神经和情绪影响与重度抑郁症的关联。
JAMA Psychiatry. 2017 Aug 1;74(8):790-797. doi: 10.1001/jamapsychiatry.2017.1713.
8
Enhancing the Psychometric Properties of the Iowa Gambling Task Using Full Generative Modeling.使用全生成模型增强爱荷华赌博任务的心理测量特性。
Comput Psychiatr. 2022;6(1):189-212. doi: 10.5334/cpsy.89. Epub 2022 Aug 26.
9
Test-retest reliability of event-related functional MRI in a probabilistic reversal learning task.概率性反转学习任务中事件相关功能磁共振成像的重测信度
Psychiatry Res. 2009 Oct 30;174(1):40-6. doi: 10.1016/j.pscychresns.2009.03.003. Epub 2009 Sep 23.
10
Are methamphetamine users compulsive? Faulty reinforcement learning, not inflexibility, underlies decision making in people with methamphetamine use disorder.甲基苯丙胺使用者是否具有强迫性?在有甲基苯丙胺使用障碍的人群中,决策的基础是错误的强化学习,而不是不灵活性。
Addict Biol. 2021 Jul;26(4):e12999. doi: 10.1111/adb.12999. Epub 2021 Jan 3.

引用本文的文献

1
[Association of increased greater tubercle angle and critical shoulder angle with rotator cuff tears].大结节角度增加及临界肩角与肩袖撕裂的相关性
Beijing Da Xue Xue Bao Yi Xue Ban. 2025 Aug 18;57(4):740-747. doi: 10.19723/j.issn.1671-167X.2025.04.018.
2
Deep Learning Improves Parameter Estimation in Reinforcement Learning Models.深度学习改进强化学习模型中的参数估计。
bioRxiv. 2025 Jun 18:2025.03.21.644663. doi: 10.1101/2025.03.21.644663.
3
A task-invariant prior explains trial-by-trial active avoidance behaviour across gain and loss tasks.

本文引用的文献

1
What do Reinforcement Learning Models Measure? Interpreting Model Parameters in Cognition and Neuroscience.强化学习模型衡量的是什么?解读认知与神经科学中的模型参数。
Curr Opin Behav Sci. 2021 Oct;41:128-137. doi: 10.1016/j.cobeha.2021.06.004. Epub 2021 Jul 3.
2
Reliability and Replicability of Implicit and Explicit Reinforcement Learning Paradigms in People With Psychotic Disorders.精神分裂症患者内隐和外显强化学习范式的可靠性和可重复性。
Schizophr Bull. 2021 Apr 29;47(3):731-739. doi: 10.1093/schbul/sbaa165.
3
Improving the Reliability of Computational Analyses: Model-Based Planning and Its Relationship With Compulsivity.
一种任务不变先验解释了在收益和损失任务中逐次试验的主动回避行为。
Commun Psychol. 2025 May 22;3(1):82. doi: 10.1038/s44271-025-00254-1.
4
Test-retest reliability of computational parameters versus manifest behavior for decisional flexibility in psychosis.精神病患者决策灵活性的计算参数与明显行为的重测信度
Psychol Assess. 2025 Jun-Jul;37(6-7):273-287. doi: 10.1037/pas0001383. Epub 2025 Apr 7.
5
Sample size matters when estimating test-retest reliability of behaviour.在评估行为的重测信度时,样本量很重要。
Behav Res Methods. 2025 Mar 21;57(4):123. doi: 10.3758/s13428-025-02599-1.
6
Understanding learning through uncertainty and bias.通过不确定性和偏差来理解学习。
Commun Psychol. 2025 Feb 13;3(1):24. doi: 10.1038/s44271-025-00203-y.
7
Test-retest reliability of behavioral and computational measures of advice taking under volatility.在波动性下,采取建议的行为和计算测量的重测信度。
PLoS One. 2024 Nov 18;19(11):e0312255. doi: 10.1371/journal.pone.0312255. eCollection 2024.
8
Informing the treatment of social anxiety disorder with computational and neuroimaging data.利用计算和神经影像数据指导社交焦虑障碍的治疗。
Psychoradiology. 2024 May 3;4:kkae010. doi: 10.1093/psyrad/kkae010. eCollection 2024.
9
Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts.主动强化学习与动作偏差和滞后的比较:混合专家与非专家的控制。
PLoS Comput Biol. 2024 Mar 29;20(3):e1011950. doi: 10.1371/journal.pcbi.1011950. eCollection 2024 Mar.
10
Reliability of gamified reinforcement learning in densely sampled longitudinal assessments.密集抽样纵向评估中游戏化强化学习的可靠性
PLOS Digit Health. 2023 Sep 6;2(9):e0000330. doi: 10.1371/journal.pdig.0000330. eCollection 2023 Sep.
提高计算分析的可靠性:基于模型的规划及其与强迫行为的关系。
Biol Psychiatry Cogn Neurosci Neuroimaging. 2020 Jun;5(6):601-609. doi: 10.1016/j.bpsc.2019.12.019. Epub 2020 Jan 13.
4
Modulatory effects of positive mood and approach motivation on reward processing: Two sides of the same coin?积极情绪和趋近动机对奖励加工的调节作用:同一枚硬币的两面?
Cogn Affect Behav Neurosci. 2020 Apr;20(2):236-249. doi: 10.3758/s13415-019-00764-6.
5
Are individual differences quantitative or qualitative? An integrated behavioral and fMRI MIMIC approach.个体差异是定量的还是定性的?一种综合行为和 fMRI 模拟的方法。
Neuroimage. 2019 Nov 15;202:116058. doi: 10.1016/j.neuroimage.2019.116058. Epub 2019 Jul 26.
6
Altered learning under uncertainty in unmedicated mood and anxiety disorders.未用药的心境和焦虑障碍患者在不确定条件下的学习改变。
Nat Hum Behav. 2019 Oct;3(10):1116-1123. doi: 10.1038/s41562-019-0628-0. Epub 2019 Jun 17.
7
Reaction time variability and attention-deficit/hyperactivity disorder: is increased reaction time variability specific to attention-deficit/hyperactivity disorder? Testing predictions from the default-mode interference hypothesis.反应时变异性与注意力缺陷多动障碍:反应时变异性增加是否为注意力缺陷多动障碍所特有?检验来自默认模式干扰假说的预测。
Atten Defic Hyperact Disord. 2019 Mar;11(1):47-58. doi: 10.1007/s12402-018-0257-x. Epub 2019 Mar 29.
8
A reinforcement learning diffusion decision model for value-based decisions.基于价值的决策的强化学习扩散决策模型。
Psychon Bull Rev. 2019 Aug;26(4):1099-1121. doi: 10.3758/s13423-018-1554-2.
9
A psychometrics of individual differences in experimental tasks.实验任务中的个体差异的心理测量学。
Psychon Bull Rev. 2019 Apr;26(2):452-467. doi: 10.3758/s13423-018-1558-y.
10
Joint modeling of reaction times and choice improves parameter identifiability in reinforcement learning models.联合建模反应时间和选择可提高强化学习模型的参数可识别性。
J Neurosci Methods. 2019 Apr 1;317:37-44. doi: 10.1016/j.jneumeth.2019.01.006. Epub 2019 Jan 18.