Suboptimal Criterion Learning in Static and Dynamic Environments.

Authors

Norton Elyse H, Fleming Stephen M, Daw Nathaniel D, Landy Michael S

Affiliations

Department of Psychology, New York University, New York, New York, United States of America.

Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom.

Publication information

PLoS Comput Biol. 2017 Jan 3;13(1):e1005304. doi: 10.1371/journal.pcbi.1005304. eCollection 2017 Jan.

DOI:10.1371/journal.pcbi.1005304

PMID:28046006

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5242548/

Abstract

Humans often make decisions based on uncertain sensory information. Signal detection theory (SDT) describes detection and discrimination decisions as a comparison of stimulus "strength" to a fixed decision criterion. However, recent research suggests that current responses depend on the recent history of stimuli and previous responses, suggesting that the decision criterion is updated trial-by-trial. The mechanisms underpinning criterion setting remain unknown. Here, we examine how observers learn to set a decision criterion in an orientation-discrimination task under both static and dynamic conditions. To investigate mechanisms underlying trial-by-trial criterion placement, we introduce a novel task in which participants explicitly set the criterion, and compare it to a more traditional discrimination task, allowing us to model this explicit indication of criterion dynamics. In each task, stimuli were ellipses with principal orientations drawn from two categories: Gaussian distributions with different means and equal variance. In the covert-criterion task, observers categorized a displayed ellipse. In the overt-criterion task, observers adjusted the orientation of a line that served as the discrimination criterion for a subsequently presented ellipse. We compared performance to the ideal Bayesian learner and several suboptimal models that varied in both computational and memory demands. Under static and dynamic conditions, we found that, in both tasks, observers used suboptimal learning rules. In most conditions, a model in which the recent history of past samples determines a belief about category means fit the data best for most observers and on average. Our results reveal dynamic adjustment of discrimination criterion, even after prolonged training, and indicate how decision criteria are updated over time.
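
The kind of learning rule the abstract describes, in which the recent history of samples determines a belief about the category means, can be illustrated with a small simulation. The sketch below is not the authors' model or code. It assumes equal category priors, noise-free observation of orientation, category feedback on every trial, and an exponentially weighted running average as the memory rule; the function names and all numeric values (means, standard deviation, learning rate) are hypothetical. Under equal priors and equal variance, the accuracy-maximizing criterion lies midway between the two category means, so the simulated observer places its criterion at the midpoint of its current estimates.

```python
import random


def update_mean(estimate, sample, rate):
    """Exponentially weighted running average: recent samples count most."""
    return estimate + rate * (sample - estimate)


def simulate(n_trials=2000, mu_a=-10.0, mu_b=10.0, sigma=10.0,
             rate=0.1, seed=0):
    """Simulate an observer who learns the category means from feedback.

    All parameter values are illustrative assumptions, not values
    taken from the paper.
    """
    rng = random.Random(seed)
    est_a, est_b = 0.0, 0.0   # initial beliefs about the category means
    criterion = 0.0
    n_correct = 0
    for _ in range(n_trials):
        # Draw a category with equal probability, then an orientation.
        is_b = rng.random() < 0.5
        orientation = rng.gauss(mu_b if is_b else mu_a, sigma)

        # Criterion is the midpoint of the current mean estimates.
        criterion = 0.5 * (est_a + est_b)
        response_b = orientation > criterion
        n_correct += int(response_b == is_b)

        # Feedback reveals the true category; update only that estimate.
        if is_b:
            est_b = update_mean(est_b, orientation, rate)
        else:
            est_a = update_mean(est_a, orientation, rate)
    return criterion, n_correct / n_trials


if __name__ == "__main__":
    final_criterion, accuracy = simulate()
    print(f"final criterion: {final_criterion:.2f}, accuracy: {accuracy:.3f}")
```

Because the running average never stops updating, the criterion in this sketch keeps fluctuating from trial to trial even when the category means are fixed. The same rule would track category means that drift over time, as in a dynamic condition, at the cost of extra variability in a static one.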
