关于合并决策标准的奖励率最优性的理论分析。

A theoretical analysis of the reward rate optimality of collapsing decision criteria.

作者信息

Boehm Udo, van Maanen Leendert, Evans Nathan J, Brown Scott D, Wagenmakers Eric-Jan

机构信息

Department of Experimental Psychology, University of Groningen, Grote Kruisstraat 2/1, 9712TS, Groningen, The Netherlands.

Department of Psychology, University of Amsterdam, 1018 XA, Amsterdam, The Netherlands.

出版信息

Atten Percept Psychophys. 2020 Jun;82(3):1520-1534. doi: 10.3758/s13414-019-01806-4.

DOI:10.3758/s13414-019-01806-4

PMID:31359378

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7303092/

Abstract

A standard assumption of most sequential sampling models is that decision-makers rely on a decision criterion that remains constant throughout the decision process. However, several authors have recently suggested that, in order to maximize reward rates in dynamic environments, decision-makers need to rely on a decision criterion that changes over the course of the decision process. We used dynamic programming and simulation methods to quantify the reward rates obtained by constant and dynamic decision criteria in different environments. We further investigated what influence a decision-maker's uncertainty about the stochastic structure of the environment has on reward rates. Our results show that in most dynamic environments, both types of decision criteria yield similar reward rates, across different levels of uncertainty. This suggests that a static decision criterion might provide a robust default setting.

摘要

大多数序贯抽样模型的一个标准假设是，决策者依赖于在整个决策过程中保持不变的决策标准。然而，最近有几位作者提出，为了在动态环境中最大化奖励率，决策者需要依赖于在决策过程中不断变化的决策标准。我们使用动态规划和模拟方法来量化在不同环境中恒定和动态决策标准所获得的奖励率。我们进一步研究了决策者对环境随机结构的不确定性对奖励率有何影响。我们的结果表明，在大多数动态环境中，在不同的不确定性水平下，这两种决策标准产生的奖励率相似。这表明静态决策标准可能提供一个稳健的默认设置。

相似文献

A theoretical analysis of the reward rate optimality of collapsing decision criteria.关于合并决策标准的奖励率最优性的理论分析。

Atten Percept Psychophys. 2020 Jun;82(3):1520-1534. doi: 10.3758/s13414-019-01806-4.

Of monkeys and men: Impatience in perceptual decision-making.猴子与人类：感知决策中的不耐烦

Psychon Bull Rev. 2016 Jun;23(3):738-49. doi: 10.3758/s13423-015-0958-5.

Normative decision rules in changing environments.规范决策规则在不断变化的环境中。

Elife. 2022 Oct 25;11:e79824. doi: 10.7554/eLife.79824.

Time-varying decision boundaries: insights from optimality analysis.时变决策边界：优化分析的见解。

Psychon Bull Rev. 2018 Jun;25(3):971-996. doi: 10.3758/s13423-017-1340-6.

Mice and rats fail to integrate exogenous timing noise into their time-based decisions.小鼠和大鼠无法将外部时间噪声整合到基于时间的决策中。

Anim Cogn. 2016 Nov;19(6):1215-1225. doi: 10.1007/s10071-016-1033-y. Epub 2016 Sep 19.

Risk Attitude in Multicriteria Decision Analysis: A Compromise Approach.多准则决策分析中的风险态度：一种妥协方法。

Int J Environ Res Public Health. 2021 Jun 17;18(12):6536. doi: 10.3390/ijerph18126536.

A Reward-Maximizing Spiking Neuron as a Bounded Rational Decision Maker.作为有限理性决策者的奖励最大化脉冲神经元

Neural Comput. 2015 Aug;27(8):1686-720. doi: 10.1162/NECO_a_00758. Epub 2015 Jun 16.

The role of passing time in decision-making.时间在决策中的作用。

J Exp Psychol Learn Mem Cogn. 2020 Feb;46(2):316-326. doi: 10.1037/xlm0000725. Epub 2019 Jun 10.

A multi-attribute decision-making model for the evaluation of uncertainties in traffic pollution control planning.交通污染控制规划不确定性评估的多属性决策模型。

Environ Sci Pollut Res Int. 2019 Jun;26(18):17911-17917. doi: 10.1007/s11356-017-0631-9. Epub 2017 Nov 4.

Time-based reward maximization.基于时间的奖励最大化。

Philos Trans R Soc Lond B Biol Sci. 2014 Jan 20;369(1637):20120461. doi: 10.1098/rstb.2012.0461. Print 2014 Mar 5.

引用本文的文献

Support for the Time-Varying Drift Rate Model of Perceptual Discrimination in Dynamic and Static Noise Using Bayesian Model-Fitting Methodology.使用贝叶斯模型拟合方法对动态和静态噪声中感知辨别随时间变化的漂移率模型的支持。

Entropy (Basel). 2024 Jul 28;26(8):642. doi: 10.3390/e26080642.

Normative decision rules in changing environments.规范决策规则在不断变化的环境中。

Elife. 2022 Oct 25;11:e79824. doi: 10.7554/eLife.79824.

Rational inference strategies and the genesis of polarization and extremism.理性推理策略与极化和极端主义的产生。

Sci Rep. 2022 May 5;12(1):7344. doi: 10.1038/s41598-022-11389-0.

Core body temperature speeds up temporal processing and choice behavior under deadlines.核心体温在截止日期下加快了时间处理和选择行为。

Sci Rep. 2019 Jul 11;9(1):10053. doi: 10.1038/s41598-019-46073-3.

本文引用的文献

Caution in decision-making under time pressure is mediated by timing ability.在时间压力下进行决策时需要谨慎，这种谨慎程度受到时间能力的影响。

Cogn Psychol. 2019 May;110:16-29. doi: 10.1016/j.cogpsych.2019.01.002. Epub 2019 Feb 5.

Optimal or not; depends on the task.最优与否；取决于任务。

Psychon Bull Rev. 2019 Jun;26(3):1027-1034. doi: 10.3758/s13423-018-1536-4.

The computations that support simple decision-making: A comparison between the diffusion and urgency-gating models.支持简单决策的计算：扩散模型与紧急门控模型的比较

Sci Rep. 2017 Nov 27;7(1):16433. doi: 10.1038/s41598-017-16694-7.

Time-varying decision boundaries: insights from optimality analysis.时变决策边界：优化分析的见解。

Psychon Bull Rev. 2018 Jun;25(3):971-996. doi: 10.3758/s13423-017-1340-6.

Comparing fixed and collapsing boundary versions of the diffusion model.比较扩散模型的固定边界和塌缩边界版本。

J Math Psychol. 2016 Aug;73:59-79. doi: 10.1016/j.jmp.2016.04.008. Epub 2016 May 24.

Learning to allocate limited time to decisions with different expected outcomes.学会为具有不同预期结果的决策分配有限的时间。

Cogn Psychol. 2017 Jun;95:17-49. doi: 10.1016/j.cogpsych.2017.03.002. Epub 2017 Apr 19.

Overcoming indecision by changing the decision boundary.通过改变决策边界来克服犹豫不决。

J Exp Psychol Gen. 2017 Jun;146(6):776-805. doi: 10.1037/xge0000286. Epub 2017 Apr 13.

Global gain modulation generates time-dependent urgency during perceptual choice in humans.全局增益调制在人类感知选择中产生时变的紧迫感。

Nat Commun. 2016 Nov 24;7:13526. doi: 10.1038/ncomms13526.

People adopt optimal policies in simple decision-making, after practice and guidance.经过实践和指导后，人们在简单决策中会采用最优策略。

Psychon Bull Rev. 2017 Apr;24(2):597-606. doi: 10.3758/s13423-016-1135-1.

Of monkeys and men: Impatience in perceptual decision-making.猴子与人类：感知决策中的不耐烦

Psychon Bull Rev. 2016 Jun;23(3):738-49. doi: 10.3758/s13423-015-0958-5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

关于合并决策标准的奖励率最优性的理论分析。

A theoretical analysis of the reward rate optimality of collapsing decision criteria.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献