Department of Psychology, University of Amsterdam, Amsterdam, The Netherlands.
Psychon Bull Rev. 2019 Aug;26(4):1070-1098. doi: 10.3758/s13423-018-01563-9.
Evidence accumulation models (EAMs) have become the dominant modeling framework within rapid decision-making, using choice response time distributions to make inferences about the underlying decision process. These models are often applied to empirical data as "measurement tools", with different theoretical accounts being contrasted within the framework of the model. Some method is then needed to decide between these competing theoretical accounts, as assessing the models only on their ability to fit trends in the empirical data ignores model flexibility and therefore creates a bias towards more flexible models. However, there is no objectively optimal method to select between models, with methods varying in both their computational tractability and their theoretical basis. I provide a systematic comparison between nine different model selection methods using a popular EAM, the linear ballistic accumulator (LBA; Brown & Heathcote, Cognitive Psychology, 57(3), 153-178, 2008), in a large-scale simulation study and the empirical data of Dutilh et al. (Psychonomic Bulletin and Review, 1-19, 2018). I find that the "predictive accuracy" class of methods (i.e., the Akaike Information Criterion [AIC], the Deviance Information Criterion [DIC], and the Widely Applicable Information Criterion [WAIC]) makes different inferences than the "Bayes factor" class of methods (i.e., the Bayesian Information Criterion [BIC] and Bayes factors) in many, but not all, instances, and that the simpler methods (i.e., AIC and BIC) make inferences that are highly consistent with their more complex counterparts. These findings suggest that researchers should be able to use the simpler "parameter counting" methods when applying the LBA and be confident in their inferences, but that researchers need to carefully consider and justify the general class of model selection method that they use, as different classes of methods often result in different inferences.
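For reference, the "parameter counting" methods contrasted above penalize a model's maximized log-likelihood by a simple function of its number of free parameters; the standard definitions (textbook formulas, not specific to this paper's implementation) are, for a model with k free parameters, maximized likelihood L̂, and n observations:

    AIC = -2 ln L̂ + 2k
    BIC = -2 ln L̂ + k ln n

AIC's penalty does not grow with n, which is why it aligns with the "predictive accuracy" class, whereas BIC's k ln n penalty follows from its derivation as an asymptotic approximation to the log marginal likelihood, placing it in the "Bayes factor" class.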