Schad Daniel J, Nicenboim Bruno, Vasishth Shravan
Institute for Mind, Brain and Behavior, Health and Medical University (HMU).
Department of Cognitive Science and Artificial Intelligence, Tilburg University.
Psychol Methods. 2024 Jan 25. doi: 10.1037/met0000621.
Bayesian linear mixed-effects models (LMMs) and Bayesian analysis of variance (ANOVA) are increasingly being used in the cognitive sciences to perform null hypothesis tests, where a null hypothesis that an effect is zero is compared with an alternative hypothesis that the effect exists and is different from zero. While software tools for Bayes factor null hypothesis tests are easily accessible, how to specify the data and the model correctly is often not clear. In Bayesian approaches, many authors use data aggregation at the by-subject level and estimate Bayes factors on aggregated data. Here, we use simulation-based calibration for model inference applied to several example experimental designs to demonstrate that, as with frequentist analysis, such null hypothesis tests on aggregated data can be problematic in Bayesian analysis. Specifically, when random slope variances differ (i.e., violated sphericity assumption), Bayes factors are too conservative for contrasts where the variance is small and they are too liberal for contrasts where the variance is large. Running Bayesian ANOVA on aggregated data can-if the sphericity assumption is violated-likewise lead to biased Bayes factor results. Moreover, Bayes factors for by-subject aggregated data are biased (too liberal) when random item slope variance is present but ignored in the analysis. These problems can be circumvented or reduced by running Bayesian LMMs on nonaggregated data such as on individual trials, and by explicitly modeling the full random effects structure. Reproducible code is available from https://osf.io/mjf47/. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
贝叶斯线性混合效应模型(LMMs)和贝叶斯方差分析(ANOVA)在认知科学中越来越多地用于进行零假设检验,即将效应为零的零假设与效应存在且不为零的备择假设进行比较。虽然用于贝叶斯因子零假设检验的软件工具很容易获取,但如何正确指定数据和模型往往并不明确。在贝叶斯方法中,许多作者在被试水平上进行数据聚合,并在聚合数据上估计贝叶斯因子。在这里,我们将基于模拟的校准用于模型推断,并应用于几个示例实验设计,以证明与频率主义分析一样,在贝叶斯分析中,对聚合数据进行这样的零假设检验可能会有问题。具体来说,当随机斜率方差不同时(即违反了球对称性假设),对于方差小的对比,贝叶斯因子过于保守,而对于方差大的对比,它们又过于宽松。如果违反了球对称性假设,在聚合数据上运行贝叶斯方差分析同样会导致贝叶斯因子结果出现偏差。此外,当存在随机项目斜率方差但在分析中被忽略时,被试聚合数据的贝叶斯因子会有偏差(过于宽松)。通过在非聚合数据(如单个试验数据)上运行贝叶斯线性混合效应模型,并明确对完整的随机效应结构进行建模,可以规避或减少这些问题。可从https://osf.io/mjf47/获取可重复代码。(PsycInfo数据库记录(c)2025美国心理学会,保留所有权利)