Vanderbilt University, Nashville, TN, USA.
The Ohio State University, Columbus, OH, USA.
Behav Res Methods. 2022 Oct;54(5):2178-2220. doi: 10.3758/s13428-021-01709-z. Epub 2022 Mar 1.
Multilevel data structures are often found in multiple substantive research areas, and multilevel models (MLMs) have been widely used to allow for such multilevel data structures. One important step when applying MLM is the selection of an optimal set of random effects to account for variability and heteroscedasticity in multilevel data. Literature reviews on current practices in applying MLM showed that diagnostic plots are only rarely used for model selection and for model checking. In this study, possible random effects and a generic description of the random effects were provided to guide researchers to select necessary random effects. In addition, based on extensive literature reviews, level-specific diagnostic plots were presented using various kinds of level-specific residuals, and diagnostic measures and statistical tests were suggested to select a set of random effects. Existing and newly proposed methods were illustrated using two data sets: a cross-sectional data set and a longitudinal data set. Along with the illustration, we discuss the methods and provide guidelines to select necessary random effects in model-building steps. R code was provided for the analyses.
多水平数据结构在多个实质性研究领域中经常出现,而多层次模型 (MLM) 已被广泛用于允许这种多水平数据结构。在应用 MLM 时,一个重要的步骤是选择最佳的随机效应集合,以解释多水平数据中的变异性和异方差性。关于当前应用 MLM 的实践的文献综述表明,诊断图仅很少用于模型选择和模型检查。在这项研究中,提供了可能的随机效应和随机效应的通用描述,以指导研究人员选择必要的随机效应。此外,基于广泛的文献综述,使用各种特定水平的残差呈现了特定水平的诊断图,并提出了诊断措施和统计检验来选择一组随机效应。使用两个数据集:横断面数据集和纵向数据集来说明现有和新提出的方法。在说明的同时,我们讨论了这些方法,并提供了在模型构建步骤中选择必要的随机效应的指导方针。提供了用于分析的 R 代码。