Maastricht University, The Netherlands.
Br J Math Stat Psychol. 2013 May;66(2):338-52. doi: 10.1111/j.2044-8317.2012.02054.x. Epub 2012 May 28.
Laenen, Alonso, and Molenberghs (2007) and Laenen, Alonso, Molenberghs, and Vangeneugden (2009) proposed a method to assess the reliability of rating scales in a longitudinal context. The methodology is based on hierarchical linear models, and reliability coefficients are derived from the corresponding covariance matrices. However, finding a good parsimonious model to describe complex longitudinal data is a challenging task. Frequently, several models fit the data equally well, raising the problem of model selection uncertainty. When model uncertainty is high one may resort to model averaging, where inferences are based not on one but on an entire set of models. We explored the use of different model building strategies, including model averaging, in reliability estimation. We found that the approach introduced by Laenen et al. (2007, 2009) combined with some of these strategies may yield meaningful results in the presence of high model selection uncertainty and when all models are misspecified, in so far as some of them manage to capture the most salient features of the data. Nonetheless, when all models omit prominent regularities in the data, misleading results may be obtained. The main ideas are further illustrated on a case study in which the reliability of the Hamilton Anxiety Rating Scale is estimated. Importantly, the ambit of model selection uncertainty and model averaging transcends the specific setting studied in the paper and may be of interest in other areas of psychometrics.
拉内恩、阿隆索和莫伦贝赫斯(2007 年)以及拉内恩、阿隆索、莫伦贝赫斯和旺根内根(2009 年)提出了一种在纵向背景下评估评分量表可靠性的方法。该方法基于分层线性模型,可靠性系数来自相应的协方差矩阵。然而,找到一个好的简约模型来描述复杂的纵向数据是一项具有挑战性的任务。通常,有几个模型都能很好地拟合数据,这就提出了模型选择不确定性的问题。当模型不确定性很高时,人们可能会诉诸模型平均,其中推断不是基于一个模型,而是基于一整套模型。我们探讨了在可靠性估计中使用不同的建模策略,包括模型平均。我们发现,拉内恩等人(2007 年,2009 年)提出的方法与其中一些策略相结合,在模型选择不确定性高且所有模型都存在设定偏误时,可能会产生有意义的结果,只要其中一些模型能够捕捉到数据的最显著特征。然而,当所有模型都忽略了数据中的突出规律时,可能会得到误导性的结果。这些主要思想在一个案例研究中得到了进一步说明,该案例研究评估了汉密尔顿焦虑量表的可靠性。重要的是,模型选择不确定性和模型平均的范围超出了本文研究的特定背景,可能在心理计量学的其他领域也有意义。