Department of Psychometrics & Statistics, Faculty of Behavioural and Social Sciences, University of Groningen, The Netherlands.
Department of Methodology and Statistics, Tilburg School of Social and Behavioral Sciences, Tilburg University, The Netherlands.
Br J Math Stat Psychol. 2021 Feb;74(1):99-117. doi: 10.1111/bmsp.12206. Epub 2020 Jul 20.
A test score on a psychological test is usually expressed as a normed score, representing its position relative to test scores in a reference population. These typically depend on predictor(s) such as age. The test score distribution conditional on predictors is estimated using regression, which may need large normative samples to estimate the relationships between the predictor(s) and the distribution characteristics properly. In this study, we examine to what extent this burden can be alleviated by using prior information in the estimation of new norms with Bayesian Gaussian distributional regression. In a simulation study, we investigate to what extent this norm estimation is more efficient and how robust it is to prior model deviations. We varied the prior type, prior misspecification and sample size. In our simulated conditions, using a fixed effects prior resulted in more efficient norm estimation than a weakly informative prior as long as the prior misspecification was not age dependent. With the proposed method and reasonable prior information, the same norm precision can be achieved with a smaller normative sample, at least in empirical problems similar to our simulated conditions. This may help test developers to achieve cost-efficient high-quality norms. The method is illustrated using empirical normative data from the IDS-2 intelligence test.
心理测试的测试分数通常表示为标准化分数,代表其相对于参考人群中测试分数的位置。这些通常取决于预测因素,如年龄。在预测因素的条件下,使用回归估计测试分数分布,这可能需要大量的规范样本才能正确估计预测因素与分布特征之间的关系。在这项研究中,我们研究了在使用贝叶斯高斯分布回归对新规范进行估计时,通过使用先验信息可以在多大程度上减轻这种负担。在一项模拟研究中,我们研究了这种规范估计的效率如何,以及它对先验模型偏差的稳健性如何。我们改变了先验类型、先验不恰当指定和样本量。在我们的模拟条件下,只要先验不恰当指定不依赖于年龄,使用固定效应先验比使用弱信息先验进行更有效的规范估计。使用所提出的方法和合理的先验信息,在至少与我们的模拟条件相似的实证问题中,可以使用较小的规范样本实现相同的规范精度。这可能有助于测试开发人员实现具有成本效益的高质量规范。该方法使用 IDS-2 智力测试的实证规范数据进行了说明。