Wicherts Jelte M
a Department of Methodology and Statistics , Tilburg University , Tilburg , The Netherlands.
Clin Neuropsychol. 2016 Oct;30(7):1006-16. doi: 10.1080/13854046.2016.1205136. Epub 2016 Jun 30.
Neurocognitive test batteries such as recent editions of the Wechsler's Adult Intelligence Scale (WAIS-III/WAIS-IV) typically use nation-level population-based norms. The question is whether these batteries function in the same manner across different subgroups based on gender, age, educational background, socioeconomic status, ethnicity, mother tongue, or race. Here, the author argues that measurement invariance is a core issue in determining whether population-based norms are valid for different subgroups.
The author introduces measurement invariance, argues why it is an important topic of study, discusses why invariance might fail in cognitive ability testing, and reviews a dozen studies of invariance of commonly used neurocognitive test batteries.
In over half of the reviewed studies, IQ batteries were not found to be measurement invariant across groups based on ethnicity, gender, educational background, cohort, or age. Apart from age and cohort, test manuals do not take such lack of invariance into account in computing full-scale IQ scores or normed domain scores.
Measurement invariance is crucial for valid use of neurocognitive tests in clinical, educational, and professional practice. The appropriateness of population-based norms to particular subgroups should depend also on whether measurement invariance holds with respect to important subgroups.
诸如韦氏成人智力量表(WAIS - III/WAIS - IV)的最新版本等神经认知测试组合通常使用基于全国人口的常模。问题在于,这些测试组合在基于性别、年龄、教育背景、社会经济地位、种族、母语或种族划分的不同亚组中是否以相同方式发挥作用。在此,作者认为测量不变性是确定基于人口的常模对不同亚组是否有效的核心问题。
作者介绍了测量不变性,阐述了为何它是一个重要的研究课题,讨论了在认知能力测试中不变性可能失效的原因,并回顾了十几项关于常用神经认知测试组合不变性的研究。
在超过半数的被审查研究中,未发现智商测试组合在基于种族、性别、教育背景、队列或年龄的不同群体间具有测量不变性。除年龄和队列外,测试手册在计算全量表智商分数或常模化领域分数时并未考虑到这种缺乏不变性的情况。
测量不变性对于在临床、教育和专业实践中有效使用神经认知测试至关重要。基于人口的常模对特定亚组的适用性还应取决于针对重要亚组测量不变性是否成立。