Grieder Silvia, Bünger Anette, Odermatt Salome D, Schweizer Florine, Grob Alexander
University of Basel, Basel, Switzerland.
Assessment. 2022 Sep;29(6):1172-1189. doi: 10.1177/10731911211005171. Epub 2021 Apr 2.
Research on comparability of general intelligence composites (GICs) is scarce and has focused exclusively on comparing GICs from different test batteries, revealing limited individual-level comparability. We add to these findings, investigating the group- and individual-level comparability of different GICs within test batteries (i.e., internal score comparability), thereby minimizing transient error and ruling out between-battery variance completely. We (a) determined the magnitude of intraindividual IQ differences, (b) investigated their impact on external validity, (c) explored possible predictors for these differences, and (d) examined ways to deal with incomparability. Results are based on the standardization samples of three intelligence test batteries, spanning from early childhood to late adulthood. Despite high group-level comparability, individual-level comparability was often unsatisfactory, especially toward the tails of the IQ distribution. This limited comparability has consequences for external validity, as GICs were differentially related to and often less predictive for school grades for individuals with high IQ differences. Of several predictors, only IQ level and age were systematically related to comparability. Consequently, findings challenge the use of overall internal consistencies for confidence intervals and suggest using confidence intervals based on test-retest reliabilities or age- and IQ-specific internal consistencies for clinical interpretation. Implications for test construction and application are discussed.
关于一般智力合成分数(GICs)可比性的研究很少,且仅专注于比较来自不同测试组的GICs,显示出有限的个体水平可比性。我们补充了这些研究结果,调查了测试组内不同GICs的组水平和个体水平可比性(即内部分数可比性),从而将瞬时误差降至最低并完全排除测试组间差异。我们(a)确定了个体内智商差异的大小,(b)研究了它们对外部效度的影响,(c)探索了这些差异的可能预测因素,以及(d)研究了处理不可比性的方法。结果基于三个智力测试组的标准化样本,涵盖从幼儿期到成年晚期。尽管组水平可比性较高,但个体水平可比性往往不尽人意,尤其是在智商分布的两端。这种有限的可比性对外部效度有影响,因为对于智商差异较大的个体,GICs与学校成绩的相关性不同且预测性往往较低。在几个预测因素中,只有智商水平和年龄与可比性有系统关联。因此,研究结果对使用总体内部一致性来确定置信区间提出了挑战,并建议在临床解释中使用基于重测信度或年龄和智商特定内部一致性的置信区间。讨论了对测试构建和应用的影响。