Roid G H, Woodcock R W
George Fox University, 414 N. Meridian St., Newberg, OR 97132-2697, USA.
J Outcome Meas. 2000;4(2):579-94.
This report describes two types of findings: (a) the consistency between two major cognitive tests in terms of their developmental scales based on item-response theory, and (b) the initial development of ideas and methods for the revival of the classic concept of ratio IQ. The ratio IQ (e.g., Stanford-Binet, 1937) was formed by the division of mental age (derived from test performance) by chronological age multiplied by 100. Following a multitude of criticisms about the scaling qualities of the ratio IQ, it was mostly abandoned by the major intelligence batteries, beginning with the Wechsler scales in the 1940's, in favor of standard scores. This study presents a new approach to age equivalence scores as a basis for mental age, and the calculation of ratio IQ, based on Rasch-model item response theory. The new ratio IQ was compared statistically with standard-score IQ (mean 100, SD 15) from the Leiter International Performance Scale--Revised (Leiter-R) and from the Woodcock-Johnson Psychoeducational Battery--Revised (WJ-R). The essential element of the new ratio IQ is the W-scale, a Rasch-based score employed in the WJ-R and in the Leiter-R. Mental age was estimated from the W-scale estimate of ability and chronological age from a W-scale age equivalence for each month of age. Statistical results showed a highly similar growth curve for the W-scale scores on the Leiter-R and the WJ-R, even though the two scales have different content and standardization samples. Also, high correlations were found between the new ratio IQ and standard-score IQ (e.g., correlations ranging from .87 to .95 depending on age range). Criterion-related evidence of validity was found in the correlation of .82 between the new ratio IQ and the Wechsler Intelligence Scale for Children--Third Edition (WISC-III) standard-score IQ and in correlations with achievement-test scores. Finally, the ratio IQ showed predictable mean differences between groups of children with typical cognitive ability, cognitive delay levels of performance and giftedness. The standard deviation of the new ratio IQ was somewhat variable across age groups, however, so new interpretive guidelines would be needed if the new index is to employed in published tests. Implications of the scaling methods are discussed.
(a)基于项目反应理论,两种主要认知测试在其发展量表方面的一致性;(b)复兴比率智商这一经典概念的相关理念和方法的初步发展。比率智商(例如,1937年的斯坦福 - 比奈智力量表)是通过心理年龄(由测试成绩得出)除以实足年龄再乘以100得出的。在对比率智商的量表质量进行了诸多批评之后,从20世纪40年代的韦克斯勒智力量表开始,它在主要的智力测验工具中大多被摒弃,转而采用标准分数。本研究提出了一种新方法,以年龄等值分数作为心理年龄的基础,并基于拉施模型项目反应理论计算比率智商。将新的比率智商与来自修订版莱特国际操作量表(Leiter - R)和修订版伍德考克 - 约翰逊心理教育测验(WJ - R)的标准分数智商(均值100,标准差15)进行了统计比较。新比率智商的关键要素是W量表,这是一种用于WJ - R和Leiter - R的基于拉施模型的分数。心理年龄是根据W量表对能力的估计得出的,实足年龄则是根据每个月龄的W量表年龄等值得出的。统计结果显示,尽管Leiter - R和WJ - R这两个量表内容不同且标准化样本不同,但它们的W量表分数的增长曲线高度相似。此外,新比率智商与标准分数智商之间存在高度相关性(例如,根据年龄范围,相关性在0.87至0.95之间)。在新比率智商与韦氏儿童智力量表第三版(WISC - III)标准分数智商之间的0.82相关性以及与成就测验分数的相关性中,发现了与标准相关的效度证据。最后,比率智商在具有典型认知能力、认知延迟表现水平和天赋的儿童组之间显示出可预测的平均差异。然而,新比率智商的标准差在不同年龄组中略有变化,因此,如果要在已发表的测试中使用新指标,就需要新的解释指南。文中还讨论了量表编制方法的意义。