Correspondence to Lesa Hoffman:
J Speech Lang Hear Res. 2012 Jun;55(3):754-63. doi: 10.1044/1092-4388(2011/10-0216). Epub 2012 Jan 9.
The present work describes how vocabulary ability as assessed by 3 different forms of the Peabody Picture Vocabulary Test (PPVT; Dunn & Dunn, 1997) can be placed on a common latent metric through item response theory (IRT) modeling, by which valid comparisons of ability between samples or over time can then be made.
Responses from 2,625 cases in a longitudinal study of 697 persons for 459 unique PPVT items (175 items from Peabody Picture Vocabulary Test--Revised [PPVT-R] Form M [Dunn & Dunn, 1981], 201 items from Peabody Picture Vocabulary Test--3 [PPVT-3] Form A [Dunn & Dunn, 1997], and 83 items from PPVT-3 Form B [Dunn & Dunn, 1997]) were analyzed using a 2-parameter logistic IRT model.
The test forms each covered approximately ± 3 SDs of vocabulary ability with high reliability. Some differences between item sets in item difficulty and discrimination were found between the PPVT-3 Forms A and B.
Comparable estimates of vocabulary ability obtained from different test forms can be created through IRT modeling. The authors have also written a freely available SAS program that uses the obtained item parameters to provide IRT ability estimates given item responses to any of the 3 forms. This scoring resource will allow others with existing PPVT data to benefit from this work as well.
本研究通过项目反应理论(IRT)建模,将皮博迪图片词汇测验(PPVT;Dunn & Dunn,1997)的 3 种不同形式评估的词汇能力置于共同的潜在度量标准之下,从而可以在样本之间或随时间进行有效的能力比较。
对 697 人 459 项独特 PPVT 项目(175 项来自 Peabody Picture Vocabulary Test-Revised [PPVT-R] Form M [Dunn & Dunn,1981],201 项来自 Peabody Picture Vocabulary Test-3 [PPVT-3] Form A [Dunn & Dunn,1997],83 项来自 PPVT-3 Form B [Dunn & Dunn,1997])的 2625 例纵向研究的反应进行分析,采用双参数逻辑 IRT 模型。
各测试形式涵盖了词汇能力约 ±3SD 的范围,具有较高的可靠性。在 PPVT-3 形式 A 和 B 之间发现了不同项目集之间在项目难度和区分度方面的差异。
通过 IRT 建模可以创建来自不同测试形式的可比词汇能力估计值。作者还编写了一个免费的 SAS 程序,该程序使用获得的项目参数,根据 3 种形式中的任何一种对项目反应,提供 IRT 能力估计值。该评分资源将允许其他具有现有 PPVT 数据的人也从中受益。