Universidad Autónoma de Madrid, Madrid, Spain.
Psicothema. 2013;25(2):238-44. doi: 10.7334/psicothema2012.147.
Criterion-referenced interpretations of tests are highly necessary, and they usually involve the difficult task of establishing cut scores. In contrast to other Item Response Theory (IRT)-based standard setting methods, this study proposes a non-judgmental approach in which Item Characteristic Curve (ICC) transformations lead to the final cut scores.
eCat-Listening, a computerized adaptive test for the evaluation of English Listening, was administered to 1,576 participants, and the proposed standard setting method was applied to classify them into the performance standards of the Common European Framework of Reference for Languages (CEFR).
The results showed a classification closely related to relevant external measures of the English language domain, according to the CEFR.
It is concluded that the proposed method is a practical and valid standard setting alternative for IRT-based test interpretations.