Longitudinal stability of IRT and equivalent-groups linear and equipercentile equating.

Authors

Zhang Xiuyuan, McDermott Paul A, Fantuzzo John W, Gadsden Vivian L

Affiliation

University of Pennsylvania, USA.

Publication information

Psychol Rep. 2013 Aug;113(1):1303-25. doi: 10.2466/03.10.pr0.113x11z6.

DOI: 10.2466/03.10.pr0.113x11z6

PMID: 24340818

Abstract

A multiscale criterion-referenced test featuring two presumably equivalent forms (A and B) was administered to 1,667 Head Start children at each of four points over an academic year. Using a randomly equivalent groups design, three equating methods were applied: common-item IRT equating using concurrent calibration, linear transformation, and equipercentile transformation. The methods were compared by examining mean score differences, weighted mean squared difference, and Kolmogorov's D statistics for each subscale. The results indicated that, over time, the IRT equating method and the conventional equating methods exhibited different patterns of discrepancy between the two test forms. IRT equating yielded marginally smaller form-to-form mean score differences and slightly fewer distributional discrepancies between Forms A and B than either linear or equipercentile equating. However, the results were mixed, indicating that more studies are needed to clarify the relative merits and weaknesses of each approach.
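As a rough illustration of two quantities named in the abstract, the sketch below shows textbook linear equating (matching the mean and standard deviation of Form A scores to those of Form B) and a two-sample Kolmogorov D computed as the largest gap between empirical distribution functions. It is not the authors' procedure: the function names `linear_equate` and `ks_d`, the use of population standard deviations, and the score lists are all assumptions made for this example, and the IRT and equipercentile methods are not covered.

```python
from bisect import bisect_right
from statistics import mean, pstdev


def linear_equate(x, scores_a, scores_b):
    """Map a Form A score x onto the Form B scale by matching mean and SD."""
    mu_a, sd_a = mean(scores_a), pstdev(scores_a)
    mu_b, sd_b = mean(scores_b), pstdev(scores_b)
    return mu_b + (sd_b / sd_a) * (x - mu_a)


def ks_d(sample_1, sample_2):
    """Two-sample Kolmogorov D: largest gap between the empirical CDFs."""
    s1, s2 = sorted(sample_1), sorted(sample_2)
    d = 0.0
    # The gap can only change at observed values, so checking those suffices.
    for v in set(s1) | set(s2):
        f1 = bisect_right(s1, v) / len(s1)
        f2 = bisect_right(s2, v) / len(s2)
        d = max(d, abs(f1 - f2))
    return d


# Hypothetical raw scores, invented purely for illustration.
form_a = [10, 12, 14, 16, 18]
form_b = [20, 24, 28, 32, 36]

# Form A scores expressed on the Form B scale.
equated_a = [linear_equate(x, form_a, form_b) for x in form_a]

print("D before equating:", ks_d(form_a, form_b))
print("Equated Form A scores:", equated_a)
```

With these made-up data the two forms do not overlap at all before equating, while the equated Form A scores line up with the Form B scores. Real forms would differ in shape as well as location and scale, which is the situation where equipercentile or IRT methods can give different answers from the linear method.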

Similar articles

A Comparison of IRT Observed Score Kernel Equating and Several Equating Methods.

Front Psychol. 2020 Mar 6;11:308. doi: 10.3389/fpsyg.2020.00308. eCollection 2020.

Item Response Theory Observed-Score Kernel Equating.

Psychometrika. 2017 Mar;82(1):48-66. doi: 10.1007/s11336-016-9528-7. Epub 2016 Oct 14.

Item response theory test equating in health sciences education.

Adv Health Sci Educ Theory Pract. 2008 Mar;13(1):3-10. doi: 10.1007/s10459-006-9020-8. Epub 2006 Jul 18.

Efficiency Analysis of Item Response Theory Kernel Equating for Mixed-Format Tests.

Appl Psychol Meas. 2023 Nov;47(7-8):496-512. doi: 10.1177/01466216231209757. Epub 2023 Oct 19.

Equating and item banking with the Rasch model.

J Appl Meas. 2000;1(4):409-34.

Evaluating Equating Transformations in IRT Observed-Score and Kernel Equating Methods.

Appl Psychol Meas. 2023 Mar;47(2):123-140. doi: 10.1177/01466216221124087. Epub 2022 Oct 4.

Rasch simultaneous vertical equating for measuring reading growth.

J Appl Meas. 2003;4(1):10-23.

Comparison of proficiency in an anesthesiology course across distinct medical student cohorts: psychometric approaches to test equating.

J Chin Med Assoc. 2014 Mar;77(3):150-4. doi: 10.1016/j.jcma.2013.10.011. Epub 2013 Nov 28.

Reading Comprehension Tests for Children: Test Equating and Specific Age-Interval Reports.

Front Psychol. 2021 Sep 10;12:662192. doi: 10.3389/fpsyg.2021.662192. eCollection 2021.

Cited by

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model.

Educ Psychol Meas. 2023 Dec;83(6):1249-1290. doi: 10.1177/00131644221143051. Epub 2023 Jan 13.
