Robustness of Adaptive Measurement of Change to Item Parameter Estimation Error.

Authors

Cooperman Allison W, Weiss David J, Wang Chun

Affiliations

University of Minnesota-Twin Cities, Minneapolis, MN, USA.

University of Washington, Seattle, WA, USA.

Publication information

Educ Psychol Meas. 2022 Aug;82(4):643-677. doi: 10.1177/00131644211033902. Epub 2021 Aug 16.

Abstract

Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests (a Z test, likelihood ratio test, and score ratio index) have demonstrated desirable statistical properties in this context, including low false positive rates and high true positive rates. However, the extant AMC research has assumed that the item parameter values in the simulated item banks were devoid of estimation error. This assumption is unrealistic for applied testing settings, where item parameters are estimated from a calibration sample before test administration. Using Monte Carlo simulation, this study evaluated the robustness of the common AMC hypothesis tests to the presence of item parameter estimation error when measuring omnibus change across four testing occasions. Results indicated that item parameter estimation error had at most a small effect on false positive rates and latent trait change recovery, and these effects were largely explained by the computerized adaptive testing item bank information functions. Differences in AMC performance as a function of item parameter estimation error and choice of hypothesis test were generally limited to simulees with particularly low or high latent trait values, where the item bank provided relatively lower information. These simulations highlight how AMC can accurately measure intra-individual change in the presence of item parameter estimation error when paired with an informative item bank. Limitations and future directions for AMC research are discussed.

Similar articles

1
Robustness of Adaptive Measurement of Change to Item Parameter Estimation Error.
Educ Psychol Meas. 2022 Aug;82(4):643-677. doi: 10.1177/00131644211033902. Epub 2021 Aug 16.
3
Parameter Recovery in Multidimensional Item Response Theory Models Under Complexity and Nonnormality.
Appl Psychol Meas. 2017 Oct;41(7):530-544. doi: 10.1177/0146621617707507. Epub 2017 May 11.
4
a-Stratified Computerized Adaptive Testing in the Presence of Calibration Error.
Educ Psychol Meas. 2015 Apr;75(2):260-283. doi: 10.1177/0013164414530719. Epub 2014 Apr 21.
6
Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach.
Educ Psychol Meas. 2016 Aug;76(4):662-684. doi: 10.1177/0013164415608418. Epub 2015 Oct 12.
7
A New Online Calibration Method for Multidimensional Computerized Adaptive Testing.
Psychometrika. 2016 Sep;81(3):674-701. doi: 10.1007/s11336-015-9482-9. Epub 2015 Nov 25.
9
An Adaptive Design for Item Parameter Online Estimation and Q-Matrix Online Calibration in CD-CAT.
Front Psychol. 2021 Aug 24;12:710497. doi: 10.3389/fpsyg.2021.710497. eCollection 2021.
10
The Impact of Fallible Item Parameter Estimates on Latent Trait Recovery.
Psychometrika. 2010 Jun;75(2):280-291. doi: 10.1007/s11336-009-9144-x.

Cited by

1
A new person-fit statistic for the detection of aberrant responses in polytomous cognitive diagnostic models.
Behav Res Methods. 2025 Apr 9;57(5):138. doi: 10.3758/s13428-025-02659-6.
2
Adaptive Measurement of Change in the Context of Item Parameter Drift.
Appl Psychol Meas. 2024 Dec 30:01466216241310599. doi: 10.1177/01466216241310599.
