Wu Tong, Kim Stella Y, Westine Carl
University of North Carolina at Charlotte, USA.
Riverside Insights, Itasca, IL.
Educ Psychol Meas. 2023 Dec;83(6):1202-1228. doi: 10.1177/00131644221140941. Epub 2022 Dec 9.
For large-scale assessments, data are often collected with missing responses. Despite the wide use of item response theory (IRT) in many testing programs, however, the existing literature offers little insight into the effectiveness of various approaches to handling missing responses in the context of scale linking. Scale linking is commonly used in large-scale assessments to maintain scale comparability over multiple forms of a test. Under a common-item nonequivalent group design (CINEG), missing data that occur to common items potentially influence the linking coefficients and, consequently, may affect scale comparability, test validity, and reliability. The objective of this study was to evaluate the effect of six missing data handling approaches, including listwise deletion (LWD), treating missing data as incorrect responses (IN), corrected item mean imputation (CM), imputing with a response function (RF), multiple imputation (MI), and full information likelihood information (FIML), on IRT scale linking accuracy when missing data occur to common items. Under a set of simulation conditions, the relative performance of the six missing data treatment methods under two missing mechanisms was explored. Results showed that RF, MI, and FIML produced less errors for conducting scale linking whereas LWD was associated with the most errors regardless of various testing conditions.
对于大规模评估,数据收集时常常会出现缺失回答的情况。然而,尽管项目反应理论(IRT)在许多测试项目中被广泛应用,但现有文献对于在量表链接背景下处理缺失回答的各种方法的有效性几乎没有深入见解。量表链接在大规模评估中普遍用于保持同一测试多种形式之间的量表可比性。在共同项目非等组设计(CINEG)下,共同项目出现的缺失数据可能会影响链接系数,进而可能影响量表可比性、测试效度和信度。本研究的目的是评估六种缺失数据处理方法,包括全列删除(LWD)、将缺失数据视为错误回答(IN)、校正项目均值插补(CM)、用反应函数插补(RF)、多重插补(MI)和完全信息似然信息(FIML),在共同项目出现缺失数据时对IRT量表链接准确性的影响。在一组模拟条件下,探讨了六种缺失数据处理方法在两种缺失机制下的相对性能。结果表明,在进行量表链接时,RF、MI和FIML产生的误差较小,而无论各种测试条件如何,LWD产生的误差最大。