Wollack James A, Cohen Allan S, Eckerly Carol A
University of Wisconsin, Madison, WI, USA.
University of Georgia, Athens, GA, USA.
Educ Psychol Meas. 2015 Dec;75(6):931-953. doi: 10.1177/0013164414568716. Epub 2015 Jan 23.
Test tampering, especially on tests for educational accountability, is an unfortunate reality, necessitating that the state (or its testing vendor) perform data forensic analyses, such as erasure analyses, to look for signs of possible malfeasance. Few statistical approaches exist for detecting fraudulent erasures, and those that do largely do not lend themselves to making probabilistic statements about the likelihood of the observations. In this article, a new erasure detection index, , is developed, which uses item response theory to compare the number of observed wrong-to-right erasures to the number expected due to chance, conditional on the examinee's ability-level and number of erased items. A simulation study is presented to evaluate the Type I error rate and power of under various types of fraudulent and benign erasures. Results show that with a correction for continuity yields Type I error rates that are less than or equal to nominal levels for every condition studied, and has high power to detect even small amounts of tampering among the students for whom tampering is most likely.
篡改考试成绩,尤其是在教育问责制考试中,是一个不幸的现实,这使得国家(或其考试供应商)必须进行数据取证分析,如擦除分析,以寻找可能存在不当行为的迹象。用于检测欺诈性擦除的统计方法很少,而且现有的方法大多无法对观察结果的可能性做出概率性陈述。在本文中,开发了一种新的擦除检测指数,它使用项目反应理论,根据考生的能力水平和擦除项目的数量,将观察到的从错误答案改为正确答案的擦除数量与因随机因素预期的数量进行比较。本文进行了一项模拟研究,以评估在各种类型的欺诈性和良性擦除情况下该指数的第一类错误率和检验功效。结果表明,经过连续性校正后的该指数在每个研究条件下的第一类错误率均小于或等于名义水平,并且即使对于最有可能存在篡改行为的学生群体中少量的篡改行为也具有很高的检测能力。