Seo Dong Gi, Kim Myeong Gi, Kim Na Hui, Shin Hye Sook, Kim Hyun Jung
Department of Psychology, College of Social Science, Hallym University, Chuncheon, Korea.
Department of Education, College of Education, Kangwon National University, Chuncheon, Korea.
J Educ Eval Health Prof. 2018;15:26. doi: 10.3352/jeehp.2018.15.26. Epub 2018 Oct 18.
This study aimed to find the best way to develop equivalent item sets and to propose a stable and effective management plan for periodic licensing examinations.
Five pre-equated item sets were developed using linear programming, based on the predicted correct answer rate of each item. These pre-equated item sets were compared with item sets assembled by random item selection, in terms of the actual answer rate and the difficulty parameter from item response theory (IRT). The results with and without common items were compared in the same way. The actual correct answer rate (ACAR) and the IRT difficulty were used to determine whether the pre-equating conditions differed significantly.
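The abstract does not give the linear programming model itself, so the following is only one plausible 0-1 formulation of such an assembly problem, written under stated assumptions. All symbols are hypothetical: $x_{ij}$ indicates that item $i$ is placed in set $j$, $p_i$ is the predicted correct answer rate of item $i$, $C_k$ is the pool of items in content area $k$, $n_k$ is the number of items each set needs from area $k$, $\bar{p}$ is a target mean rate, and $d$ is the largest deviation from that target across the five sets. The last constraint assumes no common items.

```latex
\begin{aligned}
\min_{x,\,d}\quad & d \\
\text{s.t.}\quad
 & -d \;\le\; \sum_{i} p_i\, x_{ij} \;-\; n\,\bar{p} \;\le\; d
   && j = 1,\dots,5, \quad n = \sum_{k} n_k, \\
 & \sum_{i \in C_k} x_{ij} = n_k
   && \text{for every set } j \text{ and content area } k, \\
 & \sum_{j=1}^{5} x_{ij} \le 1
   && \text{for every item } i, \\
 & x_{ij} \in \{0, 1\}, \quad d \ge 0 .
\end{aligned}
```

Under this sketch, a condition with common items would fix $x_{ij} = 1$ in every set $j$ for the shared items and exempt them from the third constraint.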
There was a statistically significant difference in IRT difficulty among the results from the different pre-equating conditions. When the predicted correct answer rate was divided into 2 or 3 difficulty boundaries, the 5 item sets were equivalent in actual answer rate and IRT difficulty parameters. In the comparison of item set conditions with and without common items, including common items did not contribute much to the equating of the 5 item sets.
This study suggests that the linear programming method is applicable to constructing equated item sets that reflect each content area. The best method suggested for constructing equated item sets is to divide the predicted correct answer rate into 2 or 3 difficulty boundaries, regardless of common items. If pre-equated item sets are required to construct a test based on actual data, several optimal methods should be considered through simulation studies before administering a real test.
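The idea of dividing the predicted correct answer rate into 2 or 3 difficulty boundaries within each content area can be illustrated with a simplified sketch. This is not the authors' linear programming procedure: it is a plain stratified assignment that deals items from each (content area, difficulty band) stratum across the sets in serpentine order. The function name `assign_to_sets`, the boundary values, and the toy item pool are all hypothetical.

```python
from collections import defaultdict


def assign_to_sets(items, boundaries, n_sets=5):
    """Deal items into n_sets, balancing content area and difficulty band.

    items: list of (item_id, content_area, predicted_correct_answer_rate).
    boundaries: ascending cut points on the predicted rate (hypothetical values).
    """
    strata = defaultdict(list)
    for item_id, area, rate in items:
        # Band index = number of cut points at or below the item's rate.
        band = sum(rate >= b for b in boundaries)
        strata[(area, band)].append((rate, item_id))

    sets = [[] for _ in range(n_sets)]
    for key in sorted(strata):
        ranked = sorted(strata[key], reverse=True)
        # Use whole rounds only, so every set gets the same count per stratum.
        usable = len(ranked) - len(ranked) % n_sets
        for pos, (_, item_id) in enumerate(ranked[:usable]):
            round_no, offset = divmod(pos, n_sets)
            # Serpentine order: reverse direction on every other round.
            j = offset if round_no % 2 == 0 else n_sets - 1 - offset
            sets[j].append(item_id)
    return sets


# Toy pool: 2 content areas, 20 items each, predicted rates 0.30 to 0.87.
demo_items = [
    (f"{area}{k:02d}", area, (30 + 3 * k) / 100)
    for area in ("A", "B")
    for k in range(20)
]
demo_sets = assign_to_sets(demo_items, boundaries=[0.5, 0.7])
for number, item_set in enumerate(demo_sets, start=1):
    print(number, item_set)
```

Items left over after the last whole round in a stratum are simply not used here. A linear programming model would instead choose among all items to meet the content and difficulty constraints exactly.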