Center for Education Development and Research in Health Professions (CEDAR), University of Groningen and University Medical Center Groningen, Antonius Deusinglaan 1, FC40, 9713, AV, Groningen, The Netherlands.
Department Business IT & Management, NHL University of Applied Sciences, Leeuwarden, Netherlands.
BMC Med Educ. 2017 Nov 9;17(1):192. doi: 10.1186/s12909-017-1051-8.
Progress testing is an assessment tool used to periodically assess all students at the end-of-curriculum level. Because students cannot know everything, it is important that they recognize their lack of knowledge. For that reason, the formula-scoring method has usually been used. However, where partial knowledge needs to be taken into account, the number-right scoring method is used. Research comparing both methods has yielded conflicting results. As far as we know, in all these studies, Classical Test Theory or Generalizability Theory was used to analyze the data. In contrast to these studies, we will explore the use of the Rasch model to compare both methods.
A 2 × 2 crossover design was used in a study where 298 students from four medical schools participated. A sample of 200 previously used questions from the progress tests was selected. The data were analyzed using the Rasch model, which provides fit parameters, reliability coefficients, and response option analysis.
The fit parameters were in the optimal interval ranging from 0.50 to 1.50, and the means were around 1.00. The person and item reliability coefficients were higher in the number-right condition than in the formula-scoring condition. The response option analysis showed that the majority of dysfunctional items emerged in the formula-scoring condition.
The findings of this study support the use of number-right scoring over formula scoring. Rasch model analyses showed that tests with number-right scoring have better psychometric properties than formula scoring. However, choosing the appropriate scoring method should depend not only on psychometric properties but also on self-directed test-taking strategies and metacognitive skills.
进展测试是一种评估工具,用于在课程结束时定期评估所有学生。由于学生不可能知道所有的知识,所以重要的是让他们认识到自己的知识不足。因此,通常使用公式评分法。但是,在需要考虑部分知识的情况下,使用答对计分法。比较这两种方法的研究结果相互矛盾。据我们所知,在所有这些研究中,经典测试理论或概化理论被用于分析数据。与这些研究不同,我们将探索使用 Rasch 模型来比较这两种方法。
在一项有 4 所医学院 298 名学生参与的 2×2 交叉设计研究中使用了这种方法。从进展测试中选择了 200 个以前使用过的样本问题。使用 Rasch 模型对数据进行分析,该模型提供拟合参数、可靠性系数和反应选项分析。
拟合参数在 0.50 到 1.50 的最佳区间内,平均值在 1.00 左右。在答对计分条件下,人与题的可靠性系数高于公式计分条件。反应选项分析表明,在公式计分条件下出现了大量功能失调的项目。
这项研究的结果支持使用答对计分而不是公式计分。Rasch 模型分析表明,使用答对计分的测试具有比公式计分更好的心理测量特性。然而,选择适当的评分方法不仅取决于心理测量特性,还取决于自主应试策略和元认知技能。