公式评分与数字评分在本科医学教育中的比较：一项 Rasch 模型分析。

Comparison of formula and number-right scoring in undergraduate medical training: a Rasch model analysis.

机构信息

Center for Education Development and Research in Health Professions (CEDAR), University of Groningen and University Medical Center Groningen, Antonius Deusinglaan 1, FC40, 9713, AV, Groningen, The Netherlands.

Department Business IT & Management, NHL University of Applied Sciences, Leeuwarden, Netherlands.

出版信息

BMC Med Educ. 2017 Nov 9;17(1):192. doi: 10.1186/s12909-017-1051-8.

DOI:10.1186/s12909-017-1051-8

PMID:29121888

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5679154/

Abstract

BACKGROUND

Progress testing is an assessment tool used to periodically assess all students at the end-of-curriculum level. Because students cannot know everything, it is important that they recognize their lack of knowledge. For that reason, the formula-scoring method has usually been used. However, where partial knowledge needs to be taken into account, the number-right scoring method is used. Research comparing both methods has yielded conflicting results. As far as we know, in all these studies, Classical Test Theory or Generalizability Theory was used to analyze the data. In contrast to these studies, we will explore the use of the Rasch model to compare both methods.

METHODS

A 2 × 2 crossover design was used in a study where 298 students from four medical schools participated. A sample of 200 previously used questions from the progress tests was selected. The data were analyzed using the Rasch model, which provides fit parameters, reliability coefficients, and response option analysis.

RESULTS

The fit parameters were in the optimal interval ranging from 0.50 to 1.50, and the means were around 1.00. The person and item reliability coefficients were higher in the number-right condition than in the formula-scoring condition. The response option analysis showed that the majority of dysfunctional items emerged in the formula-scoring condition.

CONCLUSIONS

The findings of this study support the use of number-right scoring over formula scoring. Rasch model analyses showed that tests with number-right scoring have better psychometric properties than formula scoring. However, choosing the appropriate scoring method should depend not only on psychometric properties but also on self-directed test-taking strategies and metacognitive skills.

摘要

背景

进展测试是一种评估工具，用于在课程结束时定期评估所有学生。由于学生不可能知道所有的知识，所以重要的是让他们认识到自己的知识不足。因此，通常使用公式评分法。但是，在需要考虑部分知识的情况下，使用答对计分法。比较这两种方法的研究结果相互矛盾。据我们所知，在所有这些研究中，经典测试理论或概化理论被用于分析数据。与这些研究不同，我们将探索使用 Rasch 模型来比较这两种方法。

方法

在一项有 4 所医学院 298 名学生参与的 2×2 交叉设计研究中使用了这种方法。从进展测试中选择了 200 个以前使用过的样本问题。使用 Rasch 模型对数据进行分析，该模型提供拟合参数、可靠性系数和反应选项分析。

结果

拟合参数在 0.50 到 1.50 的最佳区间内，平均值在 1.00 左右。在答对计分条件下，人与题的可靠性系数高于公式计分条件。反应选项分析表明，在公式计分条件下出现了大量功能失调的项目。

结论

这项研究的结果支持使用答对计分而不是公式计分。Rasch 模型分析表明，使用答对计分的测试具有比公式计分更好的心理测量特性。然而，选择适当的评分方法不仅取决于心理测量特性，还取决于自主应试策略和元认知技能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d68d/5679154/343ddf4aacb1/12909_2017_1051_Fig1_HTML.jpg

相似文献

Comparison of formula and number-right scoring in undergraduate medical training: a Rasch model analysis.

BMC Med Educ. 2017 Nov 9;17(1):192. doi: 10.1186/s12909-017-1051-8.

The effect of a 'don't know' option on test scores: number-right and formula scoring compared.

Med Educ. 1999 Apr;33(4):267-75. doi: 10.1046/j.1365-2923.1999.00292.x.

Evidence-based decision about test scoring rules in clinical anatomy multiple-choice examinations.

Anat Sci Educ. 2015 May-Jun;8(3):242-8. doi: 10.1002/ase.1478. Epub 2014 Jul 22.

The don't know option in progress testing.

Adv Health Sci Educ Theory Pract. 2015 Dec;20(5):1325-38. doi: 10.1007/s10459-015-9604-2. Epub 2015 Apr 26.

Multiple true-false items: a comparison of scoring algorithms.

Adv Health Sci Educ Theory Pract. 2018 Aug;23(3):455-463. doi: 10.1007/s10459-017-9805-y. Epub 2017 Nov 30.

Using Rasch measurement to score, evaluate, and improve examinations in an anatomy course.

Anat Sci Educ. 2014 Nov-Dec;7(6):450-60. doi: 10.1002/ase.1436. Epub 2014 Jan 15.

Psychometrics of Multiple Choice Questions with Non-Functioning Distracters: Implications to Medical Education.

Indian J Physiol Pharmacol. 2015 Oct-Dec;59(4):428-35.

Pick-N multiple choice-exams: a comparison of scoring algorithms.

Adv Health Sci Educ Theory Pract. 2011 May;16(2):211-21. doi: 10.1007/s10459-010-9256-1. Epub 2010 Oct 31.

Using item response theory to explore the psychometric properties of extended matching questions examination in undergraduate medical education.

BMC Med Educ. 2005 Mar 7;5(1):9. doi: 10.1186/1472-6920-5-9.

Analyzing script concordance test scoring methods and items by difficulty and type.

Teach Learn Med. 2014;26(2):135-45. doi: 10.1080/10401334.2014.884464.

引用本文的文献

Investigating possible causes of bias in a progress test translation: an one-edged sword.

Korean J Med Educ. 2019 Sep;31(3):193-204. doi: 10.3946/kjme.2019.130. Epub 2019 Aug 26.

本文引用的文献

Development of cognitive processing and judgments of knowledge in medical students: Analysis of progress test results.

Med Teach. 2016 Nov;38(11):1125-1129. doi: 10.3109/0142159X.2016.1170781. Epub 2016 Apr 27.

The don't know option in progress testing.

Adv Health Sci Educ Theory Pract. 2015 Dec;20(5):1325-38. doi: 10.1007/s10459-015-9604-2. Epub 2015 Apr 26.

Improving assessment practice through cross-institutional collaboration: An exercise on the use of OSCEs.

Med Teach. 2016;38(3):263-71. doi: 10.3109/0142159X.2015.1016487. Epub 2015 Mar 18.

Rasch analysis of professional behavior in medical education.

Adv Health Sci Educ Theory Pract. 2015 Dec;20(5):1179-94. doi: 10.1007/s10459-015-9594-0. Epub 2015 Mar 4.

Lucky guess or knowledge: a cross-sectional study using the Bland and Altman analysis to compare confidence-based testing of pharmacological knowledge in 3rd and 5th year medical students.

Adv Health Sci Educ Theory Pract. 2015 May;20(2):431-40. doi: 10.1007/s10459-014-9537-1. Epub 2014 Aug 8.

The use of progress testing.

Perspect Med Educ. 2012 Mar;1(1):24-30. doi: 10.1007/s40037-012-0007-2. Epub 2012 Mar 10.

A systemic framework for the progress test: strengths, constraints and issues: AMEE Guide No. 71.

Med Teach. 2012;34(9):683-97. doi: 10.3109/0142159X.2012.704437.

Progress testing in clinical science education: results of a pilot project between the National Board of Medical Examiners and a US Medical School.

Med Teach. 2010;32(6):503-8. doi: 10.3109/01421590903514655.

A primer on classical test theory and item response theory for assessments in medical education.

Med Educ. 2010 Jan;44(1):109-17. doi: 10.1111/j.1365-2923.2009.03425.x.

Evidence of gender bias in True-False-Abstain medical examinations.

BMC Med Educ. 2009 Jun 7;9:32. doi: 10.1186/1472-6920-9-32.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

公式评分与数字评分在本科医学教育中的比较：一项 Rasch 模型分析。

Comparison of formula and number-right scoring in undergraduate medical training: a Rasch model analysis.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献