Suppr超能文献

使用应届毕业生作为评判者的安格夫标准设定程序在进展测试中的可靠性和可信度。

Reliability and credibility of an angoff standard setting procedure in progress testing using recent graduates as judges.

作者信息

Verhoeven B H, van der Steeg A F, Scherpbier A J, Muijtjens A M, Verwijnen G M, van der Vleuten C P

机构信息

Department: Skillslab, University of Maastricht, The Netherlands.

出版信息

Med Educ. 1999 Nov;33(11):832-7. doi: 10.1046/j.1365-2923.1999.00487.x.

Abstract

INTRODUCTION

Progress testing is an assessment method that samples the complete domain of knowledge that is considered pertinent to undergraduate medical education. Because of the comprehensive nature of this test, it is very difficult to set a passing score. We obtained a progress test standard using an Angoff procedure with recent graduates as judges. This paper reports on the reliability and credibility of this approach.

METHODS

The Angoff procedure was applied to a sample of 146 progress test items. The items were judged by a panel of eight recently graduated students. Generalizability theory was used to investigate the reliability as a function of the number of items and judges. Credibility was judged by comparing the pass/fail rates resulting from the standard arrived at by the Angoff procedure with those obtained using a relative and a fixed standard.

RESULTS

The results indicate that an acceptable error score can be achieved, yielding a precision within one percentage on the scoring scale, by using 10 judges on a full-length progress test (i.e. 250 items). The pass/fail rates associated with the Angoff standard came closest to those of the relative standard, which takes variations in test difficulty into account. A high correlation was found between item-Angoff estimates and the item P-values.

CONCLUSION

The results of this study suggest that the Angoff procedure, using recently graduated students as judges, is an appropriate standard setting method for a progress test.

摘要

引言

进阶测试是一种评估方法,它对与本科医学教育相关的全部知识领域进行抽样。由于该测试具有全面性,因此很难设定及格分数。我们采用安格夫程序,以应届毕业生为评判者,得出了进阶测试标准。本文报告了该方法的可靠性和可信度。

方法

将安格夫程序应用于146个进阶测试项目的样本。这些项目由八名应届毕业生组成的小组进行评判。运用概化理论来研究可靠性与项目数量及评判者数量之间的函数关系。通过比较安格夫程序得出的标准与相对标准和固定标准得出的及格/不及格率来判断可信度。

结果

结果表明,在完整的进阶测试(即250个项目)中,使用10名评判者可以实现可接受的误差分数,在评分量表上的精度在一个百分点以内。与安格夫标准相关的及格/不及格率最接近考虑了测试难度差异的相对标准。在项目安格夫估计值与项目P值之间发现了高度相关性。

结论

本研究结果表明,以应届毕业生为评判者的安格夫程序是进阶测试的一种合适的标准设定方法。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验