书签法和 Angoff 标准设定法在医学绩效测试中有效性的比较。

Comparison of the validity of bookmark and Angoff standard setting methods in medical performance tests.

机构信息

Department of Psychology, Faculty of Humanities, University of Zanjan, Zanjan, Iran.

出版信息

BMC Med Educ. 2021 Jan 2;21(1):1. doi: 10.1186/s12909-020-02436-3.

DOI:10.1186/s12909-020-02436-3

PMID:33388043

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7778792/

Abstract

BACKGROUND

One of the main processes of determining the ability level at which a student should pass an assessment is standard setting. The current study aimed to compare the validity of Angoff and bookmark methods in standard-setting.

METHOD

190 individuals with an M.Sc. degree in laboratory science participated in the study. A test with 32 items, designed by a group of experts, was used to assess the laboratory skills of the participants. Moreover, two groups each containing 12 content specialists in laboratory sciences, voluntarily participated in the application of the Angoff and bookmark methods. To assess the process validity, a 5-item questionnaire was asked from two groups of panelists. To investigate the internal validity, the classification agreement was calculated using the kappa and Fleiss's Kappa coefficient. External validity was assessed by using five indices (correlation with criterion score, specificity, sensitivity, and positive and negative predictive values of correlation test with criterion score).

RESULTS

The results showed that the obtained cut-scores was 17.67 for Angoff and 18.8 for bookmark. The average total of items related to the quality of the execution process was 4.25 for the Angoff group and 4.79 for the bookmark group. Pass rates pass rates percentages for the Angoff and bookmark group were 55.78 and 41.36, respectively. Correlations of passing/failing, between employer ratings and test scores were 0.69 and 0.88 for Angoff and bookmark methods, respectively.

CONCLUSION

Based on the results, it can be concluded that the process and internal validities of the bookmark method were higher than the Angoff method. For evaluation of the external validity (concordance of the cut score with the criterion score), all five external validity indices supported the bookmark method.

摘要

背景

确定学生应通过评估的能力水平的主要过程之一是标准设定。本研究旨在比较 Angoff 和书签法在标准设定中的有效性。

方法

190 名具有实验室科学硕士学位的个人参加了这项研究。一项由一组专家设计的包含 32 个项目的测试用于评估参与者的实验室技能。此外，两组各包含 12 名实验室科学内容专家自愿参与了 Angoff 和书签法的应用。为了评估过程有效性，从两组专家中询问了一份包含 5 个项目的问卷。为了研究内部有效性，使用 kappa 和 Fleiss 的 Kappa 系数计算分类一致性。外部有效性通过使用五个指标（与标准分数的相关性、特异性、敏感性以及与标准分数的相关性测试的阳性和阴性预测值）进行评估。

结果

结果表明，Angoff 法获得的切割分数为 17.67，书签法为 18.8。与执行过程质量相关的项目平均总分分别为 Angoff 组的 4.25 和书签组的 4.79。Angoff 和书签组的通过率分别为 55.78%和 41.36%。雇主评分和测试分数之间的及格/不及格相关性分别为 Angoff 和书签方法的 0.69 和 0.88。

结论

根据结果可以得出结论，书签法的过程和内部有效性高于 Angoff 法。对于外部有效性（切割分数与标准分数的一致性）的评估，所有五个外部有效性指标都支持书签法。

相似文献

Comparison of the validity of bookmark and Angoff standard setting methods in medical performance tests.书签法和 Angoff 标准设定法在医学绩效测试中有效性的比较。

BMC Med Educ. 2021 Jan 2;21(1):1. doi: 10.1186/s12909-020-02436-3.

Comparison of results between modified-Angoff and bookmark methods for estimating cut score of the Korean medical licensing examination.韩国医学执照考试及格分数估计中改良安格夫法与书签法结果的比较。

Korean J Med Educ. 2018 Dec;30(4):347-357. doi: 10.3946/kjme.2018.110. Epub 2018 Dec 1.

Comparison of standard-setting methods for the Korea Radiological technologist Licensing Examination : Angoff, Ebel, Bookmark, and Hofstee.韩国放射技师执照考试标准设定方法的比较：安格夫法、埃贝尔法、书签法和霍夫斯泰法。

J Educ Eval Health Prof. 2018;15:32. doi: 10.3352/jeehp.2018.15.32. Epub 2018 Dec 26.

Who will pass the dental OSCE? Comparison of the Angoff and the borderline regression standard setting methods.谁将通过牙科客观结构化临床考试？安格夫法与边界回归标准设定方法的比较。

Eur J Dent Educ. 2009 Aug;13(3):162-71. doi: 10.1111/j.1600-0579.2008.00568.x.

Using the Angoff method to set a standard on mock exams for the Korean Nursing Licensing Examination.运用安格夫方法为韩国护士执照考试的模拟考试设定标准。

J Educ Eval Health Prof. 2020;17:14. doi: 10.3352/jeehp.2020.17.14. Epub 2020 Apr 22.

Comparison of two methods of standard setting: the performance of the three-level Angoff method.两种标准设定方法的比较：三级 Angoff 法的表现。

Med Educ. 2011 Dec;45(12):1199-208. doi: 10.1111/j.1365-2923.2011.04073.x.

Is an Angoff standard an indication of minimal competence of examinees or of judges?安格夫标准是考生最低能力的指标还是评判者最低能力的指标？

Adv Health Sci Educ Theory Pract. 2008 May;13(2):203-11. doi: 10.1007/s10459-006-9035-1. Epub 2006 Oct 17.

Applying the Bookmark method to medical education: standard setting for an aseptic technique station.运用书签法于医学教育：无菌技术站的标准设定。

Med Teach. 2013 Jul;35(7):581-5. doi: 10.3109/0142159X.2013.778395. Epub 2013 Apr 18.

Standard setting with dichotomous and constructed response items: some Rasch model approaches.使用二分法和结构化反应题目的标准设定：一些拉施模型方法。

J Appl Meas. 2009;10(4):438-54.

A comparison of different standard-setting methods for professional qualifying dental examination.不同标准化设定方法在专业资格牙科检查中的比较。

J Dent Educ. 2021 Jul;85(7):1210-1216. doi: 10.1002/jdd.12600. Epub 2021 Mar 31.

引用本文的文献

Teaching cognitive and affective empathy in medicine: a systematic review and meta-analysis of randomized controlled trials.医学中认知共情与情感共情教学：随机对照试验的系统评价与荟萃分析

Med Educ Online. 2025 Dec;30(1):2501263. doi: 10.1080/10872981.2025.2501263. Epub 2025 May 6.

Investigating assessment standards and fixed passing marks in dental undergraduate finals: a mixed-methods approach.探究牙科本科期末考试的评估标准和固定及格分数：一种混合方法研究

BMC Med Educ. 2025 Apr 3;25(1):481. doi: 10.1186/s12909-025-06944-y.

Adaptation and modification of the professional identity formation scale for postgraduate trainees in basic health science: a mixed method study.基础健康科学研究生学员专业身份形成量表的改编与修订：一项混合方法研究

BMC Med Educ. 2025 Apr 2;25(1):475. doi: 10.1186/s12909-025-07025-w.

Intention to use eLearning-based continuing professional development and its predictors among healthcare professionals in Amhara region referral hospitals, Ethiopia, 2023: using modified UTAUT-2 model.2023年埃塞俄比亚阿姆哈拉地区转诊医院医护人员基于电子学习的持续专业发展的使用意向及其预测因素：采用改进的UTAUT-2模型

BMC Health Serv Res. 2025 Jan 30;25(1):178. doi: 10.1186/s12913-025-12317-4.

A Unique Simulation Methodology for Practicing Clinical Decision Making.一种用于临床决策实践的独特模拟方法。

J Med Educ Curric Dev. 2025 Jan 27;12:23821205241310077. doi: 10.1177/23821205241310077. eCollection 2025 Jan-Dec.

Research involvement among undergraduate medical students in Bangladesh: a multicenter cross-sectional study.孟加拉国本科医学生的研究参与情况：一项多中心横断面研究。

BMC Med Educ. 2025 Jan 25;25(1):126. doi: 10.1186/s12909-024-06566-w.

Health literacy and influencing factors in university students across diverse educational fields in Kazakhstan.哈萨克斯坦不同教育领域大学生的健康素养及其影响因素。

Sci Rep. 2025 Jan 25;15(1):3197. doi: 10.1038/s41598-025-87049-w.

Surgical portfolios: A systematic scoping review.外科手术档案：一项系统性综述。

Surg Pract Sci. 2022 Jul 6;10:100107. doi: 10.1016/j.sipas.2022.100107. eCollection 2022 Sep.

Enhancing medical English proficiency: the current status and development potential of peer-assisted learning in medical education.提高医学英语水平：医学教育中同伴互助学习的现状与发展潜力

BMC Med Educ. 2025 Jan 16;25(1):79. doi: 10.1186/s12909-024-06492-x.

Dissecting Loneliness in the Digital Age: An Insight into the Experiences of Medical Students Amid and Beyond the COVID-19 Pandemic.剖析数字时代的孤独：洞察新冠疫情期间及之后医学生的经历

F1000Res. 2024 Jun 26;12:1196. doi: 10.12688/f1000research.141325.1. eCollection 2023.

本文引用的文献

J Educ Eval Health Prof. 2018;15:32. doi: 10.3352/jeehp.2018.15.32. Epub 2018 Dec 26.

Korean J Med Educ. 2018 Dec;30(4):347-357. doi: 10.3946/kjme.2018.110. Epub 2018 Dec 1.

The sights and insights of examiners in objective structured clinical examinations.客观结构化临床考试中考官的观察与见解。

J Educ Eval Health Prof. 2017 Dec 27;14:34. doi: 10.3352/jeehp.2017.14.34. eCollection 2017.

Sensitivity, Specificity, and Predictive Values: Foundations, Pliabilities, and Pitfalls in Research and Practice.敏感性、特异性和预测值：研究与实践中的基础、灵活性及陷阱

Front Public Health. 2017 Nov 20;5:307. doi: 10.3389/fpubh.2017.00307. eCollection 2017.

Standard setting in medical education: fundamental concepts and emerging challenges.医学教育中的标准设定：基本概念与新出现的挑战。

Med J Islam Repub Iran. 2014 May 19;28:34. eCollection 2014.

Applying the Bookmark method to medical education: standard setting for an aseptic technique station.运用书签法于医学教育：无菌技术站的标准设定。

Med Teach. 2013 Jul;35(7):581-5. doi: 10.3109/0142159X.2013.778395. Epub 2013 Apr 18.

Setting pass scores for clinical skills assessment.设定临床技能评估的及格分数。

Kaohsiung J Med Sci. 2008 Dec;24(12):656-63. doi: 10.1016/S1607-551X(09)70032-4.

Setting standards for performance tests: a pilot study of a three-level Angoff method.设定性能测试标准：三级安格夫法的初步研究

Acad Med. 2008 Oct;83(10 Suppl):S13-6. doi: 10.1097/ACM.0b013e318183c683.

Standard setting for OSCEs: trial of borderline approach.客观结构化临床考试的标准设定：临界方法试验

Adv Health Sci Educ Theory Pract. 2004;9(3):201-9. doi: 10.1023/B:AHSE.0000038208.06099.9a.

A model for setting performance standards for standardized patient examinations.一种用于设定标准化患者检查绩效标准的模型。

Eval Health Prof. 2003 Dec;26(4):427-46. doi: 10.1177/0163278703258105.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验