• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

以表现最佳的学生为参照点的标准化设置方法:实用且经济实惠。

A standard setting method with the best performing students as point of reference: practical and affordable.

机构信息

Center for Research and Innovation in Medical Education, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands.

出版信息

Med Teach. 2010;32(2):154-60. doi: 10.3109/01421590903196979.

DOI:10.3109/01421590903196979
PMID:20163232
Abstract

BACKGROUND

Teachers involved in test development usually prefer criterion-referenced standard setting methods using panels. Since expert panels are costly, standards are often set by a pre-fixed percentage of questions answered correctly or norm-referenced methods aimed at ranking examinees.

AIM

To discuss the (dis)advantages of commonly used criterion and norm-referenced methods and present a new compromise method: standards based on a fixed cut-off score using the best scoring students as reference point.

METHODS

Historical data from 54 Maastricht (norm-referenced) and 52 Groningen (criterion-referenced) tests were used to demonstrate huge discrepancies and variability in cut-off scores and failure rates. Subsequently, the compromise model - known as Cohen's method - was applied to the Groningen tests.

RESULTS

The Maastricht norm-referenced method led to a large variation in required cut-off scores (15-46%), but a stable failure rate (about 17%). The Groningen method with a conventional, pre-fixed standard of 60% led to a large variation in failure rates (17-97%). The compromise method reduced variation in required cut-off scores as well as failure rates.

CONCLUSION

Both the criterion and norm-referenced standards, used in practice, have disadvantages. The proposed compromise model reduces the disadvantages of both methods and is considered more acceptable. Last but not least, compared to standard setting methods using panels, this method is affordable.

摘要

背景

参与测试开发的教师通常更喜欢使用专家小组的基于准则的标准设定方法。由于专家小组成本高昂,因此通常会通过固定比例的正确回答问题或旨在对考生进行排名的常模参照方法来设定标准。

目的

讨论常用的基于准则和基于常模的方法的优缺点,并提出一种新的折衷方法:基于固定截止分数的标准,以最佳得分学生为参考点。

方法

使用来自 54 个马斯特里赫特(常模参照)和 52 个格罗宁根(准则参照)测试的历史数据,展示了截止分数和失败率之间存在巨大差异和可变性。随后,将妥协模型(即科恩方法)应用于格罗宁根测试。

结果

马斯特里赫特的常模参照方法导致所需截止分数的变化很大(15-46%),但失败率稳定(约 17%)。采用 60%的传统固定标准的格罗宁根方法导致失败率的变化很大(17-97%)。妥协方法降低了所需截止分数和失败率的变化。

结论

实践中使用的基于准则和基于常模的标准都有其缺点。所提出的折衷模型减少了两种方法的缺点,被认为更可接受。最后但并非最不重要的是,与使用小组进行标准设定的方法相比,这种方法更经济实惠。

相似文献

1
A standard setting method with the best performing students as point of reference: practical and affordable.以表现最佳的学生为参照点的标准化设置方法:实用且经济实惠。
Med Teach. 2010;32(2):154-60. doi: 10.3109/01421590903196979.
2
Setting and maintaining standards in multiple choice examinations: AMEE Guide No. 37.多项选择题考试中的标准设定与维持:医学教育与培训学会指南第37号
Med Teach. 2008;30(9-10):836-45. doi: 10.1080/01421590802402247.
3
A plea for the proper use of criterion-referenced tests in medical assessment.呼吁正确使用医学评估中的准则参照测验。
Med Educ. 2009 Dec;43(12):1141-6. doi: 10.1111/j.1365-2923.2009.03541.x.
4
Development of a modified Cohen method of standard setting.建立改良 Cohen 标准设定法。
Med Teach. 2011;33(12):e678-82. doi: 10.3109/0142159X.2011.611192.
5
A comparison of two standard-setting approaches in high-stakes clinical performance assessment using generalizability theory.使用概化理论比较两种高风险临床绩效评估中的标准设定方法。
Acad Med. 2012 Aug;87(8):1077-82. doi: 10.1097/ACM.0b013e31825cea4b.
6
Setting a standard on the pilot National Board Dental Examination.制定全国牙科医师资格考试试点的标准。
J Dent Educ. 1992 Oct;56(10):684-8.
7
A better norm-referenced grading using the standard deviation criterion.一种使用标准差标准的更好的常模参照评分。
Teach Learn Med. 2014;26(4):364-5. doi: 10.1080/10401334.2014.945031.
8
Who will pass the dental OSCE? Comparison of the Angoff and the borderline regression standard setting methods.谁将通过牙科客观结构化临床考试?安格夫法与边界回归标准设定方法的比较。
Eur J Dent Educ. 2009 Aug;13(3):162-71. doi: 10.1111/j.1600-0579.2008.00568.x.
9
Ensuring global standards for medical graduates: a pilot study of international standard-setting.确保医学毕业生的全球标准:一项国际标准制定的试点研究。
Med Teach. 2005 May;27(3):207-13. doi: 10.1080/01421590500129571.
10
Setting school-level outcome standards.设定校级成果标准。
Med Educ. 2006 Feb;40(2):166-72. doi: 10.1111/j.1365-2929.2005.02374.x.

引用本文的文献

1
Clinical reasoning education: evaluating the effect of script strategy and reflective practice on nursing students' reasoning development.临床推理教育:评估脚本策略和反思性实践对护生推理能力发展的影响。
BMC Res Notes. 2025 Jul 2;18(1):271. doi: 10.1186/s13104-025-07338-5.
2
Situational judgment test in pharmacy education: assessing professionalism capability among students.药学教育中的情境判断测试:评估学生的专业素养能力
BMC Res Notes. 2025 Mar 26;18(1):128. doi: 10.1186/s13104-025-07183-6.
3
Professionalism assessment of students in clinical education through Situational Judgment Test (SJT).
通过情境判断测验(SJT)对临床教育中学生的专业精神进行评估。
BMC Med Educ. 2024 Nov 12;24(1):1291. doi: 10.1186/s12909-024-06205-4.
4
Metabolic associated fatty liver disease (MAFLD): assessing the knowledge and practice of primary care doctors in Seremban District, Negeri Sembilan.代谢相关脂肪性肝病(MAFLD):评估森美兰州芙蓉区基层医疗医生的知识与实践情况。
Malays Fam Physician. 2024 Aug 16;19:51. doi: 10.51866/oa.629. eCollection 2024.
5
Satisfaction of adult and pediatric neurologists and neurosurgeons using telehealth during the COVID-19 pandemic in Saudi Arabia: a cross-sectional study.沙特阿拉伯在2019冠状病毒病大流行期间成人及儿科神经科医生和神经外科医生对远程医疗的满意度:一项横断面研究。
Front Digit Health. 2024 Feb 14;6:1195697. doi: 10.3389/fdgth.2024.1195697. eCollection 2024.
6
The effect of the Educational Scholar Program as a longitudinal faculty development program on the capability of educators as scholars.教育学者计划作为一项纵向的教师发展计划对教育工作者学者能力的影响。
BMC Med Educ. 2023 Sep 22;23(1):691. doi: 10.1186/s12909-023-04682-7.
7
Mother-daughter communication of sexual and reproductive health (SRH) matters and associated factors among sinhalese adolescent girls aged 14-19 years, in Sri Lanka.斯里兰卡 14-19 岁的僧伽罗少女的性与生殖健康(SRH)问题及其相关因素的母女沟通情况。
BMC Womens Health. 2023 Aug 31;23(1):461. doi: 10.1186/s12905-023-02617-4.
8
Influence of online collaborative learning on social network and academic performance of medical students: lessons learned from the COVID-19 pandemic.在线协作学习对医学生社交网络和学业成绩的影响:从新冠疫情中汲取的经验教训。
Front Med (Lausanne). 2023 Aug 9;10:1242638. doi: 10.3389/fmed.2023.1242638. eCollection 2023.
9
Competence of medical and obstetric registrars in the management of systemic lupus erythematosus in pregnancy.医学和产科住院医师在妊娠期系统性红斑狼疮管理方面的能力。
Obstet Med. 2021 Sep;14(3):153-157. doi: 10.1177/1753495X20964670. Epub 2020 Nov 4.
10
Introducing Summative Progress Testing in Radiology Residency: Little Change in Residents' Test Results After Transitioning from Formative Progress Testing.放射科住院医师培训中引入总结性进展测试:从形成性进展测试过渡后,住院医师的测试结果变化不大。
Med Sci Educ. 2020 May 13;30(2):943-953. doi: 10.1007/s40670-020-00977-2. eCollection 2020 Jun.