• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

选择题组测验评分算法比较。

Pick-N multiple choice-exams: a comparison of scoring algorithms.

机构信息

Faculty of Health, Institute for Teaching and Educational Research in Health Sciences, Witten/Herdecke University, Germany.

出版信息

Adv Health Sci Educ Theory Pract. 2011 May;16(2):211-21. doi: 10.1007/s10459-010-9256-1. Epub 2010 Oct 31.

DOI:10.1007/s10459-010-9256-1
PMID:21038082
Abstract

To compare different scoring algorithms for Pick-N multiple correct answer multiple-choice (MC) exams regarding test reliability, student performance, total item discrimination and item difficulty. Data from six 3rd year medical students' end of term exams in internal medicine from 2005 to 2008 at Munich University were analysed (1,255 students, 180 Pick-N items in total). Scoring Algorithms: Each question scored a maximum of one point. We compared: (a) Dichotomous scoring (DS): One point if all true and no wrong answers were chosen. (b) Partial credit algorithm 1 (PS(50)): One point for 100% true answers; 0.5 points for 50% or more true answers; zero points for less than 50% true answers. No point deduction for wrong choices. (c) Partial credit algorithm 2 (PS(1/m)): A fraction of one point depending on the total number of true answers was given for each correct answer identified. No point deduction for wrong choices. Application of partial crediting resulted in psychometric results superior to dichotomous scoring (DS). Algorithms examined resulted in similar psychometric data with PS(50) only slightly exceeding PS(1/m) in higher coefficients of reliability. The Pick-N MC format and its scoring using the PS(50) and PS(1/m) algorithms are suited for undergraduate medical examinations. Partial knowledge should be awarded in Pick-N MC exams.

摘要

比较 Pick-N 多选题考试中不同评分算法的测试可靠性、学生表现、整体项目区分度和项目难度。分析了 2005 年至 2008 年慕尼黑大学六名三年级医学生内科期末考试的数据(1255 名学生,180 个 Pick-N 项目)。评分算法:每题最高得一分。我们比较了:(a)二分评分(DS):如果所有答案都正确且没有错误答案,则得一分。(b)部分计分算法 1(PS(50)):100%正确答案得一分;50%或以上正确答案得 0.5 分;少于 50%正确答案得零分。错误答案不扣分。(c)部分计分算法 2(PS(1/m)):根据正确答案的总数,为每个正确答案分配一个分数。错误答案不扣分。部分计分的应用结果优于二分评分(DS)。检查的算法得出了相似的心理测量数据,其中 PS(50)在可靠性较高的系数上略高于 PS(1/m)。Pick-N MC 格式及其使用 PS(50)和 PS(1/m)算法的评分适用于本科医学考试。在 Pick-N MC 考试中应授予部分知识。

相似文献

1
Pick-N multiple choice-exams: a comparison of scoring algorithms.选择题组测验评分算法比较。
Adv Health Sci Educ Theory Pract. 2011 May;16(2):211-21. doi: 10.1007/s10459-010-9256-1. Epub 2010 Oct 31.
2
Multiple true-false items: a comparison of scoring algorithms.多项是非题:评分算法比较。
Adv Health Sci Educ Theory Pract. 2018 Aug;23(3):455-463. doi: 10.1007/s10459-017-9805-y. Epub 2017 Nov 30.
3
Answer changing in multiple choice assessment change that answer when in doubt--and spread the word!在多项选择题评估中改变答案——有疑问时就改变那个答案——并传播这个消息!
BMC Med Educ. 2007 Aug 24;7:28. doi: 10.1186/1472-6920-7-28.
4
Comparison between Long-Menu and Open-Ended Questions in computerized medical assessments. A randomized controlled trial.计算机化医学评估中长菜单式问题与开放式问题的比较:一项随机对照试验。
BMC Med Educ. 2006 Oct 10;6:50. doi: 10.1186/1472-6920-6-50.
5
Assessment of spatial anatomical knowledge with a 'three-dimensional multiple choice test' (3D-MC).使用“三维多项选择题测试”(3D-MC)评估空间解剖学知识。
Med Teach. 2009 Jan;31(1):e13-7. doi: 10.1080/01421590802334325.
6
Comparing a script concordance examination to a multiple-choice examination on a core internal medicine clerkship.将脚本一致性考试与核心内科实习的多项选择题考试进行比较。
Teach Learn Med. 2012;24(3):187-93. doi: 10.1080/10401334.2012.692239.
7
A modified electronic key feature examination for undergraduate medical students: validation threats and opportunities.针对本科医学生的改良电子钥匙功能测试:验证的威胁与机遇
Med Teach. 2005 Aug;27(5):450-5. doi: 10.1080/01421590500078471.
8
It takes only 100 true-false items to test medical students: true or false?只需100道是非题就能测试医学生:对还是错?
Med Teach. 2005 Aug;27(5):468-72. doi: 10.1080/01421590500097018.
9
Sharpening the eye of the OSCE with critical action analysis.通过关键行动分析提升客观结构化临床考试的敏锐度。
Acad Med. 2008 Oct;83(10):900-5. doi: 10.1097/ACM.0b013e3181850990.
10
Answering multiple-choice questions in high-stakes medical examinations.在高风险医学考试中回答多项选择题。
Med Educ. 2005 Sep;39(9):890-4. doi: 10.1111/j.1365-2929.2005.02243.x.

引用本文的文献

1
Use of Multiple-Choice Items in Summative Examinations: Questionnaire Survey Among German Undergraduate Dental Training Programs.形成性考核中多选题的使用:德国本科生牙科学位培训计划的问卷调查。
JMIR Med Educ. 2024 Jun 27;10:e58126. doi: 10.2196/58126.
2
Natural language processing for automatic evaluation of free-text answers - a feasibility study based on the European Diploma in Radiology examination.用于自动评估自由文本答案的自然语言处理——基于欧洲放射学文凭考试的可行性研究
Insights Imaging. 2023 Sep 19;14(1):150. doi: 10.1186/s13244-023-01507-5.
3
The Relation Between Students' Theoretical Knowledge and Practical Skills in Endodontics: Retrospective Analysis.
牙髓病学中学生理论知识与实践技能的关系:回顾性分析
Interact J Med Res. 2023 Apr 18;12:e46305. doi: 10.2196/46305.
4
Use of Multiple-Select Multiple-Choice Items in a Dental Undergraduate Curriculum: Retrospective Study Involving the Application of Different Scoring Methods.牙科本科课程中多项选择多项选择题的使用:涉及不同评分方法应用的回顾性研究
JMIR Med Educ. 2023 Mar 27;9:e43792. doi: 10.2196/43792.
5
Evaluating Different Scoring Methods for Multiple Response Items Providing Partial Credit.评估针对提供部分分数的多项选择题的不同评分方法。
Educ Psychol Meas. 2022 Feb;82(1):151-176. doi: 10.1177/0013164421994636. Epub 2021 Feb 22.
6
Special Teaching Formats during the COVID-19 Pandemic-A Survey with Implications for a Crisis-Proof Education.新冠疫情期间的特殊教学形式——一项对具备抗危机能力教育有启示意义的调查
J Clin Med. 2021 Oct 30;10(21):5099. doi: 10.3390/jcm10215099.
7
Learning to diagnose accurately through virtual patients: do reflection phases have an added benefit?通过虚拟患者准确学习诊断:反思阶段有额外的好处吗?
BMC Med Educ. 2021 Oct 7;21(1):523. doi: 10.1186/s12909-021-02937-9.
8
How to assess? Perceptions and preferences of undergraduate medical students concerning traditional assessment methods.如何评估?本科医学生对传统评估方法的看法和偏好。
BMC Med Educ. 2020 Sep 17;20(1):312. doi: 10.1186/s12909-020-02239-6.
9
National Board of Medical Examiners and Curriculum Change: What Do Scores Tell Us? A Case Study at the University of Balamand Medical School.美国国家医学考试委员会与课程改革:分数能告诉我们什么?以巴拉曼德医学院为例的一项研究
J Med Educ Curric Dev. 2020 Jul 24;7:2382120520925062. doi: 10.1177/2382120520925062. eCollection 2020 Jan-Dec.
10
Effectiveness of longitudinal faculty development programs on MCQs items writing skills: A follow-up study.纵向教师发展计划对多项选择题编写技能的有效性:一项随访研究。
PLoS One. 2017 Oct 10;12(10):e0185895. doi: 10.1371/journal.pone.0185895. eCollection 2017.