• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统修复低区分度多项选择题对评估可靠性的影响:一项中断时间序列分析

The impact of systematically repairing multiple choice questions with low discrimination on assessment reliability: an interrupted time series analysis.

作者信息

Desy Janeve, Harvey Adrian, Weeks Sarah, Busche Kevin D, Martin Kerri, Paget Michael, Naugler Christopher, McLaughlin Kevin

机构信息

Office of Undergraduate Medical Education, Cummings School of Medicine, University of Calgary, Alberta, Canada.

出版信息

Can Med Educ J. 2024 Jul 12;15(3):52-56. doi: 10.36834/cmej.77596. eCollection 2024 Jul.

DOI:10.36834/cmej.77596
PMID:39114792
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11302750/
Abstract

At our centre, we introduced a continuous quality improvement (CQI) initiative during academic year 2018-19 targeting for repair multiple choice question (MCQ) items with discrimination index () < 0.1. The purpose of this study was to assess the impact of this initiative on reliability/internal consistency of our assessments. Our participants were medical students during academic years 2015-16 to 2020-21 and our data were summative MCQ assessments during this time. Since the goal was to systematically review and improve summative assessments in our undergraduate program on an ongoing basis, we used interrupted time series analysis to assess the impact on reliability. Between 2015-16 and 2017-18 there was a significant negative trend in the mean alpha coefficient for MCQ exams (regression coefficient -0.027 [-0.008, -0.047], = 0.024). In the academic year following the introduction of our initiative (2018-19) there was a significant increase in the mean alpha coefficient (regression coefficient 0.113 [0.063, 0.163], = 0.010) which was then followed by a significant positive post-intervention trend (regression coefficient 0.056 [0.037, 0.075], = 0.006). In conclusion, our CQI intervention resulted in an immediate and progressive improvement reliability of our MCQ assessments.

摘要

在我们中心,于2018 - 19学年推出了一项持续质量改进(CQI)举措,旨在修订区分度指数()< 0.1的多项选择题(MCQ)题目。本研究的目的是评估该举措对我们评估的信度/内部一致性的影响。我们的参与者是2015 - 16学年至2020 - 21学年的医学生,我们的数据是这段时间内的MCQ总结性评估。由于目标是持续系统地审查和改进我们本科课程中的总结性评估,我们使用中断时间序列分析来评估对信度的影响。在2015 - 16年至2017 - 18年期间,MCQ考试的平均阿尔法系数呈显著负趋势(回归系数 -0.027 [-0.008, -0.047], = 0.024)。在我们的举措推出后的学年(2018 - 19),平均阿尔法系数显著增加(回归系数0.113 [0.063, 0.163], = 0.010),随后出现了显著的干预后正趋势(回归系数0.056 [0.037, 0.075], = 0.006)。总之,我们的CQI干预使我们的MCQ评估的信度立即且逐步得到改善。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d9c/11302750/1c37e2824e0b/CMEJ-15-052-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d9c/11302750/3e9903ce9840/CMEJ-15-052-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d9c/11302750/1c37e2824e0b/CMEJ-15-052-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d9c/11302750/3e9903ce9840/CMEJ-15-052-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d9c/11302750/1c37e2824e0b/CMEJ-15-052-g002.jpg

相似文献

1
The impact of systematically repairing multiple choice questions with low discrimination on assessment reliability: an interrupted time series analysis.系统修复低区分度多项选择题对评估可靠性的影响:一项中断时间序列分析
Can Med Educ J. 2024 Jul 12;15(3):52-56. doi: 10.36834/cmej.77596. eCollection 2024 Jul.
2
Sound Practices: An Exploratory Study of Building and Monitoring Multiple-Choice Exams at Canadian Undergraduate Medical Education Programs.声音实践:加拿大本科医学教育项目中构建和监测多项选择题考试的探索性研究。
Acad Med. 2021 Feb 1;96(2):271-277. doi: 10.1097/ACM.0000000000003659.
3
Correlation of MCQ and SEQ scores in written undergraduate ophthalmology assessment.本科眼科书面评估中选择题(MCQ)与简答题(SEQ)分数的相关性
J Coll Physicians Surg Pak. 2015 Mar;25(3):185-8.
4
Psychometrics of Multiple Choice Questions with Non-Functioning Distracters: Implications to Medical Education.具有无效干扰项的多项选择题的心理测量学:对医学教育的启示。
Indian J Physiol Pharmacol. 2015 Oct-Dec;59(4):428-35.
5
Development and validation of immediate self-feedback very short answer questions for medical students: practical implementation of generalizability theory to estimate reliability in formative examination designs.发展和验证医学生即时自我反馈简答题:应用概化理论估计形成性考试设计中的可靠性的实际操作。
BMC Med Educ. 2024 May 24;24(1):572. doi: 10.1186/s12909-024-05569-x.
6
Formative student-authored question bank: perceptions, question quality and association with summative performance.形成性学生自主命题题库:认知、问题质量与总结性表现的关联。
Postgrad Med J. 2018 Feb;94(1108):97-103. doi: 10.1136/postgradmedj-2017-135018. Epub 2017 Sep 2.
7
Answering questions in a co-created formative exam question bank improves summative exam performance, while students perceive benefits from answering, authoring, and peer discussion: A mixed methods analysis of PeerWise.在共同创建的形成性考试题库中回答问题可以提高总结性考试成绩,而学生从回答、创作和同伴讨论中受益:PeerWise 的混合方法分析。
Pharmacol Res Perspect. 2021 Aug;9(4):e00833. doi: 10.1002/prp2.833.
8
Testing anatomy: Dissecting spatial and non-spatial knowledge in multiple-choice question assessment.测试解剖学:在多选题评估中剖析空间和非空间知识。
Anat Sci Educ. 2024 Jan-Feb;17(1):102-113. doi: 10.1002/ase.2323. Epub 2023 Aug 2.
9
Peer review improves psychometric characteristics of multiple choice questions.同行评审可提高多选题的心理测量学特征。
Med Teach. 2017 Apr;39(sup1):S50-S54. doi: 10.1080/0142159X.2016.1254743. Epub 2017 Jan 20.
10
A novel student-led approach to multiple-choice question generation and online database creation, with targeted clinician input.一种由学生主导的新颖方法,用于生成多项选择题并创建在线数据库,同时有针对性地征求临床医生的意见。
Teach Learn Med. 2015;27(2):182-8. doi: 10.1080/10401334.2015.1011651.

引用本文的文献

1
Teaching suicide prevention: a Canadian medical education conundrum.自杀预防教学:加拿大医学教育的难题。
Can Med Educ J. 2024 Jul 12;15(3):1-5. doi: 10.36834/cmej.79624. eCollection 2024 Jul.

本文引用的文献

1
Methodology and reporting characteristics of studies using interrupted time series design in healthcare.利用中断时间序列设计在医疗保健中进行研究的方法学和报告特征。
BMC Med Res Methodol. 2019 Jul 4;19(1):137. doi: 10.1186/s12874-019-0777-x.
2
Sample Size Requirements for Estimation of Item Parameters in the Multidimensional Graded Response Model.多维分级反应模型中项目参数估计的样本量要求
Front Psychol. 2016 Feb 9;7:109. doi: 10.3389/fpsyg.2016.00109. eCollection 2016.
3
A contemporary approach to validity arguments: a practical guide to Kane's framework.
效度论证的当代方法:凯恩框架实用指南
Med Educ. 2015 Jun;49(6):560-75. doi: 10.1111/medu.12678.
4
A primer on classical test theory and item response theory for assessments in medical education.医学教育评估中的经典测量理论和项目反应理论简介。
Med Educ. 2010 Jan;44(1):109-17. doi: 10.1111/j.1365-2923.2009.03425.x.
5
A closer look at cueing effects in multiple-choice questions.深入探究多项选择题中的提示效应。
Med Educ. 1996 Jan;30(1):44-9. doi: 10.1111/j.1365-2923.1996.tb00716.x.
6
Developing a "clinical presentation" curriculum at the University of Calgary.在卡尔加里大学开发一门“临床表现”课程。
Acad Med. 1995 Mar;70(3):186-93. doi: 10.1097/00001888-199503000-00008.