Optimizing a national examination for medical undergraduates via modern automated test assembly approaches.

Affiliation

Peking University, Beijing, China.

Publication information

BMC Med Educ. 2024 Aug 25;24(1):919. doi: 10.1186/s12909-024-05905-1.

DOI: 10.1186/s12909-024-05905-1
PMID: 39183285
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11345959/
Abstract

BACKGROUND

Automated test assembly (ATA) is a modern methodology that applies data-science optimization on computer platforms to create test forms automatically, improving the efficiency and accuracy of test assembly. In medical education, large-scale high-stakes assessments often require lengthy tests, which raises costs such as examinee fatigue and item-development expenses. This study aims to improve the design of medical education assessments by applying modern ATA approaches.

METHODS

A four-step psychometric process was used to calibrate and analyze the item pool of the Standardized Competence Test for Clinical Medicine Undergraduates (SCTCMU), a nationwide summative test in China comprising 300 multiple-choice questions (MCQs). Two modern ATA approaches were then used to determine the optimal item combination, accounting for both the statistical and the content requirements specified in the test blueprint. The quality of the test forms assembled with these approaches was then evaluated.
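As a rough illustration of what assembling a form against statistical and content requirements can look like, here is a minimal Python sketch. It is not the authors' method: the abstract does not name the two ATA approaches, so this uses a simple greedy heuristic, assumes a two-parameter logistic (2PL) item model, and runs on a synthetic item pool. The names `Item`, `information`, and `assemble`, the domains, and all parameter values are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    item_id: str
    domain: str  # content area in a hypothetical blueprint
    a: float     # 2PL discrimination (assumed model)
    b: float     # 2PL difficulty (assumed model)


def information(item: Item, theta: float) -> float:
    """Fisher information of a 2PL item at ability level theta."""
    p = 1.0 / (1.0 + math.exp(-item.a * (theta - item.b)))
    return item.a ** 2 * p * (1.0 - p)


def assemble(pool, theta, target_info, min_per_domain):
    """Greedy assembly: satisfy the content minimums first, then add the
    most informative remaining items until the information target is met."""
    ranked = sorted(pool, key=lambda it: information(it, theta), reverse=True)
    form = []
    for domain, need in min_per_domain.items():
        picks = [it for it in ranked if it.domain == domain][:need]
        if len(picks) < need:
            raise ValueError(f"not enough items in domain {domain!r}")
        form.extend(picks)
    chosen = {it.item_id for it in form}
    total = sum(information(it, theta) for it in form)
    for it in ranked:
        if total >= target_info:
            break
        if it.item_id not in chosen:
            form.append(it)
            chosen.add(it.item_id)
            total += information(it, theta)
    if total < target_info:
        raise ValueError("item pool cannot reach the target information")
    return form, total


# Synthetic 60-item pool; every value here is invented for illustration.
domains = ("medicine", "surgery", "pediatrics")
pool = [
    Item(f"Q{i}", domains[i % 3], a=0.8 + 0.1 * (i % 5), b=-1.5 + 0.25 * (i % 13))
    for i in range(60)
]
blueprint = {"medicine": 3, "surgery": 3, "pediatrics": 2}
form, total = assemble(pool, theta=0.0, target_info=4.0, min_per_domain=blueprint)
print(f"{len(form)} of {len(pool)} items selected, information = {total:.2f}")
```

Operational ATA systems usually state this as a mixed-integer program, or use heuristics that handle many more constraints (for example, information targets at several ability levels, item exposure, and item overlap); the greedy rule here only shows the basic trade-off between form length and precision.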

RESULTS

As a foundational step, the psychometric properties of the SCTCMU were examined, and the item properties were found to be of good quality. Evaluation of the test forms assembled with modern ATA approaches showed that the optimal test length can be determined for a predefined measurement precision. Specifically, applying modern ATA approaches can substantially reduce the length of the assembled test form while maintaining the statistical and content standards specified in the test blueprint.
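The link between a predefined measurement precision and test length can be written out under item response theory. The abstract does not state which model was used, so the following is only a sketch assuming a two-parameter logistic (2PL) model. The symbols are assumed notation: $a_i$ and $b_i$ are the discrimination and difficulty of item $i$, $\theta$ is examinee ability, $S$ is the set of selected items, and $\mathrm{SE}^{*}$ is the largest acceptable standard error.

```latex
P_i(\theta) = \frac{1}{1 + e^{-a_i(\theta - b_i)}}, \qquad
I_i(\theta) = a_i^{2}\, P_i(\theta)\,\bigl(1 - P_i(\theta)\bigr), \qquad
I_S(\theta) = \sum_{i \in S} I_i(\theta), \qquad
\mathrm{SE}(\hat{\theta}) \approx \frac{1}{\sqrt{I_S(\theta)}}

\min_{S}\ |S| \quad \text{subject to} \quad
I_S(\theta) \ge \frac{1}{(\mathrm{SE}^{*})^{2}}
\quad \text{and the content constraints of the blueprint}
```

Under these assumptions, a target of $\mathrm{SE}^{*} = 0.5$ corresponds to a required test information of $1 / 0.5^{2} = 4$ at the ability level of interest, so a form built from more informative items reaches the same precision with fewer items.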

CONCLUSIONS

This study used modern ATA approaches to construct test forms automatically, improving the efficiency and precision of test assembly. These approaches offer medical educators a valuable tool for making medical education assessment more efficient and cost-effective.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b00/11345959/eee6bb6e91b9/12909_2024_5905_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b00/11345959/de8e0489da49/12909_2024_5905_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b00/11345959/a95e55be5583/12909_2024_5905_Fig2_HTML.jpg
