项目反应理论：现代测试理论在医学教育中的应用。

Item response theory: applications of modern test theory in medical education.

作者信息

Downing Steven M

机构信息

College of Medicine, Department of Medical Education (MC 591), University of Illinois at Chicago, Chicago, IL 60612-7309, USA.

出版信息

Med Educ. 2003 Aug;37(8):739-45. doi: 10.1046/j.1365-2923.2003.01587.x.

DOI:10.1046/j.1365-2923.2003.01587.x

PMID:12945568

Abstract

CONTEXT

Item response theory (IRT) measurement models are discussed in the context of their potential usefulness in various medical education settings such as assessment of achievement and evaluation of clinical performance.

PURPOSE

The purpose of this article is to compare and contrast IRT measurement with the more familiar classical measurement theory (CMT) and to explore the benefits of IRT applications in typical medical education settings.

SUMMARY

CMT, the more common measurement model used in medical education, is straightforward and intuitive. Its limitation is that it is sample-dependent, in that all statistics are confounded with the particular sample of examinees who completed the assessment. Examinee scores from IRT are independent of the particular sample of test questions or assessment stimuli. Also, item characteristics, such as item difficulty, are independent of the particular sample of examinees. The IRT characteristic of invariance permits easy equating of examination scores, which places scores on a constant measurement scale and permits the legitimate comparison of student ability change over time. Three common IRT models and their statistical assumptions are discussed. IRT applications in computer-adaptive testing and as a method useful for adjusting rater error in clinical performance assessments are overviewed.

CONCLUSIONS

IRT measurement is a powerful tool used to solve a major problem of CMT, that is, the confounding of examinee ability with item characteristics. IRT measurement addresses important issues in medical education, such as eliminating rater error from performance assessments.

摘要

背景

项目反应理论（IRT）测量模型在各种医学教育环境中的潜在用途背景下进行了讨论，例如成绩评估和临床表现评价。

目的

本文的目的是将IRT测量与更熟悉的经典测量理论（CMT）进行比较和对比，并探讨IRT在典型医学教育环境中的应用优势。

总结

CMT是医学教育中更常用的测量模型，简单直观。其局限性在于它依赖于样本，因为所有统计数据都与完成评估的特定考生样本混淆在一起。IRT得出的考生分数独立于特定的试题样本或评估刺激。此外，项目特征，如项目难度，独立于特定的考生样本。IRT的不变性特征允许轻松地对考试分数进行等值化，即将分数置于恒定的测量尺度上，并允许对学生能力随时间的变化进行合理比较。讨论了三种常见的IRT模型及其统计假设。概述了IRT在计算机自适应测试中的应用以及作为一种用于调整临床表现评估中评分者误差的有用方法。

结论

IRT测量是解决CMT一个主要问题的有力工具，即考生能力与项目特征的混淆。IRT测量解决了医学教育中的重要问题，如从绩效评估中消除评分者误差。

相似文献

Item response theory: applications of modern test theory in medical education.项目反应理论：现代测试理论在医学教育中的应用。

Med Educ. 2003 Aug;37(8):739-45. doi: 10.1046/j.1365-2923.2003.01587.x.

A primer on classical test theory and item response theory for assessments in medical education.医学教育评估中的经典测量理论和项目反应理论简介。

Med Educ. 2010 Jan;44(1):109-17. doi: 10.1111/j.1365-2923.2009.03425.x.

Evaluation of adding item-response theory analysis for evaluation of the European Board of Ophthalmology Diploma examination.评估添加项目反应理论分析对欧洲眼科委员会文凭考试评估的影响。

Acta Ophthalmol. 2013 Nov;91(7):e573-7. doi: 10.1111/aos.12135. Epub 2013 Aug 8.

Item response theory and the measurement of motor behavior.项目反应理论与运动行为测量

Res Q Exerc Sport. 1989 Dec;60(4):325-35. doi: 10.1080/02701367.1989.10607459.

The relationship between classical item characteristics and item response time on computer-based testing.基于计算机测试中经典项目特征与项目反应时间之间的关系。

Korean J Med Educ. 2019 Mar;31(1):1-9. doi: 10.3946/kjme.2019.113. Epub 2019 Mar 1.

Development of the Biostatistics and Clinical Epidemiology Skills (BACES) assessment for medical residents.针对住院医师的生物统计学与临床流行病学技能（BACES）评估的开发。

Postgrad Med J. 2015 Aug;91(1078):423-30. doi: 10.1136/postgradmedj-2014-133197. Epub 2015 Aug 7.

Comparing five depression measures in depressed Chinese patients using item response theory: an examination of item properties, measurement precision and score comparability.运用项目反应理论比较中国抑郁症患者的五种抑郁量表：项目特性、测量精度及分数可比性检验

Health Qual Life Outcomes. 2017 Apr 4;15(1):60. doi: 10.1186/s12955-017-0631-y.

Item response theory test equating in health sciences education.健康科学教育中的项目反应理论测试等值

Adv Health Sci Educ Theory Pract. 2008 Mar;13(1):3-10. doi: 10.1007/s10459-006-9020-8. Epub 2006 Jul 18.

Item analysis and evaluation in the examinations in the faculty of medicine at Ondokuz Mayis University.于尔居普·梅伊斯大学医学院考试中的项目分析与评估

Niger J Clin Pract. 2015 May-Jun;18(3):387-94. doi: 10.4103/1119-3077.151720.

Practical Consequences of Item Response Theory Model Misfit in the Context of Test Equating with Mixed-Format Test Data.在混合格式测试数据的测试等值背景下，项目反应理论模型失配的实际后果。

Front Psychol. 2017 Apr 4;8:484. doi: 10.3389/fpsyg.2017.00484. eCollection 2017.

引用本文的文献

Psychometric Evaluation of the Core Competencies in Disaster Nursing Scale for Disaster Rescue Nurses in Mainland China.中国大陆灾害救援护士灾害护理核心能力量表的心理测量学评价

J Nurs Manag. 2025 Aug 10;2025:9957270. doi: 10.1155/jonm/9957270. eCollection 2025.

Development of Immediate Self-Feedback Very Short Answer Questions: Implementing Testlet Response Theory in Formative Examinations Across Multiple Occasions.即时自我反馈型简答题的开发：在多次形成性考试中应用组题反应理论

Med Sci Educ. 2024 Sep 18;35(1):205-217. doi: 10.1007/s40670-024-02167-w. eCollection 2025 Feb.

Computer Adaptive vs. Non-adaptive Medical Progress Testing: Feasibility, Test Performance, and Student Experiences.计算机自适应测试与非自适应医学进展测试：可行性、测试性能和学生体验。

Perspect Med Educ. 2024 Jul 26;13(1):406-416. doi: 10.5334/pme.1345. eCollection 2024.

Appraisal of ChatGPT's Aptitude for Medical Education: Comparative Analysis With Third-Year Medical Students in a Pulmonology Examination.评估 ChatGPT 在医学教育中的能力：与三年级医学生在肺病学考试中的比较分析。

JMIR Med Educ. 2024 Jul 23;10:e52818. doi: 10.2196/52818.

Graphical Item Maps: providing clearer feedback on professional exam performance.图形项目地图：提供关于专业考试表现的更清晰反馈。

MedEdPublish (2016). 2018 Jun 1;7:116. doi: 10.15694/mep.2018.0000116.1. eCollection 2018.

Psychometric properties of the International Society of Wheelchair Professionals' basic manual wheelchair-service-provision knowledge Test Version 1 and development of Version 2.国际轮椅专业人员协会基础轮椅服务提供知识测试第 1 版的心理测量学特性及其第 2 版的开发。

PLoS One. 2023 Mar 23;18(3):e0281584. doi: 10.1371/journal.pone.0281584. eCollection 2023.

Psychometric evaluation of a national exam for clinical undergraduates.一项针对临床专业本科生的全国性考试的心理测量学评估。

Front Med (Lausanne). 2022 Dec 14;9:1037897. doi: 10.3389/fmed.2022.1037897. eCollection 2022.

Modeling Diagnostic Expertise in Cases of Irreducible Uncertainty: The Decision-Aligned Response Model.建模诊断专长在不可还原不确定性的案例中：决策一致的反应模型。

Acad Med. 2023 Jan 1;98(1):88-97. doi: 10.1097/ACM.0000000000004918. Epub 2022 Aug 9.

Assessing distinguishable social skills in medical admission: does construct-driven development solve validity issues of situational judgment tests?评估医学招生中的可区分社会技能：构建驱动的发展是否解决了情境判断测验的效度问题？

BMC Med Educ. 2022 Apr 20;22(1):293. doi: 10.1186/s12909-022-03305-x.

The do's, don'ts and don't knows of redressing differential attainment related to race/ethnicity in medical schools.纠正医学院种族/民族相关差异的方法、禁忌和未知因素。

Perspect Med Educ. 2022 Jan;11(1):1-14. doi: 10.1007/s40037-021-00696-3. Epub 2021 Dec 29.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

项目反应理论：现代测试理论在医学教育中的应用。

Item response theory: applications of modern test theory in medical education.

作者信息

机构信息

出版信息

CONTEXT

PURPOSE

SUMMARY

CONCLUSIONS

背景

目的

总结

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献