评估添加项目反应理论分析对欧洲眼科委员会文凭考试评估的影响。

Evaluation of adding item-response theory analysis for evaluation of the European Board of Ophthalmology Diploma examination.

机构信息

Department of Ophthalmology, Antwerp University Hospital, Antwerp, BelgiumDepartment of Ophthalmology, Faculty of Medicine, University of Antwerp, Antwerp, Belgium;Department of Ophthalmology, King's College Hospital, London, UK;Department of Scientific Research and Statistics, Antwerp University Hospital, Antwerp, Belgium;Institute of Health and Society, Newcastle University, Newcastle upon Tyne, UK;Department of Ophthalmology, Centre Hospitalier Universitaire, Dijon, France;Department of Ophthalmology, Maastricht Universitair Medisch Centrum (MUMC), Maastricht, The NetherlandsUniversity Eye Clinic, Ljubljana, Slovenia.

出版信息

Acta Ophthalmol. 2013 Nov;91(7):e573-7. doi: 10.1111/aos.12135. Epub 2013 Aug 8.

DOI:10.1111/aos.12135

PMID:23927770

Abstract

PURPOSE

To investigate whether introduction of item-response theory (IRT) analysis, in parallel to the 'traditional' statistical analysis methods available for performance evaluation of multiple T/F items as used in the European Board of Ophthalmology Diploma (EBOD) examination, has proved beneficial, and secondly, to study whether the overall assessment performance of the current written part of EBOD is sufficiently high (KR-20≥ 0.90) to be kept as examination format in future EBOD editions.

METHODS

'Traditional' analysis methods for individual MCQ item performance comprise P-statistics, Rit-statistics and item discrimination, while overall reliability is evaluated through KR-20 for multiple T/F items. The additional set of statistical analysis methods for the evaluation of EBOD comprises mainly IRT analysis. These analysis techniques are used to monitor whether the introduction of negative marking for incorrect answers (since EBOD 2010) has a positive influence on the statistical performance of EBOD as a whole and its individual test items in particular.

RESULTS

Item-response theory analysis demonstrated that item performance parameters should not be evaluated individually, but should be related to one another. Before the introduction of negative marking, the overall EBOD reliability (KR-20) was good though with room for improvement (EBOD 2008: 0.81; EBOD 2009: 0.78). After the introduction of negative marking, the overall reliability of EBOD improved significantly (EBOD 2010: 0.92; EBOD 2011:0.91; EBOD 2012: 0.91).

CONCLUSION

Although many statistical performance parameters are available to evaluate individual items, our study demonstrates that the overall reliability assessment remains the only crucial parameter to be evaluated allowing comparison. While individual item performance analysis is worthwhile to undertake as secondary analysis, drawing final conclusions seems to be more difficult. Performance parameters need to be related, as shown by IRT analysis. Therefore, IRT analysis has proved beneficial for the statistical analysis of EBOD. Introduction of negative marking has led to a significant increase in the reliability (KR-20 > 0.90), indicating that the current examination format can be kept for future EBOD examinations.

摘要

目的

探讨在欧洲眼科委员会文凭（EBOD）考试中，引入项目反应理论（IRT）分析是否与“传统”统计分析方法一样，有利于评估多项是非题项目的表现。其次，研究当前 EBOD 笔试部分的整体评估表现是否足够高（KR-20≥0.90），以保持其在未来 EBOD 考试版本中的考试形式。

方法

评估单个多项选择题项目表现的“传统”分析方法包括 P 统计量、Rit 统计量和项目区分度，而多个是非题的整体可靠性则通过 KR-20 进行评估。EBOD 的额外一套统计分析方法主要包括 IRT 分析。这些分析技术用于监测自 2010 年 EBOD 以来引入错误答案的负分制是否对 EBOD 的整体统计表现，特别是其各个测试项目产生积极影响。

结果

IRT 分析表明，项目表现参数不应该单独评估，而应该相互关联。在引入负分制之前，EBOD 的整体可靠性（KR-20）虽然有改进的空间，但表现良好（EBOD 2008：0.81；EBOD 2009：0.78）。引入负分制后，EBOD 的整体可靠性显著提高（EBOD 2010：0.92；EBOD 2011：0.91；EBOD 2012：0.91）。

结论

尽管有许多统计表现参数可用于评估单个项目，但我们的研究表明，整体可靠性评估仍然是唯一需要评估的关键参数，允许进行比较。虽然单项表现分析值得进行二次分析，但得出最终结论似乎更加困难。表现参数需要相互关联，如 IRT 分析所示。因此，IRT 分析已被证明对 EBOD 的统计分析有益。引入负分制后，可靠性显著提高（KR-20＞0.90），表明当前的考试形式可以保留用于未来的 EBOD 考试。

相似文献

Evaluation of adding item-response theory analysis for evaluation of the European Board of Ophthalmology Diploma examination.评估添加项目反应理论分析对欧洲眼科委员会文凭考试评估的影响。

Acta Ophthalmol. 2013 Nov;91(7):e573-7. doi: 10.1111/aos.12135. Epub 2013 Aug 8.

History and future of the European Board of Ophthalmology Diploma examination.欧洲眼科委员会文凭考试的历史与未来。

Acta Ophthalmol. 2013 Sep;91(6):589-93. doi: 10.1111/j.1755-3768.2012.02422.x. Epub 2012 May 3.

Introduction of subspecialty examinations by the European Board of Ophthalmology (EBO) in close collaboration with European Subspecialty Ophthalmological Societies: FEBO-SA.欧洲眼科委员会（EBO）与欧洲眼科亚专业学会：FEBO-SA密切合作推出亚专业考试。

Acta Ophthalmol. 2015 Dec;93(8):778-81. doi: 10.1111/aos.12738.

Procedural aspects of the organization of the comprehensive European Board of Ophthalmology Diploma examination.欧洲眼科综合委员会文凭考试组织的程序方面。

J Educ Eval Health Prof. 2016 Jul 26;13:27. doi: 10.3352/jeehp.2016.13.27. eCollection 2016.

EBOD--The european standard examination in Ophthalmology.欧洲眼科标准考试

Rom J Ophthalmol. 2015 Jul-Sep;59(3):127-8.

The ophthalmic clinical evaluation exercise: reliability determination.眼科临床评估练习：可靠性测定

Ophthalmology. 2005 Oct;112(10):1649-54. doi: 10.1016/j.ophtha.2005.06.006.

The Ophthalmic Clinical Evaluation Exercise (OCEX).眼科临床评估练习（OCEX）。

Ophthalmology. 2004 Jul;111(7):1271-4. doi: 10.1016/j.ophtha.2004.04.014.

OCEX reliability.OCEX可靠性。

Ophthalmology. 2006 Apr;113(4):717; author reply 717-8. doi: 10.1016/j.ophtha.2006.01.015.

A multicenter analysis of the ophthalmic knowledge assessment program and American Board of Ophthalmology written qualifying examination performance.多中心眼科知识评估计划和美国眼科学董事会笔试成绩分析。

Ophthalmology. 2012 Oct;119(10):1949-53. doi: 10.1016/j.ophtha.2012.06.010. Epub 2012 Jul 28.

Ophthalmic clinical evaluation exercise.眼科临床评估练习。

Ophthalmology. 2006 Oct;113(10):1892; author reply 1892-3. doi: 10.1016/j.ophtha.2006.03.006.

引用本文的文献

Fellow of the European board of ophthalmology glaucoma examination and diploma (FEBOS-Gl): update on 8 years of experience and future perspectives.欧洲眼科委员会青光眼检查与文凭会员（FEBOS-Gl）：8年经验更新与未来展望

Front Med (Lausanne). 2023 Jun 16;10:1163264. doi: 10.3389/fmed.2023.1163264. eCollection 2023.

Scoring Single-Response Multiple-Choice Items: Scoping Review and Comparison of Different Scoring Methods.单项选择题评分：不同评分方法的范围审查与比较

JMIR Med Educ. 2023 May 19;9:e44084. doi: 10.2196/44084.

ESHRE Clinical Embryologist certification: the first 10 years.欧洲人类生殖与胚胎学会临床胚胎学家认证：头十年

Hum Reprod Open. 2020 Jul 2;2020(3):hoaa026. doi: 10.1093/hropen/hoaa026. eCollection 2020.

Procedural aspects of the organization of the comprehensive European Board of Ophthalmology Diploma examination.欧洲眼科综合委员会文凭考试组织的程序方面。

J Educ Eval Health Prof. 2016 Jul 26;13:27. doi: 10.3352/jeehp.2016.13.27. eCollection 2016.

Can 'Fellow of the European Board of Ophthalmology Subspecialty Diploma in Glaucoma,' a subspecialty examination on glaucoma induce the qualification standard of glaucoma clinical practice in Europe?欧洲眼科委员会青光眼亚专科文凭的“欧洲眼科委员会青光眼亚专科文凭会员”，一项关于青光眼的亚专科考试能否引发欧洲青光眼临床实践的资格标准？

J Educ Eval Health Prof. 2016 Jul 28;13:28. doi: 10.3352/jeehp.2016.13.28. eCollection 2016.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

评估添加项目反应理论分析对欧洲眼科委员会文凭考试评估的影响。

Evaluation of adding item-response theory analysis for evaluation of the European Board of Ophthalmology Diploma examination.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献