Suppr超能文献

基于美国医师执照考试第二步临床技能考试的综合临床问诊分数子成分测量的差异加权:对综合分数可靠性及通过-未通过决策的影响

Differential Weighting for Subcomponent Measures of Integrated Clinical Encounter Scores Based on the USMLE Step 2 CS Examination: Effects on Composite Score Reliability and Pass-Fail Decisions.

作者信息

Park Yoon Soo, Lineberry Matthew, Hyderi Abbas, Bordage Georges, Xing Kuan, Yudkowsky Rachel

机构信息

Y.S. Park is assistant professor, Department of Medical Education, University of Illinois at Chicago College of Medicine, Chicago, Illinois. M. Lineberry is director of Simulation Research, Assessment, and Outcomes, Zamierowski Institute for Experiential Learning, and assistant professor, Department of Health Policy and Management, University of Kansas Medical Center, Kansas City, Kansas. A. Hyderi is associate dean for curriculum and associate professor, Department of Family Medicine, University of Illinois at Chicago College of Medicine, Chicago, Illinois. G. Bordage is professor, Department of Medical Education, University of Illinois at Chicago College of Medicine, Chicago, Illinois. K. Xing is a doctoral student, Department of Educational Psychology, University of Illinois at Chicago College of Education, Chicago, Illinois. R. Yudkowsky is director, Graham Clinical Performance Center, and associate professor, Department of Medical Education, University of Illinois at Chicago College of Medicine, Chicago, Illinois.

出版信息

Acad Med. 2016 Nov;91(11 Association of American Medical Colleges Learn Serve Lead: Proceedings of the 55th Annual Research in Medical Education Sessions):S24-S30. doi: 10.1097/ACM.0000000000001359.

Abstract

PURPOSE

Medical schools administer locally developed graduation competency examinations (GCEs) following the structure of the United States Medical Licensing Examination Step 2 Clinical Skills that combine standardized patient (SP)-based physical examination and the patient note (PN) to create integrated clinical encounter (ICE) scores. This study examines how different subcomponent scoring weights in a locally developed GCE affect composite score reliability and pass-fail decisions for ICE scores, contributing to internal structure and consequential validity evidence.

METHOD

Data from two M4 cohorts (2014: n = 177; 2015: n = 182) were used. The reliability of SP encounter (history taking and physical examination), PN, and communication and interpersonal skills scores were estimated with generalizability studies. Composite score reliability was estimated for varying weight combinations. Faculty were surveyed for preferred weights on the SP encounter and PN scores. Composite scores based on Kane's method were compared with weighted mean scores.

RESULTS

Faculty suggested weighting PNs higher (60%-70%) than the SP encounter scores (30%-40%). Statistically, composite score reliability was maximized when PN scores were weighted at 40% to 50%. Composite score reliability of ICE scores increased by up to 0.20 points when SP-history taking (SP-Hx) scores were included; excluding SP-Hx only increased composite score reliability by 0.09 points. Classification accuracy for pass-fail decisions between composite and weighted mean scores was 0.77; misclassification was < 5%.

CONCLUSIONS

Medical schools and certification agencies should consider implications of assigning weights with respect to composite score reliability and consequences on pass-fail decisions.

摘要

目的

医学院校按照美国医师执照考试第二步临床技能的结构,进行本地开发的毕业能力考试(GCE),该考试将基于标准化病人(SP)的体格检查和病历记录(PN)相结合,以得出综合临床问诊(ICE)分数。本研究探讨了本地开发的GCE中不同子组件评分权重如何影响ICE分数的综合评分可靠性及及格与否的判定,为内部结构和效标效度证据提供支持。

方法

使用了来自两个M4队列(2014年: n = 177;2015年:n = 182)的数据。通过概化研究估计SP问诊(病史采集和体格检查)、PN以及沟通和人际技能分数的可靠性。针对不同的权重组合估计综合评分可靠性。就SP问诊和PN分数的首选权重对教员进行了调查。将基于凯恩方法的综合分数与加权平均分进行比较。

结果

教员建议病历记录的权重(60%-70%)应高于SP问诊分数(30%-40%)。从统计学角度看,当病历记录分数的权重为40%至50%时,综合评分可靠性达到最大值。纳入SP病史采集(SP-Hx)分数时,ICE分数的综合评分可靠性提高了0.20分;仅排除SP-Hx时,综合评分可靠性仅提高了0.09分。综合分数与加权平均分之间及格与否判定的分类准确率为0.77;错误分类率<5%。

结论

医学院校和认证机构应考虑权重分配对综合评分可靠性的影响以及对及格与否判定结果的影响。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验