Weitz Gunther, Vinzentius Christian, Twesten Christoph, Lehnert Hendrik, Bonnemeier Hendrik, König Inke R
Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Medizinische Klinik I, Lübeck, Germany.
Institut für Qualitätsentwicklung an Schulen Schleswig-Holstein, Kronshagen, Germany.
GMS Z Med Ausbild. 2014 Nov 17;31(4):Doc41. doi: 10.3205/zma000933. eCollection 2014.
The accuracy and reproducibility of medical skills assessment are generally low, and rater training has little or no effect. Our knowledge in this field, however, relies on studies involving video ratings of overall clinical performances. We hypothesised that rater training focussing on the frame of reference could improve grading accuracy in the curricular assessment of a highly standardised physical head-to-toe examination.
Twenty-one raters assessed the performance of 242 third-year medical students. Eleven raters had been randomly assigned to undergo a brief frame-of-reference training a few days before the assessment. A total of 218 encounters were successfully recorded on video and re-assessed independently by three additional observers. Accuracy was defined as the concordance between a rater's grade and the median of the observers' grades. After the assessment, both students and raters completed a questionnaire on their views of the assessment.
Rater training did not have a measurable influence on accuracy. However, trained raters graded significantly more stringently than untrained raters, and their overall stringency was closer to that of the observers. The questionnaire indicated a higher awareness of the halo effect among the trained raters. Although students' self-assessments mirrored the raters' assessments in both groups, students assessed by trained raters were more discontented with their grades.
While training had some marginal effects, it failed to have an impact on individual accuracy. These results from real-life encounters are consistent with previous studies on rater training using video assessments of clinical performances. The high degree of standardisation in this study was not sufficient to harmonise the trained raters' grading. The data support the notion that the process of appraising medical performance is highly individual. Frame-of-reference training, as applied here, does not effectively adjust physicians' judgement of medical students in real-life assessments.