School of Translational Medicine, Academy at UHSM, University of Manchester, Manchester, United Kingdom.
JAMA. 2012 Dec 5;308(21):2226-32. doi: 10.1001/jama.2012.36515.
CONTEXT: Competency-based models of education require assessments to be based on individuals' capacity to perform, yet the nature of human judgment may fundamentally limit how accurately such assessment is possible.
OBJECTIVE: To determine whether recent observations of the Mini Clinical Evaluation Exercise (Mini-CEX) performance of postgraduate year 1 physicians influence raters' scores of subsequent performances, consistent with either anchoring bias (scores biased toward previous experience) or contrast bias (scores biased away from previous experience).
DESIGN, SETTING, AND PARTICIPANTS: Internet-based randomized, blinded experiment using videos of Mini-CEX assessments of postgraduate year 1 trainees interviewing new internal medicine patients. Participants were 41 attending physicians from England and Wales experienced with the Mini-CEX, with 20 watching and scoring 3 good trainee performances and 21 watching and scoring 3 poor performances. All then watched and scored the same 3 borderline video performances. The study was completed between July and November 2011.
MAIN OUTCOME MEASURES: The primary outcome was scores assigned to the borderline videos, using a 6-point Likert scale (anchors included: 1, well below expectations; 3, borderline; 6, well above expectations). Associations were tested in a multivariable analysis that included participants' sex, years of practice, and the stringency index (within-group z score of initial 3 ratings).
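The stringency index described above is a within-group z score of each rater's initial 3 ratings, i.e., how lenient or harsh a rater's mean initial score is relative to the other raters in the same priming group. A minimal sketch of that computation follows; the function name and the example scores are illustrative, not taken from the study.

```python
from statistics import mean, pstdev

def stringency_index(rater_mean, group_means):
    """Within-group z score: a rater's mean initial rating standardized
    against the means of all raters in the same priming group."""
    mu = mean(group_means)
    sigma = pstdev(group_means)  # population SD over the group's raters
    return (rater_mean - mu) / sigma

# Hypothetical mean scores over each rater's first 3 videos, one group
group = [4.0, 3.3, 3.7, 4.3, 3.0, 3.7]
print(round(stringency_index(4.0, group), 2))
```

A positive index marks a relatively lenient rater within the group, a negative index a relatively stringent one; by construction the indices average to zero within each group.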
RESULTS: The mean rating score assigned to the borderline videos was 2.7 (95% CI, 2.4-3.0) by physicians exposed to good performances vs 3.4 (95% CI, 3.1-3.7) by those exposed to poor performances (difference, 0.67 [95% CI, 0.28-1.07]; P = .001). Borderline videos were categorized as consistent with failing scores in 33 of 60 assessments (55%) in those exposed to good performances and in 15 of 63 assessments (24%) in those exposed to poor performances (P < .001). They were categorized as consistent with passing scores in 5 of 60 assessments (8.3%) in those exposed to good performances compared with 25 of 63 assessments (39.7%) in those exposed to poor performances (P < .001). Sex and years of attending practice were not associated with scores. The priming condition (good vs poor performances) and the stringency index jointly accounted for 45% of the observed variation in raters' scores for the borderline videos (P < .001).
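The failing-score proportions reported above (33/60 after good videos vs 15/63 after poor videos) can be sanity-checked with a pooled two-proportion z test. The abstract does not state which test the authors used, so this is an illustrative check, not the study's analysis.

```python
import math

def two_prop_z(x1, n1, x2, n2):
    """Two-sided two-proportion z test with a pooled proportion;
    a rough check, not necessarily the study's exact test."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    # Normal CDF via the error function: Phi(x) = 0.5 * (1 + erf(x / sqrt(2)))
    pval = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))
    return z, pval

# Failing categorizations: good-video group vs poor-video group
z, p = two_prop_z(33, 60, 15, 63)
print(f"z = {z:.2f}, p = {p:.4f}")
```

The resulting P value falls well below .001, consistent with the significance level reported in the abstract.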
CONCLUSIONS: In an experimental setting, attending physicians exposed to videos of good medical trainee performances rated subsequent borderline performances lower than those who had been exposed to poor performances, consistent with a contrast bias.