• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

“在同一页上”?GP 考官反馈对临床评估中评分严重程度差异的影响:一项干预前后研究。

"On the same page"? The effect of GP examiner feedback on differences in rating severity in clinical assessments: a pre/post intervention study.

机构信息

Primary Care Clinical Unit, Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia.

Royal Brisbane & Women's Hospitals, Level 8, Health Sciences Building, Herston, QLD, Australia.

出版信息

BMC Med Educ. 2017 Jun 6;17(1):101. doi: 10.1186/s12909-017-0929-9.

DOI:10.1186/s12909-017-0929-9
PMID:28587597
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5461633/
Abstract

BACKGROUND

Robust and defensible clinical assessments attempt to minimise differences in student grades which are due to differences in examiner severity (stringency and leniency). Unfortunately there is little evidence to date that examiner training and feedback interventions are effective; "physician raters" have indeed been deemed "impervious to feedback". Our aim was to investigate the effectiveness of a general practitioner examiner feedback intervention, and explore examiner attitudes to this.

METHODS

Sixteen examiners were provided with a written summary of all examiner ratings in medical student clinical case examinations over the preceding 18 months, enabling them to identify their own rating data and compare it with other examiners. Examiner ratings and examiner severity self-estimates were analysed pre and post intervention, using non-parametric bootstrapping, multivariable linear regression, intra-class correlation and Spearman's correlation analyses. Examiners completed a survey exploring their perceptions of the usefulness and acceptability of the intervention, including what (if anything) examiners planned to do differently as a result of the feedback.

RESULTS

Examiner severity self-estimates were relatively poorly correlated with measured severity on the two clinical case examination types pre-intervention (0.29 and 0.67) and were less accurate post-intervention. No significant effect of the intervention was identified, when differences in case difficulty were controlled for, although there were fewer outlier examiners post-intervention. Drift in examiner severity over time prior to the intervention was observed. Participants rated the intervention as interesting and useful, and survey comments indicated that fairness, reassurance, and understanding examiner colleagues are important to examiners.

CONCLUSIONS

Despite our participants being receptive to our feedback and wanting to be "on the same page", we did not demonstrate effective use of the feedback to change their rating behaviours. Calibration of severity appears to be difficult for examiners, and further research into better ways of providing more effective feedback is indicated.

摘要

背景

稳健且合理的临床评估旨在将学生成绩的差异最小化,这些差异归因于考试者的严格程度(严格和宽松)差异。不幸的是,迄今为止,几乎没有证据表明考试者培训和反馈干预措施是有效的;“医生评分者”确实被认为“对反馈无动于衷”。我们的目的是调查普通科医生考试者反馈干预措施的有效性,并探讨考试者对此的态度。

方法

为 16 名考试者提供了过去 18 个月中所有考试者对医学生临床病例考试评分的书面摘要,使他们能够识别自己的评分数据,并将其与其他考试者进行比较。在干预前后,使用非参数引导、多变量线性回归、组内相关和斯皮尔曼相关分析,分析考试者评分和考试者严重程度自我估计。考试者完成了一项调查,探讨他们对干预措施的有用性和可接受性的看法,包括(如果有的话)考试者计划因反馈而有所不同。

结果

在干预前,考试者严重程度自我估计与两种临床病例考试类型的测量严重程度相关性较差(0.29 和 0.67),且干预后准确性降低。在控制病例难度差异的情况下,干预没有产生显著效果,尽管干预后考试者的异常值较少。在干预前,考试者的严重程度随时间推移而发生漂移。参与者认为干预措施有趣且有用,调查评论表明公平、放心和了解考试者同事对考试者很重要。

结论

尽管我们的参与者对我们的反馈持开放态度并希望“保持一致”,但我们没有证明有效地利用反馈来改变他们的评分行为。考试者的严重程度似乎难以校准,需要进一步研究如何提供更有效的反馈。

相似文献

1
"On the same page"? The effect of GP examiner feedback on differences in rating severity in clinical assessments: a pre/post intervention study.“在同一页上”?GP 考官反馈对临床评估中评分严重程度差异的影响:一项干预前后研究。
BMC Med Educ. 2017 Jun 6;17(1):101. doi: 10.1186/s12909-017-0929-9.
2
Exploration of a possible relationship between examiner stringency and personality factors in clinical assessments: a pilot study.临床评估中检查者严格程度与人格因素之间可能关系的探索:一项试点研究。
BMC Med Educ. 2014 Dec 31;14:1052. doi: 10.1186/s12909-014-0280-3.
3
Assessment of examiner leniency and stringency ('hawk-dove effect') in the MRCP(UK) clinical examination (PACES) using multi-facet Rasch modelling.使用多维度Rasch模型评估英国皇家内科医师学会临床考试(PACES)中主考官的宽松和严格程度(“鹰鸽效应”)。
BMC Med Educ. 2006 Aug 18;6:42. doi: 10.1186/1472-6920-6-42.
4
Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs.开发一种基于视频的方法,以比较和调整完全嵌套 OSCE 中的考官效应。
Med Educ. 2019 Mar;53(3):250-263. doi: 10.1111/medu.13783. Epub 2018 Dec 21.
5
Comparison of student examiner to faculty examiner scoring and feedback in an OSCE.学生考官与教师考官在 OSCE 中的评分和反馈比较。
Med Educ. 2011 Feb;45(2):183-91. doi: 10.1111/j.1365-2923.2010.03800.x. Epub 2010 Dec 17.
6
Peer feedback for examiner quality assurance on MRCGP International South Asia: a mixed methods study.MRCGP 国际南亚考试官质量保证的同行反馈:混合方法研究。
BMC Med Educ. 2017 Dec 8;17(1):244. doi: 10.1186/s12909-017-1090-1.
7
MRCGP CSA: are the examiners biased, favouring their own by sex, ethnicity, and degree source?MRCGP CSA:考官是否存在偏见,偏向于自己的性别、种族和学位来源?
Br J Gen Pract. 2013 Nov;63(616):e718-25. doi: 10.3399/bjgp13X674396.
8
Examiner training: A study of examiners making sense of norm-referenced feedback.主考培训:对主考理解常模参照反馈的研究。
Med Teach. 2019 Jul;41(7):787-794. doi: 10.1080/0142159X.2019.1579902. Epub 2019 Mar 26.
9
Examiner fatigue in communication skills objective structured clinical examinations.沟通技能客观结构化临床考试中的考官疲劳
Med Educ. 2001 May;35(5):444-9. doi: 10.1046/j.1365-2923.2001.00893.x.
10
Undesired variance due to examiner stringency/leniency effect in communication skill scores assessed in OSCEs.客观结构化临床考试(OSCE)中评估沟通技能得分时,因考官的严格/宽松效应导致的非期望方差。
Adv Health Sci Educ Theory Pract. 2008 Dec;13(5):617-32. doi: 10.1007/s10459-007-9068-0. Epub 2007 Jul 3.

本文引用的文献

1
A method for identifying extreme OSCE examiners.一种识别极端客观结构化临床考试考官的方法。
Clin Teach. 2013 Feb;10(1):27-31. doi: 10.1111/j.1743-498X.2012.00607.x.
2
Generalizability theory for the perplexed: a practical introduction and guide: AMEE Guide No. 68.通用化理论研究:实用入门与指导——AMEE 指南第 68 号。
Med Teach. 2012;34(11):960-92. doi: 10.3109/0142159X.2012.703791.
3
Expertise in performance assessment: assessors' perspectives.绩效评估专业知识:评估者的视角。
Adv Health Sci Educ Theory Pract. 2013 Oct;18(4):559-71. doi: 10.1007/s10459-012-9392-x. Epub 2012 Jul 31.
4
Workplace-based assessment: raters' performance theories and constructs.基于工作场所的评估:评估者的绩效理论和构念。
Adv Health Sci Educ Theory Pract. 2013 Aug;18(3):375-96. doi: 10.1007/s10459-012-9376-x. Epub 2012 May 17.
5
Seeing the same thing differently: mechanisms that contribute to assessor differences in directly-observed performance assessments.从不同角度看待相同事物:直接观察绩效评估中评估者差异的促成机制。
Adv Health Sci Educ Theory Pract. 2013 Aug;18(3):325-41. doi: 10.1007/s10459-012-9372-1. Epub 2012 May 12.
6
Rater-based assessments as social judgments: rethinking the etiology of rater errors.基于评定者的评估即社会判断:重新思考评定者误差的病因。
Acad Med. 2011 Oct;86(10 Suppl):S1-7. doi: 10.1097/ACM.0b013e31822a6cf8.
7
Should candidate scores be adjusted for interviewer stringency or leniency in the multiple mini-interview?多站式面试中,应否根据面试官的严格或宽松程度调整考生分数?
Med Educ. 2010 Jul;44(7):690-8. doi: 10.1111/j.1365-2923.2010.03689.x.
8
Effect of rater training on reliability and accuracy of mini-CEX scores: a randomized, controlled trial.评估者培训对迷你临床评估练习(mini-CEX)评分可靠性和准确性的影响:一项随机对照试验。
J Gen Intern Med. 2009 Jan;24(1):74-9. doi: 10.1007/s11606-008-0842-3. Epub 2008 Nov 11.
9
Undesired variance due to examiner stringency/leniency effect in communication skill scores assessed in OSCEs.客观结构化临床考试(OSCE)中评估沟通技能得分时,因考官的严格/宽松效应导致的非期望方差。
Adv Health Sci Educ Theory Pract. 2008 Dec;13(5):617-32. doi: 10.1007/s10459-007-9068-0. Epub 2007 Jul 3.
10
Assessment of examiner leniency and stringency ('hawk-dove effect') in the MRCP(UK) clinical examination (PACES) using multi-facet Rasch modelling.使用多维度Rasch模型评估英国皇家内科医师学会临床考试(PACES)中主考官的宽松和严格程度(“鹰鸽效应”)。
BMC Med Educ. 2006 Aug 18;6:42. doi: 10.1186/1472-6920-6-42.