• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

行为锚定在住院医师培训申请人评估中的影响:减少评分者偏差

The Impact of Behavioral Anchors in the Assessment of Fellowship Applicants: Reducing Rater Biases.

作者信息

Langhan Melissa L, Tiyyagura Gunjan

机构信息

Departments of Pediatrics and Emergency Medicine, Section of Pediatric Emergency Medicine, Yale University School of Medicine, New Haven, Conn 06510.

Departments of Pediatrics and Emergency Medicine, Section of Pediatric Emergency Medicine, Yale University School of Medicine, New Haven, Conn 06510.

出版信息

Acad Pediatr. 2022 Mar;22(2):313-318. doi: 10.1016/j.acap.2021.11.018. Epub 2021 Dec 2.

DOI:10.1016/j.acap.2021.11.018
PMID:34864133
Abstract

INTRODUCTION

No standardized evaluation tool for fellowship applicant assessment exists. Assessment tools are subject to biases and scoring tendencies which can skew scores and impact rankings. We aimed to develop and evaluate an objective assessment tool for fellowship applicants.

METHODS

We detected rater effects in our numerically scaled assessment tool (NST), which consisted of 10 domains rated from 0 to 9. We evaluated each domain, consolidated redundant categories, and removed subjective categories. For 7 remaining domains, we described each quality and developed a question with a behaviorally-anchored rating scale (BARS). Applicants were rated by 6 attendings. Ratings from the NST in 2018 were compared with the BARS from 2020 for distribution of data, skewness, and inter-rater reliability.

RESULTS

Thirty-four applicants were evaluated with the NST and 38 with the BARS. Demographics were similar between groups. The median score on the NST was 8 out of 9; scores <5 were used in less than 1% of all evaluations. Distribution of data was improved in the BARS tool. In the NST, scores from 6 of 10 domains demonstrated moderate skewness and 3 high skewness. Three of the 7 domains in the BARS showed moderate skewness and none had high skewness. Two of 10 domains in the NST vs 5 of 7 domains in the BARS achieved good inter-rater reliability.

CONCLUSION

Replacing a standard numeric scale with a BARS normalized the distribution of data, reduced skewness, and enhanced inter-rater reliability in our evaluation tool. This provides some validity evidence for improved applicant assessment and ranking.

摘要

引言

目前尚无用于评估专科住院医师培训申请人的标准化评估工具。评估工具容易受到偏见和评分倾向的影响,这可能会使分数产生偏差并影响排名。我们旨在开发并评估一种针对专科住院医师培训申请人的客观评估工具。

方法

我们在数字评分评估工具(NST)中检测到评分者效应,该工具由10个领域组成,评分范围为0至9。我们对每个领域进行了评估,合并了冗余类别,并删除了主观类别。对于剩下的7个领域,我们描述了每个质量特征,并制定了带有行为锚定评分量表(BARS)的问题。6名主治医师对申请人进行评分。将2018年NST的评分与2020年BARS的评分进行比较,以分析数据分布、偏度和评分者间信度。

结果

34名申请人接受了NST评估,38名接受了BARS评估。两组的人口统计学特征相似。NST的中位数分数为9分中的8分;所有评估中,分数低于5分的情况不到1%。BARS工具的数据分布得到了改善。在NST中,10个领域中有6个领域的分数呈现中度偏度,3个领域呈现高度偏度。BARS的7个领域中有3个领域呈现中度偏度,没有一个领域呈现高度偏度。NST的10个领域中有2个领域与BARS的7个领域中有5个领域实现了良好的评分者间信度。

结论

在我们的评估工具中,用BARS取代标准数字量表使数据分布标准化,减少了偏度,并提高了评分者间信度。这为改进申请人评估和排名提供了一些有效性证据。

相似文献

1
The Impact of Behavioral Anchors in the Assessment of Fellowship Applicants: Reducing Rater Biases.行为锚定在住院医师培训申请人评估中的影响:减少评分者偏差
Acad Pediatr. 2022 Mar;22(2):313-318. doi: 10.1016/j.acap.2021.11.018. Epub 2021 Dec 2.
2
Can Behavior-Based Interviews Reduce Bias in Fellowship Applicant Assessment?基于行为的面试能否减少住院医师培训申请人评估中的偏见?
Acad Pediatr. 2022 Apr;22(3):478-485. doi: 10.1016/j.acap.2021.12.017. Epub 2021 Dec 18.
3
Inter-rater reliability and validity of two ataxia rating scales in children with brain tumours.两种共济失调评定量表在脑肿瘤患儿中的评分者间信度和效度
Childs Nerv Syst. 2015 May;31(5):693-7. doi: 10.1007/s00381-015-2650-5. Epub 2015 Mar 4.
4
Disparities in ratings of internal and external applicants: A case for model-based inter-rater reliability.内部和外部申请人评级的差异:基于模型的评级者间可靠性的案例。
PLoS One. 2018 Oct 5;13(10):e0203002. doi: 10.1371/journal.pone.0203002. eCollection 2018.
5
Effect of virtual interviewing on applicant approach to and perspective of the Maternal-Fetal Medicine Subspecialty Fellowship Match.虚拟面试对申请者对母胎医学专科研究员匹配的方法和看法的影响。
Am J Obstet Gynecol MFM. 2021 May;3(3):100326. doi: 10.1016/j.ajogmf.2021.100326. Epub 2021 Feb 3.
6
Interviewer bias in selection of anaesthesia Fellows: A single-institution quality assessment study.
Anaesth Intensive Care. 2020 Sep;48(5):358-365. doi: 10.1177/0310057X20945326. Epub 2020 Oct 5.
7
Evaluation of a Simpler Tool to Assess Nontechnical Skills During Simulated Critical Events.评估一种用于在模拟危急事件中评估非技术技能的更简单工具。
Simul Healthc. 2017 Apr;12(2):69-75. doi: 10.1097/SIH.0000000000000199.
8
Inter-rater Reliability Assessment of ASPECT-R: (A Study Pragmatic-Explanatory Characterization Tool-Rating).ASPECT-R的评分者间信度评估:(一项实用-解释性特征工具-评分研究)
Innov Clin Neurosci. 2016 Apr 1;13(3-4):27-31. eCollection 2016 Mar-Apr.
9
Association of Mentor-to-Program Contact and Applicant Rank Disclosure With Vitreoretinal Fellowship Applicant's Final Match Outcome in 2016 and 2017.2016 年和 2017 年导师与项目联系和申请人排名披露与玻璃体视网膜研究员申请人最终匹配结果的关联。
JAMA Ophthalmol. 2018 Jun 1;136(6):642-647. doi: 10.1001/jamaophthalmol.2018.1107.
10
Choosing a fellow or fellowship: a survey of pediatric otolaryngologists.选择研究员或研究员职位:儿科耳鼻喉科医生的调查。
JAMA Otolaryngol Head Neck Surg. 2014 Feb;140(2):102-5. doi: 10.1001/jamaoto.2013.5859.