Northwestern Center for Advanced Surgical Education, Simulation Technology and Immersive Learning, Feinberg School of Medicine, Chicago, IL 60611, USA.
Surgery. 2012 Jun;151(6):796-802. doi: 10.1016/j.surg.2012.03.016.
The purpose of this study was to compare faculty ratings between live versus video-recorded resident performances and faculty versus skills coaches' ratings of video-recorded resident performances.
PGY1 residents were observed, video-recorded, and rated during a Verification of Proficiency examination on 4 stations (ie, suturing, laparotomy, central line, and cricothyroidotomy). One surgeon and 2 trained skills coaches independently rated each video-recorded performance (N = 25). The chi-square test was used to compare checklist ratings. Analysis of variance was used to compare global ratings. Intraclass correlations were used to evaluate inter-rater agreement.
There were no statistical differences in faculty checklist ratings for live versus video-recorded performances (P > .05), and we found a nearly perfect interrater agreement, intraclass correlation coefficient (ICC) = 0.99 (P < .001). When comparing faculty versus skills coaches' ratings on video-recorded performances, we found no differences for the global or checklist ratings. Inter-rater agreement was moderately high for the global ratings, ICC = 0.71 (P <. 0.01, 95% confidence interval 0.23-0.96), and nearly perfect for the checklist ratings, ICC = 0.99 (P < .001, 95% confidence interval 0.94-1.00).
When assessing residents' performances, use of video-recorded performance ratings and skills coaches may be viable alternatives to live ratings performed by surgical faculty.
本研究的目的是比较现场与视频记录的住院医师表现以及教员与技能教练对视频记录的住院医师表现的评分。
PGY1 住院医师在 4 个站点(即缝合、剖腹术、中央线和环甲膜切开术)的熟练程度验证考试中进行了观察、视频记录和评分。一位外科医生和 2 名经过培训的技能教练独立对每个视频记录的表现进行了评分(N=25)。卡方检验用于比较检查表评分。方差分析用于比较总体评分。组内相关系数用于评估评分者间的一致性。
现场与视频记录的表现相比,教员检查表评分无统计学差异(P>.05),我们发现评分者间的一致性非常高,组内相关系数(ICC)=0.99(P<.001)。当比较教员与技能教练对视频记录的表现评分时,我们发现全球或检查表评分没有差异。全球评分的评分者间一致性较高,ICC=0.71(P<.01,95%置信区间为 0.23-0.96),检查表评分的评分者间一致性几乎完美,ICC=0.99(P<.001,95%置信区间为 0.94-1.00)。
在评估住院医师的表现时,使用视频记录的表现评分和技能教练可能是替代外科教员进行现场评分的可行方法。