Guerrasio Jeannette, Cumbler Ethan, Trosterman Adam, Wald Heidi, Brandenburg Suzanne, Aagaard Eva
J Grad Med Educ. 2012 Mar;4(1):47-51. doi: 10.4300/JGME-D-11-00145.1.
Postrotation evaluations are frequently used by residency program directors for early detection of residents with academic difficulties; however, the accuracy of these evaluations in assessing resident performance has been questioned.
This retrospective case-control study examines the ability of postrotation evaluation characteristics to predict the need for remediation. We compared the evaluations of 17 residents who were placed on academic warning or probation, from 2000 to 2007, with those for a group of peers matched on sex, postgraduate year (PGY), and entering class.
The presence of an outlier evaluation, the number of words written in the comments section, and the percentage of evaluations with negative or ambiguous comments were all associated with the need for remediation (P = .01, P = .001, P = .002, P = < .001, respectively). In contrast, United States Medical Licensing Examination step 1 and step 2 scores, total number of evaluations received, and percentage of positive comments on the evaluations were not associated with the need for remediation (P = .06, P = .87, P = .55, respectively).
Despite ambiguous evaluation comments, the length and percentage of ambiguous or negative comments did indicate future need for remediation.
Our study demonstrates that postrotation evaluation characteristics can be used to identify residents as risk. However, larger prospective studies, encompassing multiple institutions, are needed to validate various evaluation methods in measuring resident performance and to accurately predict the need for remediation.
住院医师培训项目主任经常使用轮转后评估来早期发现有学业困难的住院医师;然而,这些评估在评估住院医师表现方面的准确性受到了质疑。
这项回顾性病例对照研究考察了轮转后评估特征预测补救需求的能力。我们比较了2000年至2007年期间17名被置于学业警告或留校察看的住院医师的评估结果与一组在性别、研究生年级(PGY)和入学班级相匹配的同行的评估结果。
存在异常值评估、评语部分所写的字数以及带有负面或模糊评语的评估所占百分比均与补救需求相关(P分别为0.01、0.001、0.002、<0.001)。相比之下,美国医学执照考试第一步和第二步的分数、收到的评估总数以及评估中正面评语的百分比与补救需求无关(P分别为0.06、0.87、0.55)。
尽管评估评语模糊,但评语的长度以及模糊或负面评语的百分比确实表明了未来的补救需求。
我们的研究表明,轮转后评估特征可用于识别有风险的住院医师。然而,需要开展涵盖多个机构的更大规模的前瞻性研究,以验证各种评估方法在衡量住院医师表现方面的有效性,并准确预测补救需求。