Sloan D A, Donnelly M B, Schwartz R W, Felts J L, Blue A V, Strodel W E
Department of Surgery, University of Kentucky College of Medicine, Lexington 40536, USA.
J Surg Res. 1996 Jun;63(1):225-30. doi: 10.1006/jsre.1996.0252.
This study had two purposes: determining the reliability and validity of the Objective Structured Clinical Examination (OSCE) in assessing performance by trainees at all levels, including medical students and chief residents; and estimating the impact of providing OSCE participants with immediate feedback about their performance. A comprehensive 210-min OSCE was administered to 53 surgical residents and 6 junior medical students. Faculty experts proctored all patient stations and provided immediate feedback to participants after the patient interaction segments (Part A). The participants then answered questions about the patients seen (Part B). The reliability of the OSCE was high (.91), identical to that of a previous resident OSCE with no feedback. The standard error of measurement for both parts was approximately 4%. At the 95% confidence interval, each participant's actual level of clinical performance (Part A) and clinical knowledge (Part B) could be estimated with an error of +/-8%. Participants showed significant differences in clinical performance (Part A, P < 0.01) and knowledge (Part B, P < 0.01) by level of training. Most participants (74%) rated the OSCE as an above average or outstanding educational method. The OSCE is a valid and reliable test of residents' clinical skills. Feedback to participants during the OSCE was positively received and did not perturb test reliability.
确定客观结构化临床考试(OSCE)在评估包括医学生和住院总医师在内的各级学员表现时的可靠性和有效性;以及评估向OSCE参与者提供其表现的即时反馈所产生的影响。对53名外科住院医师和6名低年级医学生进行了一场时长210分钟的全面OSCE。教员专家监考所有患者站,并在患者互动环节(A部分)结束后向参与者提供即时反馈。然后参与者回答有关所诊治患者的问题(B部分)。OSCE的可靠性很高(.91),与之前未提供反馈的住院医师OSCE相同。两部分的测量标准误差约为4%。在95%置信区间内,每位参与者的实际临床操作水平(A部分)和临床知识水平(B部分)的估计误差为±8%。参与者在临床操作(A部分,P < 0.01)和知识(B部分,P < 0.01)方面因培训水平不同而存在显著差异。大多数参与者(74%)将OSCE评为高于平均水平或出色的教育方法。OSCE是对住院医师临床技能的有效且可靠的测试。在OSCE期间向参与者提供的反馈得到了积极认可,且未干扰测试的可靠性。