Shippey Stuart, Handa Victoria L, Chen Tiffany L, Chou Betty, Bowen Craig W
Department of Gynecology and Obstetrics, Johns Hopkins University, School of Medicine, Baltimore, Maryland 21224, USA.
J Surg Educ. 2009 Jan-Feb;66(1):31-4. doi: 10.1016/j.jsurg.2008.09.001.
To collect evidence for the validity and reliability of an assessment tool for simulated subcuticular suturing.
Three subjects were videotaped while closing a simulated incision in a plastic model. The 3 trials were viewed independently by 7 faculty examiners masked to subject identity. Global rating and task-specific scales were used to assess subject competence. The mean scores were compared among the 3 subjects and 7 evaluators using analysis of variance.
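The comparison described above can be sketched in code. The abstract does not state the exact ANOVA design, so the following is a minimal sketch that assumes a two-way layout without replication (one score per subject-evaluator pair). The function name `two_way_anova` and the example scores are hypothetical and are not data from the study.

```python
def two_way_anova(table):
    """Two-way ANOVA without replication (assumed design, see lead-in).

    table[i][j] is the score given to subject i by evaluator j.
    Returns F statistics and degrees of freedom for both factors.
    """
    r, c = len(table), len(table[0])
    grand = sum(sum(row) for row in table) / (r * c)
    row_means = [sum(row) / c for row in table]
    col_means = [sum(col) / r for col in zip(*table)]

    # Partition the total sum of squares into subject, evaluator, and error parts.
    ss_subjects = c * sum((m - grand) ** 2 for m in row_means)
    ss_evaluators = r * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in table for x in row)
    ss_error = ss_total - ss_subjects - ss_evaluators

    df_subjects, df_evaluators = r - 1, c - 1
    df_error = df_subjects * df_evaluators
    ms_error = ss_error / df_error

    return {
        "F_subjects": (ss_subjects / df_subjects) / ms_error,
        "F_evaluators": (ss_evaluators / df_evaluators) / ms_error,
        "df_subjects": df_subjects,
        "df_evaluators": df_evaluators,
        "df_error": df_error,
    }


# Hypothetical scores: 3 subjects (rows) rated by 7 evaluators (columns).
example_scores = [
    [12, 13, 11, 12, 14, 12, 13],
    [18, 19, 18, 17, 19, 18, 20],
    [25, 24, 26, 25, 24, 26, 25],
]
result = two_way_anova(example_scores)
print(result["F_subjects"], result["F_evaluators"])
```

Each F statistic would then be compared against the F distribution with the corresponding degrees of freedom to obtain a p-value. A large F for subjects alongside a small F for evaluators is the pattern the abstract reports.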
Significant differences were found among the mean global rating scores for the 3 subjects but not among the evaluators. Similarly, significant differences were found among the mean task-specific scale scores for the 3 subjects but not among the evaluators. Cronbach's alpha for the global rating (0.89) and task-specific (0.93) scores suggested high internal consistency for each scale.
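For reference, Cronbach's alpha for a scale with k items is k/(k-1) multiplied by (1 minus the sum of the item variances divided by the variance of the total score). A minimal sketch of that standard formula follows. How the authors arranged their ratings into items and observations is not stated in the abstract, so the row-per-rated-performance layout, the function name `cronbach_alpha`, and the example scores are assumptions for illustration only.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha from a list of rows.

    Each row is one rated performance (assumed layout); each column is one
    item of the scale. Uses sample variances (n - 1 denominator).
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    item_variance_sum = sum(variance(col) for col in zip(*scores))
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical item scores: 5 rated performances on a 4-item scale.
example_items = [
    [2, 3, 2, 3],
    [4, 4, 5, 4],
    [3, 3, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]
print(cronbach_alpha(example_items))
```

Values near 1 indicate that the items vary together, which is how the reported 0.89 and 0.93 are read as high internal consistency.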
These findings provide evidence for the discriminant validity, internal consistency, and inter-rater reliability of both the global rating and task-specific scales of our assessment tool.