Interface Lab, Ohio Supercomputer Center, Columbus, Ohio.
Interface Lab, Ohio Supercomputer Center, Columbus, Ohio.
J Surg Educ. 2018 May-Jun;75(3):678-687. doi: 10.1016/j.jsurg.2017.10.006. Epub 2017 Nov 1.
The objective of this work is to obtain validity evidence for an evaluation instrument used to assess the performance level of a mastoidectomy. The instrument has been previously described and had been formulated by a multi-institutional consortium.
Mastoidectomies were performed on a virtual temporal bone system and then rated by experts using a previously described 15 element task-based checklist. Based on the results, a second, similar checklist was created and a second round of rating was performed.
Twelve otolaryngological surgical training programs in the United States.
In all, 65 mastoidectomy performances were evaluated coming from 37 individuals with a variety of temporal bone dissection experience, from medical students to attending physicians. Raters were attending surgeons from 12 different institutions.
Intraclass correlation scores varied greatly between items in the checklist with some being low and some being high. Percentage agreement scores were similar to previous rating instruments. There is strong evidence that a high score on the task-based checklist is necessary for a rater to consider a mastoidectomy to be performed at the level of an expert but a high score is not a sufficient condition.
Rewording of the instrument items to focus on safety does not result in increased reliability of the instrument. The strong result of the Necessary Condition Analysis suggests that going beyond simple correlation measures can give extra insight into grading results. Additionally, we suggest using a multiple point scale instead of a binary pass/fail question combined with descriptive mastery levels.
本研究旨在获得用于评估乳突切除术绩效水平的评估工具的效度证据。该工具先前已被描述并由多机构联盟制定。
在虚拟颞骨系统上进行乳突切除术,然后由专家使用先前描述的 15 项基于任务的检查表进行评分。根据结果,创建了第二个类似的检查表,并进行了第二轮评分。
美国 12 个耳鼻喉科手术培训计划。
共有 65 例乳突切除术表现来自 37 名具有不同颞骨解剖经验的个体,包括医学生和主治医生。评估者为来自 12 个不同机构的主治外科医生。
检查表中的项目之间的组内相关系数差异很大,有些项目的相关性较低,有些项目的相关性较高。百分比一致评分与先前的评分工具相似。有强有力的证据表明,任务检查表上的高分对于评估者认为乳突切除术达到专家水平是必要的,但高分不是充分条件。
将仪器项目重新措辞以关注安全性并不会提高仪器的可靠性。必要条件分析的有力结果表明,超越简单的相关性测量可以深入了解评分结果。此外,我们建议使用多点量表而不是二进制通过/失败问题,并结合描述性掌握水平。