AdventHealth Nicholson Center, 404 Celebration Place, Celebration, FL, 34747, USA.
Kaiser Permanente, Modesto, CA, USA.
J Robot Surg. 2022 Jun;16(3):559-562. doi: 10.1007/s11701-021-01284-7. Epub 2021 Jul 15.
Surgical education courses and certification tests require human evaluators to assess performance. Deep neural network (DNN) methods include techniques for classifying the content of videos which may enable automated scoring of video performance. Researchers collected 254 videos of two simulation-based exercises performed by attending surgeons. The performance in each video was scored by experienced instructors and converted into three class labels-expert, intermediate, and novice. The videos were cut into 2227 10 s clips for training DNNs in the Google Video Intelligence AutoML service. The DNN models matched the classifications applied by human evaluators with 83.1% accuracy for the Ring & Rail exercise and 80.8% for the Suture Sponge exercise. DNN models trained on individual exercises delivered very good results (80 + % accuracy) in matching the classifications assigned by human instructors and may eventually be able to supplement or replace human evaluators.
外科教育课程和认证测试需要人类评估者来评估表现。深度神经网络 (DNN) 方法包括用于对视频内容进行分类的技术,这些技术可能使视频表现的自动评分成为可能。研究人员收集了 254 段由主治外科医生执行的两个基于模拟的练习的视频。每个视频的表现都由经验丰富的讲师进行评分,并转换为三个类别标签——专家、中级和新手。这些视频被剪辑成 2227 个 10 秒的片段,用于在谷歌视频智能 AutoML 服务中训练 DNN。DNN 模型与人类评估者应用的分类相匹配,在 Ring & Rail 练习中准确率为 83.1%,在 Suture Sponge 练习中准确率为 80.8%。在单独的练习上训练的 DNN 模型在匹配人类讲师分配的分类方面取得了非常好的结果(80%+准确率),并且最终可能能够补充或替代人类评估者。