Neurosurgical Simulation and Artificial Intelligence Learning Centre, Department of Neurology and Neurosurgery, Montreal Neurological Institute and Hospital, McGill University, Montreal, Canada.
Faculty of Medicine and Health Sciences, McGill University, Montreal, Canada.
JAMA Netw Open. 2022 Feb 1;5(2):e2149008. doi: 10.1001/jamanetworkopen.2021.49008.
To better understand the emerging role of artificial intelligence (AI) in surgical training, efficacy of AI tutoring systems, such as the Virtual Operative Assistant (VOA), must be tested and compared with conventional approaches.
To determine how VOA and remote expert instruction compare in learners' skill acquisition, affective, and cognitive outcomes during surgical simulation training.
DESIGN, SETTING, AND PARTICIPANTS: This instructor-blinded randomized clinical trial included medical students (undergraduate years 0-2) from 4 institutions in Canada during a single simulation training at McGill Neurosurgical Simulation and Artificial Intelligence Learning Centre, Montreal, Canada. Cross-sectional data were collected from January to April 2021. Analysis was conducted based on intention-to-treat. Data were analyzed from April to June 2021.
The interventions included 5 feedback sessions, 5 minutes each, during a single 75-minute training, including 5 practice sessions followed by 1 realistic virtual reality brain tumor resection. The 3 intervention arms included 2 treatment groups, AI audiovisual metric-based feedback (VOA group) and synchronous verbal scripted debriefing and instruction from a remote expert (instructor group), and a control group that received no feedback.
The coprimary outcomes were change in procedural performance, quantified as Expertise Score by a validated assessment algorithm (Intelligent Continuous Expertise Monitoring System [ICEMS]; range, -1.00 to 1.00) for each practice resection, and learning and retention, measured from performance in realistic resections by ICEMS and blinded Objective Structured Assessment of Technical Skills (OSATS; range 1-7). Secondary outcomes included strength of emotions before, during, and after the intervention and cognitive load after intervention, measured in self-reports.
A total of 70 medical students (41 [59%] women and 29 [41%] men; mean [SD] age, 21.8 [2.3] years) from 4 institutions were randomized, including 23 students in the VOA group, 24 students in the instructor group, and 23 students in the control group. All participants were included in the final analysis. ICEMS assessed 350 practice resections, and ICEMS and OSATS evaluated 70 realistic resections. VOA significantly improved practice Expertise Scores by 0.66 (95% CI, 0.55 to 0.77) points compared with the instructor group and by 0.65 (95% CI, 0.54 to 0.77) points compared with the control group (P < .001). Realistic Expertise Scores were significantly higher for the VOA group compared with instructor (mean difference, 0.53 [95% CI, 0.40 to 0.67] points; P < .001) and control (mean difference. 0.49 [95% CI, 0.34 to 0.61] points; P < .001) groups. Mean global OSATS ratings were not statistically significant among the VOA (4.63 [95% CI, 4.06 to 5.20] points), instructor (4.40 [95% CI, 3.88-4.91] points), and control (3.86 [95% CI, 3.44 to 4.27] points) groups. However, on the OSATS subscores, VOA significantly enhanced the mean OSATS overall subscore compared with the control group (mean difference, 1.04 [95% CI, 0.13 to 1.96] points; P = .02), whereas expert instruction significantly improved OSATS subscores for instrument handling vs control (mean difference, 1.18 [95% CI, 0.22 to 2.14]; P = .01). No significant differences in cognitive load, positive activating, and negative emotions were found.
In this randomized clinical trial, VOA feedback demonstrated superior performance outcome and skill transfer, with equivalent OSATS ratings and cognitive and emotional responses compared with remote expert instruction, indicating advantages for its use in simulation training.
ClinicalTrials.gov Identifier: NCT04700384.
重要性:为了更好地理解人工智能(AI)在外科培训中的新兴作用,必须测试和比较 AI 辅导系统(如虚拟手术助手(VOA))的效果,以了解其与传统方法的差异。
目的:确定在手术模拟训练中,VOA 和远程专家指导在学习者技能获取、情感和认知结果方面的比较。
设计、地点和参与者:这是一项由导师主导的随机临床试验,参与者来自加拿大 4 所机构的医学生(本科 0-2 年级),在加拿大麦吉尔神经外科模拟和人工智能学习中心进行了单次模拟培训。参与者于 2021 年 1 月至 4 月期间进行了横断面数据收集。基于意向治疗进行分析。数据分析于 2021 年 4 月至 6 月进行。
干预措施:干预措施包括在单次 75 分钟的培训中进行 5 次反馈,每次 5 分钟,其中包括 5 次练习和 1 次现实的虚拟现实脑瘤切除术。3 个干预组包括 2 个治疗组,即基于 AI 视听指标的反馈(VOA 组)和来自远程专家的同步口头脚本式讨论和指导(指导组),以及不接受反馈的对照组。
主要结果和测量:主要结果是通过经验证的评估算法(智能连续专家监测系统(ICEMS))评估的每个练习切除的程序性表现的变化,以专家评分表示(范围为-1.00 至 1.00),以及通过 ICEMS 和客观结构化手术技能评估(OSATS)评估的现实切除的学习和保留情况,OSATS 的范围为 1 至 7。次要结果包括干预前后的情绪强度以及干预后的认知负荷,通过自我报告进行评估。
结果:共有来自 4 所机构的 70 名医学生(41 名女性[59%]和 29 名男性[41%];平均[标准差]年龄为 21.8[2.3]岁)被随机分组,包括 23 名 VOA 组学生、24 名指导组学生和 23 名对照组学生。所有参与者均纳入最终分析。ICEMS 评估了 350 次练习切除,ICEMS 和 OSATS 评估了 70 次现实切除。与指导组相比,VOA 组的练习专家评分提高了 0.66(95%CI,0.55 至 0.77)分,与对照组相比提高了 0.65(95%CI,0.54 至 0.77)分(P<0.001)。与指导组(平均差异为 0.53 [95%CI,0.40 至 0.67]分;P<0.001)和对照组(平均差异为 0.49 [95%CI,0.34 至 0.61]分;P<0.001)相比,VOA 组的现实专家评分显著更高。VOA 组的全球 OSATS 评分平均值与指导组(4.63 [95%CI,4.06 至 5.20]分)和对照组(3.86 [95%CI,3.44 至 4.27]分)相比无统计学意义。然而,在 OSATS 子评分方面,VOA 组与对照组相比,OSATS 总分子评分显著提高(平均差异为 1.04 [95%CI,0.13 至 1.96]分;P=0.02),而专家指导组与对照组相比,仪器处理子评分显著提高(平均差异为 1.18 [95%CI,0.22 至 2.14]分;P=0.01)。未发现认知负荷、积极激活和消极情绪有显著差异。
结论和相关性:在这项随机临床试验中,与远程专家指导相比,VOA 反馈在表现结果和技能转移方面表现出更好的效果,同时具有等效的 OSATS 评分和认知及情感反应,这表明其在模拟培训中的应用具有优势。
试验注册:ClinicalTrials.gov 标识符:NCT04700384。