Department of Surgery, Stanford University, Stanford, CA.
Ann Surg. 2020 Sep 1;272(3):523-528. doi: 10.1097/SLA.0000000000004207.
Artificial intelligence (AI) has numerous applications in surgical quality assurance. We assessed AI accuracy in evaluating the critical view of safety (CVS) and intraoperative events during laparoscopic cholecystectomy. We hypothesized that AI accuracy and intraoperative events are associated with disease severity.
One thousand fifty-one laparoscopic cholecystectomy videos were annotated by AI for disease severity (Parkland Scale), CVS achievement (Strasberg Criteria), and intraoperative events. Surgeons performed focused video review on procedures with ≥1 intraoperative events (n = 335). AI versus surgeon annotation of CVS components and intraoperative events were compared. For all cases (n = 1051), intraoperative-event association with CVS achievement and severity was examined using ordinal logistic regression.
Using AI annotation, surgeons reviewed 50 videos/hr. CVS was achieved in ≤10% of cases. Hepatocystic triangle and cystic plate visualization was achieved more often in low-severity cases (P < 0.03). AI-surgeon agreement for all CVS components exceeded 75%, with higher agreement in high-severity cases (P < 0.03). Surgeons agreed with 99% of AI-annotated intraoperative events. AI-annotated intraoperative events were associated with both disease severity and number of CVS components not achieved. Intraoperative events occurred more frequently in high-severity versus low-severity cases (0.98 vs 0.40 events/case, P < 0.001).
AI annotation allows for efficient video review and is a promising quality assurance tool. Disease severity may limit its use and surgeon oversight is still required, especially in complex cases. Continued refinement may improve AI applicability and allow for automated assessment.
人工智能(AI)在手术质量保证方面有多种应用。我们评估了 AI 在评估腹腔镜胆囊切除术时的关键安全视野(CVS)和术中事件的准确性。我们假设 AI 准确性和术中事件与疾病严重程度相关。
对 1051 个腹腔镜胆囊切除术视频进行 AI 注释,用于疾病严重程度(Parkland 量表)、CVS 实现(Strasberg 标准)和术中事件。外科医生对有≥1 个术中事件的手术进行重点视频复查(n = 335)。比较 AI 与外科医生对 CVS 成分和术中事件的注释。对于所有病例(n = 1051),使用有序逻辑回归检查术中事件与 CVS 实现和严重程度的关系。
使用 AI 注释,外科医生可以在 50 分钟内审查 1 个视频。CVS 的实现率在≤10%的病例中。在低严重程度的病例中,肝胆囊三角和胆囊板的可视化更常见(P < 0.03)。所有 CVS 成分的 AI-外科医生一致性均超过 75%,在严重程度高的病例中更高(P < 0.03)。外科医生同意 99%的 AI 注释的术中事件。AI 注释的术中事件与疾病严重程度和未实现的 CVS 成分数量均相关。在高严重程度与低严重程度的病例中,术中事件更常见(0.98 与 0.40 事件/病例,P < 0.001)。
AI 注释允许进行高效的视频审查,是一种有前途的质量保证工具。疾病严重程度可能会限制其使用,仍需要外科医生监督,尤其是在复杂病例中。持续改进可能会提高 AI 的适用性,并允许进行自动评估。