Wittbrodt Matthew, Klug Matthew, Etemadi Mozziyar, Yang Anthony, Pandolfino John E, Keswani Rajesh N
Information Services, Northwestern Medicine, Chicago, United States.
Anesthesiology, Northwestern University Feinberg School of Medicine, Chicago, United States.
Endosc Int Open. 2024 Jul 3;12(7):E849-E853. doi: 10.1055/a-2333-8138. eCollection 2024 Jul.
Low-quality colonoscopy increases cancer risk but measuring quality remains challenging. We developed an automated, interactive assessment of colonoscopy quality (AI-CQ) using machine learning (ML). Based on quality guidelines, metrics selected for AI development included insertion time (IT), withdrawal time (WT), polyp detection rate (PDR), and polyps per colonoscopy (PPC). Two novel metrics were also developed: HQ-WT (time during withdrawal with clear image) and WT-PT (withdrawal time subtracting polypectomy time). The model was pre-trained using a self-supervised vision transformer on unlabeled colonoscopy images and then finetuned for multi-label classification on another mutually exclusive colonoscopy image dataset. A timeline of video predictions and metric calculations were presented to clinicians in addition to the raw video using a web-based application. The model was externally validated using 50 colonoscopies at a second hospital. The AI-CQ accuracy to identify cecal intubation was 88%. IT ( = 0.99) and WT ( = 0.99) were highly correlated between manual and AI-CQ measurements with a median difference of 1.5 seconds and 4.5 seconds, respectively. AI-CQ PDR did not significantly differ from manual PDR (47.6% versus 45.5%, = 0.66). Retroflexion was correctly identified in 95.2% and number of right colon evaluations in 100% of colonoscopies. HQ-WT was 45.9% of, and significantly correlated with ( = 0.85) WT time. An interactive AI assessment of colonoscopy skill can automatically assess quality. We propose that this tool can be utilized to rapidly identify and train providers in need of remediation.
低质量的结肠镜检查会增加癌症风险,但衡量其质量仍具有挑战性。我们利用机器学习(ML)开发了一种结肠镜检查质量的自动化交互式评估方法(AI-CQ)。根据质量指南,为AI开发选择的指标包括插入时间(IT)、退镜时间(WT)、息肉检出率(PDR)和每次结肠镜检查的息肉数(PPC)。还开发了两个新指标:HQ-WT(清晰图像下的退镜时间)和WT-PT(退镜时间减去息肉切除时间)。该模型首先使用自监督视觉变换器在未标记的结肠镜检查图像上进行预训练,然后在另一个相互排斥的结肠镜检查图像数据集上进行多标签分类微调。除了原始视频外,还使用基于网络的应用程序向临床医生展示视频预测和指标计算的时间线。该模型在第二家医院使用50例结肠镜检查进行了外部验证。AI-CQ识别盲肠插管的准确率为88%。手动测量与AI-CQ测量之间的IT(r = 0.99)和WT(r = 0.99)高度相关,中位数差异分别为1.5秒和4.5秒。AI-CQ的PDR与手动PDR无显著差异(47.6%对45.5%,P = 0.66)。在95.2%的结肠镜检查中正确识别了反转,在100%的结肠镜检查中正确识别了右半结肠评估次数。HQ-WT占WT时间的45.9%,且与WT时间显著相关(r = 0.85)。一种结肠镜检查技能的交互式AI评估可以自动评估质量。我们建议可以利用这个工具快速识别并培训需要改进的医护人员。