Liu Wei, Wu Yu, Yuan Xianglei, Zhang Jingyu, Zhou Yao, Zhang Wanhong, Zhu Peipei, Tao Zhang, He Long, Hu Bing, Yi Zhang
Department of Gastroenterology, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, Sichuan, China.
Endoscopy. 2022 Oct;54(10):972-979. doi: 10.1055/a-1799-8297. Epub 2022 Apr 7.
This study aimed to develop an artificial intelligence (AI)-based system for measuring fold examination quality (FEQ) of colonoscopic withdrawal technique. We also examined the relationship between the system's evaluation of FEQ and FEQ scores from experts, and adenoma detection rate (ADR) and withdrawal time of colonoscopists, and evaluated the system's ability to improve FEQ during colonoscopy.
First, we developed an AI-based system for measuring FEQ. Next, 103 consecutive colonoscopies performed by 11 colonoscopists were collected for evaluation. Three experts graded FEQ of each colonoscopy, after which the recorded colonoscopies were evaluated by the system. We further assessed the system by correlating its evaluation of FEQ against expert scoring, historical ADR, and withdrawal time of each colonoscopist. We also conducted a prospective observational study to evaluate the system's performance in enhancing fold examination.
The system's evaluations of FEQ of each endoscopist were significantly correlated with experts' scores (r = 0.871, < 0.001), historical ADR (r = 0.852, = 0.001), and withdrawal time (r = 0.727, = 0.01). For colonoscopies performed by colonoscopists with previously low ADRs (< 25 %), AI assistance significantly improved the FEQ, evaluated by both the AI system (0.29 [interquartile range (IQR) 0.27-0.30] vs. 0.23 [0.17-0.26]) and experts (14.00 [14.00-15.00] vs. 11.67 [10.00-13.33]) (both < 0.001).
The system's evaluation of FEQ was strongly correlated with FEQ scores from experts, historical ADR, and withdrawal time of each colonoscopist. The system has the potential to enhance FEQ.
本研究旨在开发一种基于人工智能(AI)的系统,用于评估结肠镜检查退镜技术的褶皱检查质量(FEQ)。我们还研究了该系统对FEQ的评估与专家给出的FEQ评分、腺瘤检出率(ADR)以及结肠镜检查医师的退镜时间之间的关系,并评估了该系统在结肠镜检查过程中改善FEQ的能力。
首先,我们开发了一种基于AI的系统来测量FEQ。接下来,收集了11位结肠镜检查医师连续进行的103例结肠镜检查用于评估。三位专家对每例结肠镜检查的FEQ进行评分,然后由该系统对记录的结肠镜检查进行评估。我们通过将系统对FEQ的评估与专家评分、历史ADR以及每位结肠镜检查医师的退镜时间进行关联,进一步评估该系统。我们还进行了一项前瞻性观察研究,以评估该系统在增强褶皱检查方面的性能。
该系统对每位内镜医师的FEQ评估与专家评分(r = 0.871,P < 0.001)、历史ADR(r = 0.852,P = 0.001)和退镜时间(r = 0.727,P = 0.01)显著相关。对于先前ADR较低(< 25%)的结肠镜检查医师所进行的结肠镜检查,AI辅助显著改善了FEQ,无论是通过AI系统评估(0.29 [四分位间距(IQR)0.27 - 0.30] 对比 0.23 [0.17 - 0.26])还是专家评估(14.00 [14.00 - 15.00] 对比 11.67 [10.00 - 13.33])(均P < 0.001)。
该系统对FEQ的评估与专家的FEQ评分、历史ADR以及每位结肠镜检查医师的退镜时间密切相关。该系统具有提高FEQ的潜力。