Fan Yanyun, Mu Ruochen, Xu Hongzhi, Xie Chenxi, Zhang Yinghao, Liu Lupeng, Wang Lin, Shi Huaxiu, Hu Yiqun, Ren Jianlin, Qin Jing, Wang Liansheng, Cai Shuntian
Department of Gastroenterology, Zhongshan Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, China; Department of Digestive Diseases, School of Medicine, Xiamen University, Xiamen, China.
Department of Computer Science, Xiamen University, Xiamen, China.
Gastrointest Endosc. 2023 Feb;97(2):335-346. doi: 10.1016/j.gie.2022.08.015. Epub 2022 Aug 17.
Endoscopy is increasingly performed for evaluating patients with ulcerative colitis (UC). However, its diagnostic accuracy is largely affected by the subjectivity of endoscopists' experience and scoring methods, and scoring of selected endoscopic images cannot reflect the inflammation of the entire intestine. We aimed to develop an automatic scoring system using deep-learning technology for consistent and objective scoring of endoscopic images and full-length endoscopic videos of patients with UC.
We collected 5875 endoscopic images and 20 full-length videos from 332 patients with UC who underwent colonoscopy between January 2017 and March 2021. We trained the artificial intelligence (AI) scoring system using these images, which was then used for full-length video scoring. To more accurately assess and visualize the full-length intestinal inflammation, we divided the large intestine into a fixed number of "areas" (cecum, 20; transverse colon, 20; descending colon, 20; sigmoid colon, 15; rectum, 10). The scoring system automatically scored inflammatory severity of 85 areas from every video and generated a visualized result of full-length intestinal inflammatory activity.
Compared with endoscopist scoring, the trained convolutional neural network achieved 86.54% accuracy in the Mayo-scored task, whereas the kappa coefficient was .813 (95% confidence interval [CI], .782-.844). The metrics of the Ulcerative Colitis Endoscopic Index of Severity-scored task were encouraging, with accuracies of 90.7%, 84.6%, and 77.7% and kappa coefficients of .822 (95% CI, .788-.855), .784 (95% CI, .744-.823), and .702 (95% CI, .612-.793) for vascular pattern, erosions and ulcers, and bleeding, respectively. The AI scoring system predicted each bowel segment's score and displayed distribution of inflammatory activity in the entire large intestine using a 2-dimensional colorized image.
We established a novel deep learning-based scoring system to evaluate endoscopic images from patients with UC, which can also accurately describe the severity and distribution of inflammatory activity through full-length intestinal endoscopic videos.
越来越多地通过内镜检查来评估溃疡性结肠炎(UC)患者。然而,其诊断准确性在很大程度上受内镜医师经验和评分方法主观性的影响,且所选内镜图像的评分无法反映整个肠道的炎症情况。我们旨在利用深度学习技术开发一种自动评分系统,以便对UC患者的内镜图像和全长内镜视频进行一致且客观的评分。
我们收集了2017年1月至2021年3月间接受结肠镜检查的332例UC患者的5875张内镜图像和20段全长视频。我们使用这些图像训练人工智能(AI)评分系统,然后将其用于全长视频评分。为了更准确地评估和可视化全长肠道炎症,我们将大肠划分为固定数量的“区域”(盲肠,20个;横结肠,20个;降结肠,20个;乙状结肠,15个;直肠,10个)。该评分系统自动对每个视频中的85个区域的炎症严重程度进行评分,并生成全长肠道炎症活动的可视化结果。
与内镜医师评分相比,经过训练的卷积神经网络在梅奥评分任务中的准确率达到86.54%,而kappa系数为0.813(95%置信区间[CI],0.782 - 0.844)。溃疡性结肠炎内镜严重程度指数评分任务的指标令人鼓舞,血管形态、糜烂和溃疡以及出血的准确率分别为90.7%、84.6%和77.7%,kappa系数分别为0.822(95% CI,0.788 - 0.855)、0.784(95% CI,0.744 - 0.823)和0.702(95% CI,0.612 - 0.793)。AI评分系统预测每个肠段的评分,并使用二维彩色图像显示整个大肠内炎症活动的分布情况。
我们建立了一种基于深度学习的新型评分系统,用于评估UC患者的内镜图像,该系统还可通过全长肠道内镜视频准确描述炎症活动的严重程度和分布情况。