Ikenoyama Yohei, Hirasawa Toshiaki, Ishioka Mitsuaki, Namikawa Ken, Yoshimizu Shoichi, Horiuchi Yusuke, Ishiyama Akiyoshi, Yoshio Toshiyuki, Tsuchida Tomohiro, Takeuchi Yoshinori, Shichijo Satoki, Katayama Naoyuki, Fujisaki Junko, Tada Tomohiro
Department of Gastroenterology, Cancer Institute Hospital, Japanese Foundation for Cancer Research, Tokyo, Japan.
Department of Hematology and Oncology, Mie University Graduate School of Medicine, Mie, Japan.
Dig Endosc. 2021 Jan;33(1):141-150. doi: 10.1111/den.13688. Epub 2020 Jun 2.
Detecting early gastric cancer is difficult, and it may even be overlooked by experienced endoscopists. Recently, artificial intelligence based on deep learning through convolutional neural networks (CNNs) has enabled significant advancements in the field of gastroenterology. However, it remains unclear whether a CNN can outperform endoscopists. In this study, we evaluated whether the performance of a CNN in detecting early gastric cancer is better than that of endoscopists.
The CNN was constructed using 13,584 endoscopic images from 2639 lesions of gastric cancer. Subsequently, its diagnostic ability was compared to that of 67 endoscopists using an independent test dataset (2940 images from 140 cases).
The average diagnostic time for analyzing 2940 test endoscopic images by the CNN and endoscopists were 45.5 ± 1.8 s and 173.0 ± 66.0 min, respectively. The sensitivity, specificity, and positive and negative predictive values for the CNN were 58.4%, 87.3%, 26.0%, and 96.5%, respectively. These values for the 67 endoscopists were 31.9%, 97.2%, 46.2%, and 94.9%, respectively. The CNN had a significantly higher sensitivity than the endoscopists (by 26.5%; 95% confidence interval, 14.9-32.5%).
The CNN detected more early gastric cancer cases in a shorter time than the endoscopists. The CNN needs further training to achieve higher diagnostic accuracy. However, a diagnostic support tool for gastric cancer using a CNN will be realized in the near future.
早期胃癌的检测存在困难,甚至可能被经验丰富的内镜医师忽视。近年来,基于卷积神经网络(CNN)的深度学习人工智能在胃肠病学领域取得了显著进展。然而,CNN是否能超越内镜医师尚不清楚。在本研究中,我们评估了CNN在检测早期胃癌方面的表现是否优于内镜医师。
使用来自2639例胃癌病变的13584张内镜图像构建CNN。随后,使用独立测试数据集(来自140例患者的2940张图像)将其诊断能力与67位内镜医师的诊断能力进行比较。
CNN和内镜医师分析2940张测试内镜图像的平均诊断时间分别为45.5±1.8秒和173.0±66.0分钟。CNN的灵敏度、特异度、阳性预测值和阴性预测值分别为58.4%、87.3%、26.0%和96.5%。67位内镜医师的这些值分别为31.9%、97.2%、46.2%和94.9%。CNN的灵敏度显著高于内镜医师(高26.5%;95%置信区间,14.9 - 32.5%)。
与内镜医师相比,CNN在更短的时间内检测到更多早期胃癌病例。CNN需要进一步训练以实现更高的诊断准确性。然而,在不久的将来,使用CNN的胃癌诊断支持工具将会实现。