Department of Medicine III, University Hospital RWTH Aachen, Pauwelsstrasse 30, 52074, Aachen, Germany.
Department of Medicine, Rhein-Maas-Klinikum, Würselen, Germany.
Sci Rep. 2022 Mar 22;12(1):4829. doi: 10.1038/s41598-022-08773-1.
Artificial intelligence (AI) is widely used to analyze gastrointestinal (GI) endoscopy image data. AI has led to several clinically approved algorithms for polyp detection, but application of AI beyond this specific task is limited by the high cost of manual annotations. Here, we show that a weakly supervised AI can be trained on data from a clinical routine database to learn visual patterns of GI diseases without any manual labeling or annotation. We trained a deep neural network on a dataset of N = 29,506 gastroscopy and N = 18,942 colonoscopy examinations from a large endoscopy unit serving patients in Germany, the Netherlands and Belgium, using only routine diagnosis data for the 42 most common diseases. Despite a high data heterogeneity, the AI system reached a high performance for diagnosis of multiple diseases, including inflammatory, degenerative, infectious and neoplastic diseases. Specifically, a cross-validated area under the receiver operating curve (AUROC) of above 0.70 was reached for 13 diseases, and an AUROC of above 0.80 was reached for two diseases in the primary data set. In an external validation set including six disease categories, the AI system was able to significantly predict the presence of diverticulosis, candidiasis, colon and rectal cancer with AUROCs above 0.76. Reverse engineering the predictions demonstrated that plausible patterns were learned on the level of images and within images and potential confounders were identified. In summary, our study demonstrates the potential of weakly supervised AI to generate high-performing classifiers and identify clinically relevant visual patterns based on non-annotated routine image data in GI endoscopy and potentially other clinical imaging modalities.
人工智能(AI)广泛用于分析胃肠道(GI)内窥镜图像数据。AI 已经导致了几种临床批准的息肉检测算法,但由于手动注释成本高,AI 在这一特定任务之外的应用受到限制。在这里,我们展示了一种弱监督 AI 可以在来自临床常规数据库的数据上进行训练,以学习 GI 疾病的视觉模式,而无需任何手动标记或注释。我们在一个大型内窥镜单位的数据集上训练了一个深度神经网络,该数据集包括来自德国、荷兰和比利时的患者的 N = 29506 次胃镜检查和 N = 18942 次结肠镜检查,仅使用常规诊断数据对 42 种最常见的疾病进行了诊断。尽管数据异质性很高,但 AI 系统在多种疾病的诊断中达到了很高的性能,包括炎症、退行性、感染和肿瘤性疾病。具体来说,对于 13 种疾病,交叉验证后的接收者操作特征曲线下面积(AUROC)超过 0.70,对于原发性数据集的两种疾病,AUROC 超过 0.80。在包括六个疾病类别的外部验证集中,AI 系统能够显著预测憩室病、假丝酵母菌病、结肠癌和直肠癌的存在,AUROC 超过 0.76。反向工程预测结果表明,在图像和图像内层面上学习了合理的模式,并确定了潜在的混杂因素。总之,我们的研究表明,弱监督 AI 具有生成高性能分类器的潜力,并能够根据 GI 内窥镜和潜在的其他临床成像方式的非注释常规图像数据识别临床相关的视觉模式。