Liu Xinning, Li Fei, Xu Jie, Ma Jinting, Duan Xiaoyu, Mao Ren, Chen Minhu, Chen Zhihui, Huang Yan, Jiang Jingyi, Huang Bingsheng, Ye Ziyin
Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, No. 58 Zhongshan Road 2, Guangzhou, 510080, Guangdong, People's Republic of China.
Department of Pathology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, Guangdong, People's Republic of China.
Virchows Arch. 2024 Jun;484(6):965-976. doi: 10.1007/s00428-024-03740-9. Epub 2024 Feb 8.
Crohn's disease (CD) and intestinal tuberculosis (ITB) share similar histopathological characteristics, and differential diagnosis can be a dilemma for pathologists. This study aimed to apply deep learning (DL) to analyze whole slide images (WSI) of surgical resection specimens to distinguish CD from ITB. Overall, 1973 WSI from 85 cases from 3 centers were obtained. The DL model was established in internal training and validated in external test cohort, evaluated by area under receiver operator characteristic curve (AUC). Diagnostic results of pathologists were compared with those of the DL model using DeLong's test. DL model had case level AUC of 0.886, 0.893 and slide level AUC of 0.954, 0.827 in training and test cohorts. Attention maps highlighted discriminative areas and top 10 features were extracted from CD and ITB. DL model's diagnostic efficiency (AUC = 0.886) was better than junior pathologists (*1 AUC = 0.701, P = 0.088; *2 AUC = 0.861, P = 0.788) and inferior to senior GI pathologists (*3 AUC = 0.910, P = 0.800; *4 AUC = 0.946, P = 0.507) in training cohort. In the test cohort, model (AUC = 0.893) outperformed senior non-GI pathologists (*5 AUC = 0.782, P = 0.327; *6 AUC = 0.821, P = 0.516). We developed a DL model for the classification of CD and ITB, improving pathological diagnosis accuracy effectively.
克罗恩病(CD)和肠结核(ITB)具有相似的组织病理学特征,鉴别诊断对病理学家来说可能是个难题。本研究旨在应用深度学习(DL)分析手术切除标本的全切片图像(WSI),以区分CD和ITB。总体而言,从3个中心的85例病例中获取了1973张WSI。DL模型在内部训练中建立,并在外部测试队列中进行验证,通过受试者操作特征曲线下面积(AUC)进行评估。使用德龙检验将病理学家的诊断结果与DL模型的结果进行比较。DL模型在训练和测试队列中的病例水平AUC分别为0.886、0.893,玻片水平AUC分别为0.954、0.827。注意力图突出了鉴别区域,并从CD和ITB中提取了前10个特征。在训练队列中,DL模型的诊断效率(AUC = 0.886)优于初级病理学家(*1 AUC = 0.701,P = 0.088;*2 AUC = 0.861,P = 0.788),但低于高级胃肠病理学家(*3 AUC = 0.910,P = 0.800;*4 AUC = 0.946,P = 0.507)。在测试队列中,模型(AUC = 0.893)的表现优于高级非胃肠病理学家(*5 AUC = 0.782,P = 0.327;*6 AUC = 0.821,P = 0.516)。我们开发了一种用于CD和ITB分类的DL模型,有效提高了病理诊断的准确性。