Gupta Pankaj, Siddiqui Ruby, Singh Shravya, Pradhan Nikita, Shah Jimil, Samanta Jayanta, Jearth Vaneet, Singh Anupam, Mandavdhare Harshal, Sharma Vishal, Mukund Amar, Birda Chhagan Lal, Kumar Ishan, Kumar Niraj, Patidar Yashwant, Agarwal Ashish, Yadav Taruna, Sureka Binit, Tiwari Anurag, Verma Ashish, Kumar Ashish, Sinha Saroj K, Dutta Usha
Department of Radiodiagnosis and Imaging, Postgraduate Institute of Medical Education and Research, Chandigarh, India.
Department of Gastroenterology, Postgraduate Institute of Medical Education and Research, Chandigarh, India.
Abdom Radiol (NY). 2025 May;50(5):2258-2267. doi: 10.1007/s00261-024-04607-y. Epub 2024 Sep 30.
To apply CT-based deep learning (DL) models for accurate solid debris-based classification of pancreatic fluid collections (PFC) in acute pancreatitis (AP).
This retrospective study comprised four tertiary care hospitals. Consecutive patients with AP and PFCs who had computed tomography (CT) prior to drainage were screened. Those who had magnetic resonance imaging (MRI) or endoscopic ultrasound (EUS) within 20 days of CT were considered for inclusion. Axial CT images were utilized for model training. Images were labelled as those with≤30% solid debris and >30% solid debris based on MRI or EUS. Single center data was used for model training and validation. Data from other three centers comprised the held out external test cohort. We experimented with ResNet 50, Vision transformer (ViT), and MedViT architectures.
Overall, we recruited 152 patients (129 training/validation and 23 testing). There were 1334, 334 and 512 images in the training, validation, and test cohorts, respectively. In the overall training and validation cohorts, ViT and MedVit models had high diagnostic performance (sensitivity 92.4-98.7%, specificity 89.7-98.4%, and AUC 0.908-0.980). The sensitivity (85.3-98.6%), specificity (69.4-99.4%), and AUC (0.779-0.984) of all the models was high in all the subgroups in the training and validation cohorts. In the overall external test cohort, MedViT had the best diagnostic performance (sensitivity 75.2%, specificity 75.3%, and AUC 0.753). MedVit had sensitivity, specificity, and AUC of 75.2%, 74.3%, and 0.748, in walled off necrosis and 79%, 74.2%, 75.3%, and 0.767 for collections >5 cm.
DL-models have moderate diagnostic performance for solid-debris based classification of WON and collections greater than 5 cm on CT.
应用基于CT的深度学习(DL)模型对急性胰腺炎(AP)患者的胰腺液体积聚(PFC)进行基于实性成分的准确分类。
这项回顾性研究纳入了四家三级医疗机构。筛选出在引流前进行过计算机断层扫描(CT)的连续AP和PFC患者。在CT检查后20天内进行过磁共振成像(MRI)或内镜超声(EUS)检查的患者被纳入研究。轴向CT图像用于模型训练。根据MRI或EUS结果,将图像标记为实性成分≤30%和>30%的图像。单中心数据用于模型训练和验证。来自其他三个中心的数据组成外部测试队列。我们对ResNet 50、视觉Transformer(ViT)和医学ViT架构进行了实验。
总体而言,我们招募了152例患者(129例用于训练/验证,23例用于测试)。训练、验证和测试队列中的图像分别有1334、334和512张。在总体训练和验证队列中,ViT和MedVit模型具有较高的诊断性能(敏感性92.4 - 98.7%,特异性89.7 - 98.4%,AUC 0.908 - 0.980)。在训练和验证队列的所有亚组中,所有模型的敏感性(85.3 - 98.6%)、特异性(69.4 - 99.4%)和AUC(0.779 - 0.984)都很高。在总体外部测试队列中,MedViT具有最佳诊断性能(敏感性75.2%,特异性75.3%,AUC 0.753)。在包裹性坏死中,MedVit的敏感性、特异性和AUC分别为75.2%、74.3%和0.748,对于直径>5 cm的液体积聚,其敏感性、特异性、AUC分别为79%、74.2%、75.3%和0.767。
DL模型对基于实性成分的壁内坏死(WON)和CT上直径大于5 cm的液体积聚分类具有中等诊断性能。