Hallinan James Thomas Patrick Decourcy, Zhu Lei, Zhang Wenqiao, Lim Desmond Shi Wei, Baskar Sangeetha, Low Xi Zhen, Yeong Kuan Yuen, Teo Ee Chin, Kumarakulasinghe Nesaretnam Barr, Yap Qai Ven, Chan Yiong Huak, Lin Shuxun, Tan Jiong Hao, Kumar Naresh, Vellayappan Balamurugan A, Ooi Beng Chin, Quek Swee Tian, Makmur Andrew
Department of Diagnostic Imaging, National University Hospital, Singapore, Singapore.
Department of Diagnostic Radiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
Front Oncol. 2022 May 4;12:849447. doi: 10.3389/fonc.2022.849447. eCollection 2022.
Metastatic epidural spinal cord compression (MESCC) is a devastating complication of advanced cancer. A deep learning (DL) model for automated MESCC classification on MRI could aid earlier diagnosis and referral.
To develop a DL model for automated classification of MESCC on MRI.
Patients with known MESCC diagnosed on MRI between September 2007 and September 2017 were eligible. MRI studies with instrumentation, suboptimal image quality, and non-thoracic regions were excluded. Axial T2-weighted images were utilized. The internal dataset split was 82% and 18% for training/validation and test sets, respectively. External testing was also performed. Internal training/validation data were labeled using the Bilsky MESCC classification by a musculoskeletal radiologist (10-year experience) and a neuroradiologist (5-year experience). These labels were used to train a DL model utilizing a prototypical convolutional neural network. Internal and external test sets were labeled by the musculoskeletal radiologist as the reference standard. For assessment of DL model performance and interobserver variability, test sets were labeled independently by the neuroradiologist (5-year experience), a spine surgeon (5-year experience), and a radiation oncologist (11-year experience). Inter-rater agreement (Gwet's kappa) and sensitivity/specificity were calculated.
Overall, 215 MRI spine studies were analyzed [164 patients, mean age = 62 ± 12(SD)] with 177 (82%) for training/validation and 38 (18%) for internal testing. For internal testing, the DL model and specialists all showed almost perfect agreement (kappas = 0.92-0.98, p < 0.001) for dichotomous Bilsky classification (low versus high grade) compared to the reference standard. Similar performance was seen for external testing on a set of 32 MRI spines with the DL model and specialists all showing almost perfect agreement (kappas = 0.94-0.95, p < 0.001) compared to the reference standard.
A DL model showed comparable agreement to a subspecialist radiologist and clinical specialists for the classification of malignant epidural spinal cord compression and could optimize earlier diagnosis and surgical referral.
转移性硬膜外脊髓压迫症(MESCC)是晚期癌症的一种毁灭性并发症。用于在MRI上自动进行MESCC分类的深度学习(DL)模型有助于早期诊断和转诊。
开发一种用于在MRI上自动对MESCC进行分类的DL模型。
纳入2007年9月至2017年9月期间经MRI诊断为已知MESCC的患者。排除有器械植入、图像质量欠佳及非胸部区域的MRI研究。使用轴位T2加权图像。内部数据集划分为训练/验证集和测试集,分别占82%和18%。还进行了外部测试。内部训练/验证数据由一位肌肉骨骼放射科医生(10年经验)和一位神经放射科医生(5年经验)使用Bilsky MESCC分类法进行标注。这些标签用于训练一个利用典型卷积神经网络的DL模型。内部和外部测试集由肌肉骨骼放射科医生作为参考标准进行标注。为评估DL模型性能和观察者间变异性,测试集由神经放射科医生(5年经验)、脊柱外科医生(5年经验)和放射肿瘤学家(11年经验)独立标注。计算评分者间一致性(Gwet卡帕)和敏感度/特异度。
总体而言,分析了215例MRI脊柱研究[164例患者,平均年龄=62±12(标准差)],其中177例(82%)用于训练/验证,38例(18%)用于内部测试。对于内部测试,与参考标准相比,DL模型和专家在二分法Bilsky分类(低级别与高级别)上均显示出几乎完美的一致性(卡帕值=0.92 - 0.98,p<0.001)。在一组32例MRI脊柱的外部测试中也观察到类似表现,与参考标准相比,DL模型和专家均显示出几乎完美的一致性(卡帕值=0.94 - 0.95,p<0.001)。
在恶性硬膜外脊髓压迫症的分类方面,DL模型与专科放射科医生和临床专家显示出相当的一致性,并且可以优化早期诊断和手术转诊。