Department of Radiology, Clinical Medical College of Shanghai Tenth People's Hospital of Nanjing Medical University, 301 Middle Yanchang Road, Shanghai, 200072, P.R. China.
Department of Radiology, Sir RunRun Hospital affiliated to Nanjing Medical University, 109 Longmian Road, Nanjing, Jiangsu, 211002, P.R. China.
BMC Musculoskelet Disord. 2023 Mar 6;24(1):165. doi: 10.1186/s12891-023-06281-5.
We evaluated the diagnostic efficacy of deep learning radiomics (DLR) and hand-crafted radiomics (HCR) features in differentiating acute and chronic vertebral compression fractures (VCFs).
A total of 365 patients with VCFs were retrospectively analysed based on their computed tomography (CT) scan data. All patients completed MRI examination within 2 weeks. There were 315 acute VCFs and 205 chronic VCFs. Deep transfer learning (DTL) features and HCR features were extracted from CT images of patients with VCFs using DLR and traditional radiomics, respectively, and feature fusion was performed to establish the least absolute shrinkage and selection operator. The MRI display of vertebral bone marrow oedema was used as the gold standard for acute VCF, and the model performance was evaluated using the receiver operating characteristic (ROC).To separately evaluate the effectiveness of DLR, traditional radiomics and feature fusion in the differential diagnosis of acute and chronic VCFs, we constructed a nomogram based on the clinical baseline data to visualize the classification evaluation. The predictive power of each model was compared using the Delong test, and the clinical value of the nomogram was evaluated using decision curve analysis (DCA).
Fifty DTL features were obtained from DLR, 41 HCR features were obtained from traditional radiomics, and 77 features fusion were obtained after feature screening and fusion of the two. The area under the curve (AUC) of the DLR model in the training cohort and test cohort were 0.992 (95% confidence interval (CI), 0.983-0.999) and 0.871 (95% CI, 0.805-0.938), respectively. While the AUCs of the conventional radiomics model in the training cohort and test cohort were 0.973 (95% CI, 0.955-0.990) and 0.854 (95% CI, 0.773-0.934), respectively. The AUCs of the features fusion model in the training cohort and test cohort were 0.997 (95% CI, 0.994-0.999) and 0.915 (95% CI, 0.855-0.974), respectively. The AUCs of nomogram constructed by the features fusion in combination with clinical baseline data were 0.998 (95% CI, 0.996-0.999) and 0.946 (95% CI, 0.906-0.987) in the training cohort and test cohort, respectively. The Delong test showed that the differences between the features fusion model and the nomogram in the training cohort and the test cohort were not statistically significant (P values were 0.794 and 0.668, respectively), and the differences in the other prediction models in the training cohort and the test cohort were statistically significant (P < 0.05). DCA showed that the nomogram had high clinical value.
The features fusion model can be used for the differential diagnosis of acute and chronic VCFs, and its differential diagnosis ability is improved when compared with that when either radiomics is used alone. At the same time, the nomogram has a high predictive value for acute and chronic VCFs and can be a potential decision-making tool to assist clinicians, especially when a patient is unable to undergo spinal MRI examination.
我们评估了深度学习放射组学(DLR)和手工放射组学(HCR)特征在区分急性和慢性椎体压缩性骨折(VCF)中的诊断效能。
回顾性分析了 365 例 VCF 患者的 CT 扫描数据。所有患者在 2 周内完成 MRI 检查。其中急性 VCF 315 例,慢性 VCF 205 例。使用 DLR 和传统放射组学从 VCF 患者的 CT 图像中提取深度转移学习(DTL)特征和 HCR 特征,并进行特征融合以建立最小绝对收缩和选择算子。将椎体骨髓水肿的 MRI 显示作为急性 VCF 的金标准,使用受试者工作特征(ROC)曲线评价模型性能。为了分别评估 DLR、传统放射组学和特征融合在急性和慢性 VCF 鉴别诊断中的有效性,我们根据临床基线数据构建了一个列线图,以可视化分类评估。使用 Delong 检验比较各模型的预测能力,使用决策曲线分析(DCA)评估列线图的临床价值。
从 DLR 中获得 50 个 DTL 特征,从传统放射组学中获得 41 个 HCR 特征,经过特征筛选和融合后获得 77 个特征融合。在训练队列和测试队列中,DLR 模型的曲线下面积(AUC)分别为 0.992(95%置信区间(CI),0.983-0.999)和 0.871(95% CI,0.805-0.938)。而常规放射组学模型在训练队列和测试队列中的 AUC 分别为 0.973(95% CI,0.955-0.990)和 0.854(95% CI,0.773-0.934)。特征融合模型在训练队列和测试队列中的 AUC 分别为 0.997(95% CI,0.994-0.999)和 0.915(95% CI,0.855-0.974)。融合特征与临床基线数据构建的列线图在训练队列和测试队列中的 AUC 分别为 0.998(95% CI,0.996-0.999)和 0.946(95% CI,0.906-0.987)。Delong 检验显示,在训练队列和测试队列中,特征融合模型与列线图的差异无统计学意义(P 值分别为 0.794 和 0.668),而其他预测模型在训练队列和测试队列中的差异有统计学意义(P<0.05)。DCA 显示列线图具有较高的临床价值。
特征融合模型可用于急性和慢性 VCF 的鉴别诊断,与单独使用放射组学相比,其鉴别诊断能力有所提高。同时,列线图对急性和慢性 VCF 具有较高的预测价值,可作为辅助临床医生决策的潜在工具,特别是当患者无法进行脊柱 MRI 检查时。