Zhang Hao, Xu Ruixiang, Guo Xiang, Zhou Dan, Xu Tongshuai, Zhong Xin, Kong Meng, Zhang Zhimin, Wang Yan, Ma Xuexiao
Department of Spinal Surgery, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China.
Department of Pain, YanTai YuHuangDing Hospital, Yantai, Shandong, China.
Front Bioeng Biotechnol. 2024 May 14;12:1397003. doi: 10.3389/fbioe.2024.1397003. eCollection 2024.
Digital radiography (DR) is a common and widely available examination. However, spinal DR cannot detect bone marrow edema, therefore, determining vertebral compression fractures (VCFs), especially fresh VCFs, remains challenging for clinicians.
We trained, validated, and externally tested the deep residual network (DRN) model that automated the detection and identification of fresh VCFs from spinal DR images. A total of 1,747 participants from five institutions were enrolled in this study and divided into the training cohort, validation cohort and external test cohorts (YHDH and BMUH cohorts). We evaluated the performance of DRN model based on the area under the receiver operating characteristic curve (AUC), feature attention maps, sensitivity, specificity, and accuracy. We compared it with five other deep learning models and validated and tested the model internally and externally and explored whether it remains highly accurate for an external test cohort. In addition, the influence of old VCFs on the performance of the DRN model was assessed.
The AUC was 0.99, 0.89, and 0.88 in the validation, YHDH, and BMUH cohorts, respectively, for the DRN model for detecting and discriminating fresh VCFs. The accuracies were 81.45% and 72.90%, sensitivities were 84.75% and 91.43%, and specificities were 80.25% and 63.89% in the YHDH and BMUH cohorts, respectively. The DRN model generated correct activation on the fresh VCFs and accurate peak responses on the area of the target vertebral body parts and demonstrated better feature representation learning and classification performance. The AUC was 0.90 (95% confidence interval [CI] 0.84-0.95) and 0.84 (95% CI 0.72-0.93) in the non-old VCFs and old VCFs groups, respectively, in the YHDH cohort ( = 0.067). The AUC was 0.89 (95% CI 0.84-0.94) and 0.85 (95% CI 0.72-0.95) in the non-old VCFs and old VCFs groups, respectively, in the BMUH cohort ( = 0.051).
In present study, we developed the DRN model for automated diagnosis and identification of fresh VCFs from spinal DR images. The DRN model can provide interpretable attention maps to support the excellent prediction results, which is the key that most clinicians care about when using the model to assist decision-making.
数字X线摄影(DR)是一种常见且广泛应用的检查方法。然而,脊柱DR无法检测骨髓水肿,因此,对于临床医生而言,确定椎体压缩骨折(VCF),尤其是新鲜VCF,仍然具有挑战性。
我们训练、验证并对外测试了深度残差网络(DRN)模型,该模型可自动从脊柱DR图像中检测和识别新鲜VCF。本研究共纳入了来自五个机构的1747名参与者,并将其分为训练队列、验证队列和外部测试队列(YHDH队列和BMUH队列)。我们基于受试者操作特征曲线(AUC)下面积、特征注意力图、敏感性、特异性和准确性评估了DRN模型的性能。我们将其与其他五个深度学习模型进行了比较,并在内部和外部对该模型进行了验证和测试,探讨其在外部测试队列中是否仍具有高度准确性。此外,评估了陈旧性VCF对DRN模型性能的影响。
对于检测和区分新鲜VCF的DRN模型,在验证队列、YHDH队列和BMUH队列中的AUC分别为0.99、0.89和0.88。YHDH队列和BMUH队列中的准确率分别为81.45%和72.90%,敏感性分别为84.75%和91.43%,特异性分别为80.25%和63.89%。DRN模型在新鲜VCF上产生了正确的激活,并在目标椎体部位区域产生了准确的峰值响应,表现出更好的特征表示学习和分类性能。在YHDH队列中,非陈旧性VCF组和陈旧性VCF组的AUC分别为0.90(95%置信区间[CI]0.84 - 0.95)和0.84(95%CI 0.72 - 0.93)(P = 0.067)。在BMUH队列中,非陈旧性VCF组和陈旧性VCF组的AUC分别为0.89(95%CI 0.84 - 0.94)和0.85(95%CI 0.72 - 0.95)(P = 0.051)。
在本研究中,我们开发了用于从脊柱DR图像中自动诊断和识别新鲜VCF的DRN模型。DRN模型可以提供可解释的注意力图以支持出色预测结果,这是大多数临床医生在使用该模型辅助决策时所关心的关键。