Ghobrial George, Roth Christian
Clinic for Diagnostic and Interventional Radiology and Neuroradiology, Klinikum Bremerhaven Reinkenheide, Bremerhaven, Germany.
Clinic for Diagnostic and Interventional Neuroradiology, Klinikum Bremen-Mitte/Bremen-Ost, Bremen, Germany.
Front Radiol. 2025 Mar 25;5:1503625. doi: 10.3389/fradi.2025.1503625. eCollection 2025.
Lumbar spine magnetic resonance imaging (MRI) plays a critical role in diagnosing and planning treatment for spinal conditions such as degenerative disc disease, spinal canal stenosis, and disc herniation. Measuring the cross-sectional area of the dural sac (DSCA) is a key factor in evaluating the severity of spinal canal narrowing. Traditionally, radiologists perform this measurement manually, which is both time-consuming and susceptible to errors. Advances in deep learning, particularly convolutional neural networks (CNNs) like the U-Net architecture, have demonstrated significant potential in the analysis of medical images. This study evaluates the efficacy of deep learning models for automating DSCA measurements in lumbar spine MRIs to enhance diagnostic precision and alleviate the workload of radiologists.
For algorithm development and assessment, we utilized two extensive, anonymized online datasets: the "Lumbar Spine MRI Dataset" and the SPIDER-MRI dataset. The combined dataset comprised 683 lumbar spine MRI scans for training and testing, with an additional 50 scans reserved for external validation. We implemented and assessed three deep learning models-U-Net, Attention U-Net, and MultiResUNet-using 5-fold cross-validation. The models were trained on T1-weighted axial MRI images and evaluated on metrics such as accuracy, precision, recall, F1-score, and mean absolute error (MAE).
All models exhibited a high correlation between predicted and actual DSCA values. The MultiResUNet model achieved superior results, with a Pearson correlation coefficient of 0.9917 and an MAE of 23.7032 mm on the primary dataset. This high precision and reliability were consistent in external validation, where the MultiResUNet model attained an accuracy of 99.95%, a recall of 0.9989, and an F1-score of 0.9393. Bland-Altman analysis revealed that most discrepancies between predicted and actual DSCA values fell within the limits of agreement, further affirming the robustness of these models.
This study demonstrates that deep learning models, particularly MultiResUNet, offer high accuracy and reliability in the automated segmentation and calculation of DSCA in lumbar spine MRIs. These models hold significant potential for improving diagnostic accuracy and reducing the workload of radiologists. Despite some limitations, such as the restricted dataset size and reliance on T1-weighted images, this study provides valuable insights into the application of deep learning in medical imaging. Future research should include larger, more diverse datasets and additional image weightings to further validate and enhance the generalizability and clinical utility of these models.
腰椎磁共振成像(MRI)在诊断和规划治疗诸如椎间盘退变、椎管狭窄和椎间盘突出等脊柱疾病方面发挥着关键作用。测量硬脊膜囊横截面积(DSCA)是评估椎管狭窄严重程度的一个关键因素。传统上,放射科医生手动进行此项测量,这既耗时又容易出错。深度学习的进展,特别是像U-Net架构这样的卷积神经网络(CNN),在医学图像分析中已显示出巨大潜力。本研究评估深度学习模型在自动测量腰椎MRI的DSCA以提高诊断精度和减轻放射科医生工作量方面的功效。
为了算法开发和评估,我们使用了两个广泛的、匿名的在线数据集:“腰椎MRI数据集”和SPIDER-MRI数据集。合并后的数据集包括683例用于训练和测试的腰椎MRI扫描,另有50例扫描留作外部验证。我们使用5折交叉验证实现并评估了三种深度学习模型——U-Net、注意力U-Net和多分辨率U-Net。这些模型在T1加权轴向MRI图像上进行训练,并根据准确性、精确性、召回率、F1分数和平均绝对误差(MAE)等指标进行评估。
所有模型在预测的和实际的DSCA值之间都表现出高度相关性。多分辨率U-Net模型取得了更好的结果,在主要数据集上的皮尔逊相关系数为0.9917,MAE为23.7032 mm。这种高精度和可靠性在外部验证中也一致,多分辨率U-Net模型在外部验证中达到了99.95%的准确率、0.9989的召回率和0.9393的F1分数。Bland-Altman分析表明,预测的和实际的DSCA值之间的大多数差异都在一致性界限内,进一步证实了这些模型的稳健性。
本研究表明,深度学习模型,特别是多分辨率U-Net,在腰椎MRI的DSCA自动分割和计算中具有很高的准确性和可靠性。这些模型在提高诊断准确性和减少放射科医生工作量方面具有巨大潜力。尽管存在一些局限性,如数据集规模有限和依赖T1加权图像,但本研究为深度学习在医学成像中的应用提供了有价值的见解。未来的研究应包括更大、更多样化的数据集以及额外的图像加权,以进一步验证和提高这些模型的通用性和临床实用性。