基于卷积神经网络的腰椎椎管狭窄症深度学习检测。
Deep learning-based detection of lumbar spinal canal stenosis using convolutional neural networks.
机构信息
Department of Orthopaedic Surgery, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, N15W7, Sapporo, Hokkaido 060-8638, Japan; Department of Orthopaedic Surgery, Eniwa Hospital, 2-1-1 Kogane Chuo, Eniwa, Hokkaido 061-1449, Japan.
Department of Orthopaedic Surgery, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, N15W7, Sapporo, Hokkaido 060-8638, Japan.
出版信息
Spine J. 2024 Nov;24(11):2086-2101. doi: 10.1016/j.spinee.2024.06.009. Epub 2024 Jun 22.
BACKGROUND CONTEXT
Lumbar spinal canal stenosis (LSCS) is the most common spinal degenerative disorder in elderly people and usually first seen by primary care physicians or orthopedic surgeons who are not spine surgery specialists. Magnetic resonance imaging (MRI) is useful in the diagnosis of LSCS, but the equipment is often not available or difficult to read. LSCS patients with progressive neurologic deficits have difficulty with recovery if surgical treatment is delayed. So, early diagnosis and determination of appropriate surgical indications are crucial in the treatment of LSCS. Convolutional neural networks (CNNs), a type of deep learning, offers significant advantages for image recognition and classification, and work well with radiographs, which can be easily taken at any facility.
PURPOSE
Our purpose was to develop an algorithm to diagnose the presence or absence of LSCS requiring surgery from plain radiographs using CNNs.
STUDY DESIGN
Retrospective analysis of consecutive, nonrandomized series of patients at a single institution.
PATIENT SAMPLE
Data of 150 patients who underwent surgery for LSCS, including degenerative spondylolisthesis, at a single institution from January 2022 to August 2022, were collected. Additionally, 25 patients who underwent surgery at 2 other hospitals were included for extra external validation.
OUTCOME MEASURES
In annotation 1, the area under the curve (AUC) computed from the receiver operating characteristic (ROC) curve, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, positive likelihood ratio (PLR), and negative likelihood ratio (NLR) were calculated. In annotation 2, correlation coefficients were used.
METHODS
Four intervertebral levels from L1/2 to L4/5 were extracted as region of interest from lateral plain lumbar spine radiographs totaling 600 images were obtained. Based on the date of surgery, 500 images derived from the first 125 cases were used for internal validation, and 100 images from the subsequent 25 cases used for external validation. Additionally, 100 images from other hospitals were used for extra external validation. In annotation 1, binary classification of operative and nonoperative levels was used, and in annotation 2, the spinal canal area measured on axial MRI was labeled as the output layer. For internal validation, the 500 images were divided into each 5 dataset on per-patient basis and 5-fold cross-validation was performed. Five trained models were registered in the external validation prediction performance. Grad-CAM was used to visualize area with the high features extracted by CNNs.
RESULTS
In internal validation, the AUC and accuracy for annotation 1 ranged between 0.85-0.89 and 79-83%, respectively, and the correlation coefficients for annotation 2 ranged between 0.53 and 0.64 (all p<.01). In external validation, the AUC and accuracy for annotation 1 were 0.90 and 82%, respectively, and the correlation coefficient for annotation 2 was 0.69, using 5 trained CNN models. In the extra external validation, the AUC and accuracy for annotation 1 were 0.89 and 84%, respectively, and the correlation coefficient for annotation 2 was 0.56. Grad-CAM showed high feature density in the intervertebral joints and posterior intervertebral discs.
CONCLUSIONS
This technology automatically detects LSCS from plain lumbar spine radiographs, making it possible for medical facilities without MRI or nonspecialists to diagnose LSCS, suggesting the possibility of eliminating delays in the diagnosis and treatment of LSCS that require early treatment.
背景
腰椎管狭窄症(LSCS)是老年人最常见的脊柱退行性疾病,通常首先由初级保健医生或骨科医生就诊,他们不是脊柱外科专家。磁共振成像(MRI)有助于 LSCS 的诊断,但设备通常不可用或难以读取。如果手术治疗延迟,LSCS 伴有进行性神经功能缺损的患者难以恢复。因此,早期诊断和确定适当的手术适应证对于 LSCS 的治疗至关重要。卷积神经网络(CNN)是深度学习的一种,在图像识别和分类方面具有显著优势,并且与射线照片配合良好,射线照片可以在任何医疗机构轻松拍摄。
目的
我们的目的是开发一种算法,使用 CNN 从普通射线照片诊断需要手术的 LSCS 的存在或不存在。
研究设计
对单机构连续、非随机系列患者的回顾性分析。
患者样本
收集了一家机构在 2022 年 1 月至 2022 年 8 月期间因 LSCS(包括退行性脊椎滑脱)接受手术的 150 例患者的数据,另外还包括在其他 2 家医院接受手术的 25 例患者的数据,用于额外的外部验证。
结局测量
在注释 1 中,计算了受试者工作特征(ROC)曲线计算的曲线下面积(AUC)、敏感性、特异性、阳性预测值(PPV)、阴性预测值(NPV)、准确性、阳性似然比(PLR)和阴性似然比(NLR)。在注释 2 中,使用了相关系数。
方法
从 L1/2 到 L4/5 的 4 个椎间水平从侧位腰椎平片提取为感兴趣区,共获得 600 张图像。根据手术日期,从前 125 例中获得 500 张图像用于内部验证,随后 25 例中的 100 张图像用于外部验证。此外,还使用了其他医院的 100 张图像进行额外的外部验证。在注释 1 中,使用手术和非手术水平的二进制分类,在注释 2 中,将在轴向 MRI 上测量的椎管面积标记为输出层。对于内部验证,将 500 张图像按每位患者分为每个 5 个数据集,并进行 5 折交叉验证。在外部验证预测性能中注册了 5 个训练模型。使用 Grad-CAM 可视化 CNN 提取的高特征区域。
结果
在内部验证中,注释 1 的 AUC 和准确性分别在 0.85-0.89 和 79-83%之间,注释 2 的相关系数在 0.53 和 0.64 之间(均<0.01)。在外部验证中,注释 1 的 AUC 和准确性分别为 0.90 和 82%,使用 5 个训练的 CNN 模型,注释 2 的相关系数为 0.69。在额外的外部验证中,注释 1 的 AUC 和准确性分别为 0.89 和 84%,注释 2 的相关系数为 0.56。Grad-CAM 显示出椎间关节和椎间盘后部的高特征密度。
结论
该技术可自动从腰椎平片检测 LSCS,使没有 MRI 或非专家的医疗机构能够诊断 LSCS,这表明有可能消除 LSCS 诊断和治疗的延迟,这些延迟需要早期治疗。