Verheijen E J A, Kapogiannis T, Munteh D, Chabros J, Staring M, Smith T R, Vleggeert-Lankamp C L A
Computational Neuroscience Outcomes Center, Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, Boston, USA.
Department of Neurosurgery, Leiden University Medical Center, Albinusdreef 2, 2333ZA, Leiden, The Netherlands.
Eur Spine J. 2025 Mar;34(3):1146-1155. doi: 10.1007/s00586-025-08672-9. Epub 2025 Jan 30.
Lumbar spinal stenosis (LSS) is a frequently occurring condition defined by narrowing of the spinal or nerve root canal due to degenerative changes. Physicians use MRI scans to determine the severity of stenosis, occasionally complementing it with X-ray or CT scans during the diagnostic work-up. However, manual grading of stenosis is time-consuming and induces inter-reader variability as a standardized grading system is lacking. Machine Learning (ML) has the potential to aid physicians in this process by automating segmentation and classification of LSS. However, it is unclear what models currently exist to perform these tasks.
A systematic review of literature was performed by searching the Cochrane Library, Embase, Emcare, PubMed, and Web of Science databases for studies describing an ML-based algorithm to perform segmentation or classification of the lumbar spine for LSS. Risk of bias was assessed through an adjusted version of the Newcastle-Ottawa Quality Assessment Scale that was more applicable to ML studies. Qualitative analyses were performed based on type of algorithm (conventional ML or Deep Learning (DL)) and task (segmentation or classification).
A total of 27 articles were included of which nine on segmentation, 16 on classification and 2 on both tasks. The majority of studies focused on algorithms for MRI analysis. There was wide variety among the outcome measures used to express model performance. Overall, ML algorithms are able to perform segmentation and classification tasks excellently. DL methods tend to demonstrate better performance than conventional ML models. For segmentation the best performing DL models were U-Net based. For classification U-Net and unspecified CNNs powered the models that performed the best for the majority of outcome metrics. The number of models with external validation was limited.
DL models achieve excellent performance for segmentation and classification tasks for LSS, outperforming conventional ML algorithms. However, comparisons between studies are challenging due to the variety in outcome measures and test datasets. Future studies should focus on the segmentation task using DL models and utilize a standardized set of outcome measures and publicly available test dataset to express model performance. In addition, these models need to be externally validated to assess generalizability.
腰椎管狭窄症(LSS)是一种常见疾病,由退变导致的脊髓或神经根通道狭窄所定义。医生使用磁共振成像(MRI)扫描来确定狭窄的严重程度,在诊断过程中偶尔会辅以X线或计算机断层扫描(CT)扫描。然而,由于缺乏标准化的分级系统,狭窄的手动分级耗时且会导致不同阅片者之间的差异。机器学习(ML)有潜力通过自动分割和分类LSS来辅助医生完成这一过程。然而,目前尚不清楚有哪些模型可用于执行这些任务。
通过检索考克兰图书馆、Embase、Emcare、PubMed和科学网数据库,对文献进行系统综述,以查找描述基于ML的算法用于LSS腰椎分割或分类的研究。通过纽卡斯尔-渥太华质量评估量表的调整版本评估偏倚风险,该版本更适用于ML研究。基于算法类型(传统ML或深度学习(DL))和任务(分割或分类)进行定性分析。
共纳入27篇文章,其中9篇关于分割,16篇关于分类,2篇涉及两项任务。大多数研究集中于MRI分析算法。用于表达模型性能的结果指标多种多样。总体而言,ML算法能够出色地执行分割和分类任务。DL方法往往比传统ML模型表现更好。对于分割,表现最佳的DL模型基于U-Net。对于分类,U-Net和未指定的卷积神经网络(CNN)驱动的模型在大多数结果指标上表现最佳。进行外部验证的模型数量有限。
DL模型在LSS的分割和分类任务中表现出色,优于传统ML算法。然而,由于结果指标和测试数据集的多样性,研究之间的比较具有挑战性。未来的研究应聚焦于使用DL模型的分割任务,并利用一套标准化结果指标和公开可用的测试数据集来表达模型性能。此外,这些模型需要进行外部验证以评估其通用性。