Suppr超能文献

用于磁共振成像上腰椎间盘突出症分级和定位的深度学习模型

Deep Learning Model for Grading and Localization of Lumbar Disc Herniation on Magnetic Resonance Imaging.

作者信息

Xu Yefu, Zheng Shijie, Tian Qingyi, Kou Zhuoyan, Li Wenqing, Xie Xinhui, Wu Xiaotao

机构信息

Department of Spine Surgery, ZhongDa Hospital, School of Medicine, Southeast University, Nanjing, China.

出版信息

J Magn Reson Imaging. 2025 Jan;61(1):364-375. doi: 10.1002/jmri.29403. Epub 2024 Apr 27.

Abstract

BACKGROUND

Methods for grading and localization of lumbar disc herniation (LDH) on MRI are complex, time-consuming, and subjective. Utilizing deep learning (DL) models as assistance would mitigate such complexities.

PURPOSE

To develop an interpretable DL model capable of grading and localizing LDH.

STUDY TYPE

Retrospective.

SUBJECTS

1496 patients (M/F: 783/713) were evaluated, and randomly divided into training (70%), validation (10%), and test (20%) sets.

FIELD STRENGTH/SEQUENCE: 1.5T MRI for axial T2-weighted sequences (spin echo).

ASSESSMENT

The training set was annotated by three spinal surgeons using the Michigan State University classification to train the DL model. The test set was annotated by a spinal surgery expert (as ground truth labels), and two spinal surgeons (comparison with the trained model). An external test set was employed to evaluate the generalizability of the DL model.

STATISTICAL TESTS

Calculated intersection over union (IoU) for detection consistency, utilized Gwet's AC1 to assess interobserver agreement, and evaluated model performance based on sensitivity and specificity, with statistical significance set at P < 0.05.

RESULTS

The DL model achieved high detection consistency in both the internal test dataset (grading: mean IoU 0.84, recall 99.6%; localization: IoU 0.82, recall 99.5%) and external test dataset (grading: 0.72, 98.0%; localization: 0.71, 97.6%). For internal testing, the DL model (grading: 0.81; localization: 0.76), Rater 1 (0.88; 0.82), and Rater 2 (0.86; 0.83) demonstrated results highly consistent with the ground truth labels. The overall sensitivity of the DL model was 87.0% for grading and 84.0% for localization, while the specificity was 95.5% and 94.4%. For external testing, the DL model showed an appreciable decrease in consistency (grading: 0.69; localization: 0.66), sensitivity (77.2%; 76.7%), and specificity (92.3%; 91.8%).

DATA CONCLUSION

The classification capabilities of the DL model closely resemble those of spinal surgeons. For future improvement, enriching the diversity of cases could enhance the model's generalization.

TECHNICAL EFFICACY

Stage 2.

摘要

背景

腰椎间盘突出症(LDH)在MRI上的分级和定位方法复杂、耗时且主观。利用深度学习(DL)模型作为辅助可以减轻此类复杂性。

目的

开发一种能够对LDH进行分级和定位的可解释DL模型。

研究类型

回顾性研究。

研究对象

对1496例患者(男/女:783/713)进行评估,并随机分为训练集(70%)、验证集(10%)和测试集(20%)。

场强/序列:1.5T MRI用于轴向T2加权序列(自旋回波)。

评估

训练集由三位脊柱外科医生使用密歇根州立大学分类法进行标注,以训练DL模型。测试集由一位脊柱外科专家进行标注(作为真实标签),另外两位脊柱外科医生进行标注(与训练好的模型进行比较)。使用外部测试集评估DL模型的泛化能力。

统计检验

计算检测一致性的交并比(IoU),利用Gwet's AC1评估观察者间的一致性,并基于敏感性和特异性评估模型性能,设定统计学显著性为P < 0.05。

结果

DL模型在内部测试数据集(分级:平均IoU 0.84,召回率99.6%;定位:IoU 0.82,召回率99.5%)和外部测试数据集(分级:0.72,98.0%;定位:0.71,97.6%)中均实现了较高的检测一致性。对于内部测试,DL模型(分级:0.81;定位:0.76)、评估者第1位(0.88;0.82)和评估者第2位(0.86;0.83)的结果与真实标签高度一致。DL模型分级的总体敏感性为87.0%,定位的总体敏感性为84.0%,而特异性分别为95.5%和94.4%。对于外部测试,DL模型在一致性(分级:0.69;定位:0.66)、敏感性(77.2%;76.7%)和特异性(92.3%;91.8%)方面均有明显下降。

数据结论

DL模型的分类能力与脊柱外科医生的能力非常相似。为了未来的改进,丰富病例的多样性可以提高模型的泛化能力。

技术效能

2级。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验