Hu Houmin, Wang Xiandi, Yang Heng, Zhang Jin, Li Kang, Zeng Jiancheng
College of Electrical Engineering, Sichuan University, Chengdu Sichuan, 610041, P. R. China.
Orthopedic Research Institute, Department of Orthopedics, West China Hospital, Sichuan University, Chengdu Sichuan, 610041, P. R. China.
Zhongguo Xiu Fu Chong Jian Wai Ke Za Zhi. 2023 Jan 15;37(1):81-90. doi: 10.7507/1002-1892.202209058.
To develop an automatic diagnostic tool based on deep learning for lumbar spine stability and validate diagnostic accuracy.
Preoperative lumbar hyper-flexion and hyper-extension X-ray films were collected from 153 patients with lumbar disease. The following 5 key points were marked by 3 orthopedic surgeons: L posteroinferior, anterior inferior angles as well as L posterosuperior, anterior superior, and posterior inferior angles. The labeling results of each surgeon were preserved independently, and a total of three sets of labeling results were obtained. A total of 306 lumbar X-ray films were randomly divided into training (=156), validation (=50), and test (=100) sets in a ratio of 3∶1∶2. A new neural network architecture, Swin-PGNet was proposed, which was trained using annotated radiograph images to automatically locate the lumbar vertebral key points and calculate L intervertebral Cobb angle and L lumbar sliding distance through the predicted key points. The mean error and intra-class correlation coefficient () were used as an evaluation index, to compare the differences between surgeons' annotations and Swin-PGNet on the three tasks (key point positioning, Cobb angle measurement, and lumbar sliding distance measurement). Meanwhile, the change of Cobb angle more than 11° was taken as the criterion of lumbar instability, and the lumbar sliding distance more than 3 mm was taken as the criterion of lumbar spondylolisthesis. The accuracy of surgeon annotation and Swin-PGNet in judging lumbar instability was compared.
① Key point: The mean error of key point location by Swin-PGNet was (1.407±0.939) mm, and by different surgeons was (3.034±2.612) mm. ② Cobb angle: The mean error of Swin-PGNet was (2.062±1.352)° and the mean error of surgeons was (3.580±2.338)°. There was no significant difference between Swin-PGNet and surgeons (>0.05), but there was a significant difference between different surgeons (<0.05). ③ Lumbar sliding distance: The mean error of Swin-PGNet was (1.656±0.878) mm and the mean error of surgeons was (1.884±1.612) mm. There was no significant difference between Swin-PGNet and surgeons and between different surgeons (>0.05). The accuracy of lumbar instability diagnosed by surgeons and Swin-PGNet was 75.3% and 84.0%, respectively. The accuracy of lumbar spondylolisthesis diagnosed by surgeons and Swin-PGNet was 70.7% and 71.3%, respectively. There was no significant difference between Swin-PGNet and surgeons, as well as between different surgeons (>0.05). ④ Consistency of lumbar stability diagnosis: The of Cobb angle among different surgeons was 0.913 [95% (0.898, 0.934)] (<0.05), and the of lumbar sliding distance was 0.741 [95% (0.729, 0.796)] (<0.05). The result showed that the annotating of the three surgeons were consistent. The of Cobb angle between Swin-PGNet and surgeons was 0.922 [95% (0.891, 0.938)] (<0.05), and the of lumbar sliding distance was 0.748 [95%(0.726, 0.783)] (<0.05). The result showed that the annotating of Swin-PGNet were consistent with those of surgeons.
The automatic diagnostic tool for lumbar instability constructed based on deep learning can realize the automatic identification of lumbar instability and spondylolisthesis accurately and conveniently, which can effectively assist clinical diagnosis.
开发一种基于深度学习的腰椎稳定性自动诊断工具并验证其诊断准确性。
收集153例腰椎疾病患者术前腰椎过屈和过伸位X线片。由3名骨科医生标记以下5个关键点:L椎后下、前下角度以及L椎后上、前上和后下角度。每位医生的标记结果独立保存,共获得三组标记结果。将306张腰椎X线片按3∶1∶2的比例随机分为训练集(=156)、验证集(=50)和测试集(=100)。提出一种新的神经网络架构Swin-PGNet,使用标注的X线影像对其进行训练,以自动定位腰椎关键点,并通过预测的关键点计算L椎间Cobb角和L腰椎滑动距离。以平均误差和组内相关系数()作为评估指标,比较医生标注与Swin-PGNet在三项任务(关键点定位、Cobb角测量和腰椎滑动距离测量)上的差异。同时,将Cobb角变化超过11°作为腰椎不稳定的标准,将腰椎滑动距离超过3 mm作为腰椎滑脱的标准。比较医生标注和Swin-PGNet判断腰椎不稳定的准确性。
①关键点:Swin-PGNet关键点定位的平均误差为(1.407±0.939)mm,不同医生的平均误差为(3.034±2.612)mm。②Cobb角:Swin-PGNet的平均误差为(2.062±1.352)°,医生的平均误差为(3.580±2.338)°。Swin-PGNet与医生之间差异无统计学意义(>0.05),但不同医生之间差异有统计学意义(<0.05)。③腰椎滑动距离:Swin-PGNet的平均误差为(1.656±0.878)mm,医生的平均误差为(1.884±1.612)mm。Swin-PGNet与医生之间以及不同医生之间差异均无统计学意义(>0.05)。医生和Swin-PGNet诊断腰椎不稳定的准确性分别为75.3%和84.0%。医生和Swin-PGNet诊断腰椎滑脱的准确性分别为70.7%和71.3%。Swin-PGNet与医生之间以及不同医生之间差异均无统计学意义(>0.05)。④腰椎稳定性诊断的一致性:不同医生之间Cobb角的为0.913 [95%(0.898,0.934)](<0.05),腰椎滑动距离的为0.741 [95%(0.729,0.796)](<0.05)。结果表明三位医生的标注具有一致性。Swin-PGNet与医生之间Cobb角的为0.922 [95%(0.891,0.938)](<0.05),腰椎滑动距离的为0.748 [95%(0.726,0.783)](<0.05)。结果表明Swin-PGNet的标注与医生的标注具有一致性。
基于深度学习构建的腰椎不稳定自动诊断工具能够准确、便捷地实现腰椎不稳定和腰椎滑脱的自动识别,可有效辅助临床诊断。