College of Science and Technology, Shandong University of Traditional Chinese Medicine, Jinan, SD, China; Computational Medicine Lab, Shandong University of Traditional Chinese Medicine, Jinan, SD, China; Digital Image Group (DIG), London, ON, Canada; Department of Medical Imaging, Western University, London, ON, Canada.
College of Science and Technology, Shandong University of Traditional Chinese Medicine, Jinan, SD, China; Computational Medicine Lab, Shandong University of Traditional Chinese Medicine, Jinan, SD, China.
Med Image Anal. 2018 Dec;50:23-35. doi: 10.1016/j.media.2018.08.005. Epub 2018 Aug 25.
Spinal clinicians still rely on laborious workloads to conduct comprehensive assessments of multiple spinal structures in MRIs, in order to detect abnormalities and discover possible pathological factors. The objective of this work is to perform automated segmentation and classification (i.e., normal and abnormal) of intervertebral discs, vertebrae, and neural foramen in MRIs in one shot, which is called semantic segmentation that is extremely urgent to assist spinal clinicians in diagnosing neural foraminal stenosis, disc degeneration, and vertebral deformity as well as discovering possible pathological factors. However, no work has simultaneously achieved the semantic segmentation of intervertebral discs, vertebrae, and neural foramen due to three-fold unusual challenges: 1) Multiple tasks, i.e., simultaneous semantic segmentation of multiple spinal structures, are more difficult than individual tasks; 2) Multiple targets: average 21 spinal structures per MRI require automated analysis yet have high variety and variability; 3) Weak spatial correlations and subtle differences between normal and abnormal structures generate dynamic complexity and indeterminacy. In this paper, we propose a Recurrent Generative Adversarial Network called Spine-GAN for resolving above-aforementioned challenges. Firstly, Spine-GAN explicitly solves the high variety and variability of complex spinal structures through an atrous convolution (i.e., convolution with holes) autoencoder module that is capable of obtaining semantic task-aware representation and preserving fine-grained structural information. Secondly, Spine-GAN dynamically models the spatial pathological correlations between both normal and abnormal structures thanks to a specially designed long short-term memory module. Thirdly, Spine-GAN obtains reliable performance and efficient generalization by leveraging a discriminative network that is capable of correcting predicted errors and global-level contiguity. Extensive experiments on MRIs of 253 patients have demonstrated that Spine-GAN achieves high pixel accuracy of 96.2%, Dice coefficient of 87.1%, Sensitivity of 89.1% and Specificity of 86.0%, which reveals its effectiveness and potential as a clinical tool.
脊柱临床医生仍然依靠繁琐的工作量来对 MRI 中的多个脊柱结构进行全面评估,以检测异常并发现可能的病理因素。这项工作的目的是对 MRI 中的椎间盘、椎体和神经孔进行自动分割和分类(即正常和异常),这被称为语义分割,它对于帮助脊柱临床医生诊断神经孔狭窄、椎间盘退变和椎体畸形以及发现可能的病理因素非常紧迫。然而,由于三重不寻常的挑战,还没有工作同时实现椎间盘、椎体和神经孔的语义分割:1)多任务,即同时对多个脊柱结构进行语义分割,比单个任务更困难;2)多目标:每个 MRI 平均需要 21 个脊柱结构,需要自动化分析,但具有高度的多样性和可变性;3)正常和异常结构之间的弱空间相关性和细微差异产生了动态复杂性和不确定性。在本文中,我们提出了一种称为 Spine-GAN 的递归生成对抗网络,用于解决上述挑战。首先,Spine-GAN 通过具有空洞卷积(即带孔卷积)的自动编码器模块显式解决复杂脊柱结构的高度多样性和可变性,该模块能够获得语义任务感知表示并保留细粒度的结构信息。其次,Spine-GAN 借助专门设计的长短时记忆模块动态地建模正常和异常结构之间的空间病理相关性。第三,Spine-GAN 通过利用能够纠正预测错误和全局级连续性的判别网络来获得可靠的性能和高效的泛化能力。在 253 名患者的 MRI 上进行的广泛实验表明,Spine-GAN 实现了 96.2%的高像素准确率、87.1%的 Dice 系数、89.1%的灵敏度和 86.0%的特异性,这表明其作为一种临床工具的有效性和潜力。