IEEE Trans Neural Netw Learn Syst. 2022 Jun;33(6):2726-2736. doi: 10.1109/TNNLS.2020.3045601. Epub 2022 Jun 1.
Accurate identification and localization of the vertebrae in CT scans is a critical and standard pre-processing step for clinical spinal diagnosis and treatment. Existing methods are mainly based on the integration of multiple neural networks, and most of them use heatmaps to locate the vertebrae's centroid. However, the process of obtaining vertebrae's centroid coordinates using heatmaps is non-differentiable, so it is impossible to train the network to label the vertebrae directly. Therefore, for end-to-end differential training of vertebrae coordinates on CT scans, a robust and accurate automatic vertebral labeling algorithm is proposed in this study. First, a novel end-to-end integral regression localization and multi-label classification network is developed, which can capture multi-scale features and also utilize the residual module and skip connection to fuse the multi-level features. Second, to solve the problem that the process of finding coordinates is non-differentiable and the spatial structure of location being destroyed, an integral regression module is used in the localization network. It combines the advantages of heatmaps representation and direct regression coordinates to achieve end-to-end training and can be compatible with any key point detection methods of medical images based on heatmaps. Finally, multi-label classification of vertebrae is carried out to improve the identification rate, which uses bidirectional long short-term memory (Bi-LSTM) online to enhance the learning of long contextual information of vertebrae. The proposed method is evaluated on a challenging data set, and the results are significantly better than state-of-the-art methods (identification rate is 91.1% and the mean localization error is 2.2 mm). The method is evaluated on a new CT data set, and the results show that our method has good generalization.
在 CT 扫描中准确识别和定位椎体是临床脊柱诊断和治疗的关键和标准预处理步骤。现有的方法主要基于多个神经网络的集成,并且大多数方法都使用热图来定位椎体的质心。然而,使用热图获取椎体质心坐标的过程是不可微的,因此不可能直接训练网络来标记椎体。因此,为了在 CT 扫描上实现椎体坐标的端到端差异训练,本文提出了一种稳健且准确的自动椎体标记算法。首先,开发了一种新颖的端到端积分回归定位和多标签分类网络,该网络可以捕获多尺度特征,并利用残差模块和跳过连接融合多层次特征。其次,为了解决坐标查找过程不可微和位置空间结构被破坏的问题,在定位网络中使用了积分回归模块。它结合了热图表示和直接回归坐标的优点,实现了端到端训练,并且可以与基于热图的任何医学图像关键点检测方法兼容。最后,通过双向长短期记忆(Bi-LSTM)在线进行椎体的多标签分类,以提高识别率,从而增强对椎体长上下文信息的学习。在具有挑战性的数据集上评估所提出的方法,结果明显优于最先进的方法(识别率为 91.1%,平均定位误差为 2.2 毫米)。在新的 CT 数据集上评估该方法,结果表明我们的方法具有良好的泛化能力。