Department of Orthopaedics, University Medical Center Utrecht, Utrecht, The Netherlands.
Image Sciences Institute, University Medical Center Utrecht, Utrecht, The Netherlands.
J Orthop Res. 2022 Dec;40(12):2894-2907. doi: 10.1002/jor.25314. Epub 2022 Mar 15.
Semantic segmentation of bone from lower extremity computerized tomography (CT) scans can improve and accelerate the visualization, diagnosis, and surgical planning in orthopaedics. However, the large field of view of these scans makes automatic segmentation using deep learning based methods challenging, slow and graphical processing unit (GPU) memory intensive. We investigated methods to more efficiently represent anatomical context for accurate and fast segmentation and compared these with state-of-the-art methodology. Six lower extremity bones from patients of two different datasets were manually segmented from CT scans, and used to train and optimize a cascaded deep learning approach. We varied the number of resolution levels, receptive fields, patch sizes, and number of V-net blocks. The best performing network used a multi-stage, cascaded V-net approach with 128 -64 -32 voxel patches as input. The average Dice coefficient over all bones was 0.98 ± 0.01, the mean surface distance was 0.26 ± 0.12 mm and the 95th percentile Hausdorff distance 0.65 ± 0.28 mm. This was a significant improvement over the results of the state-of-the-art nnU-net, with only approximately 1/12th of training time, 1/3th of inference time and 1/4th of GPU memory required. Comparison of the morphometric measurements performed on automatic and manual segmentations showed good correlation (Intraclass Correlation Coefficient [ICC] >0.8) for the alpha angle and excellent correlation (ICC >0.95) for the hip-knee-ankle angle, femoral inclination, femoral version, acetabular version, Lateral Centre-Edge angle, acetabular coverage. The segmentations were generally of sufficient quality for the tested clinical applications and were performed accurately and quickly compared to state-of-the-art methodology from the literature.
下肢计算机断层扫描 (CT) 扫描的骨骼语义分割可以改善和加速矫形外科的可视化、诊断和手术规划。然而,这些扫描的大视野使得使用基于深度学习的方法进行自动分割具有挑战性、速度慢且图形处理单元 (GPU) 内存密集。我们研究了更有效地表示解剖上下文的方法,以实现准确和快速的分割,并将这些方法与最先进的方法进行了比较。从两个不同数据集的患者的六块下肢骨骼从 CT 扫描中手动分割,并用于训练和优化级联深度学习方法。我们改变了分辨率水平、感受野、补丁大小和 V-net 块的数量。表现最佳的网络使用多阶段、级联 V-net 方法,输入的 128-64-32 体素补丁。所有骨骼的平均 Dice 系数为 0.98±0.01,平均表面距离为 0.26±0.12mm,95%的 Hausdorff 距离为 0.65±0.28mm。与最先进的 nnU-net 相比,这是一个显著的改进,仅需要大约 1/12 的训练时间、1/3 的推理时间和 1/4 的 GPU 内存。对自动和手动分割进行的形态测量比较表明,alpha 角的相关性良好(组内相关系数 [ICC] >0.8),髋膝踝角的相关性非常好(ICC >0.95),股骨倾斜度、股骨版本、髋臼版本、外侧中心-边缘角、髋臼覆盖率。与文献中最先进的方法相比,这些分割通常具有足够的质量,可用于测试的临床应用,并且可以准确快速地进行。