Department of Biostatistical Sciences, Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, North Carolina.
Department of Radiology, Wake Forest School of Medicine, Medical Center Blvd, Winston-Salem, NC 27157.
Acad Radiol. 2019 Dec;26(12):1686-1694. doi: 10.1016/j.acra.2019.06.017. Epub 2019 Jul 17.
To develop and evaluate an automated machine learning (ML) algorithm for segmenting the paraspinous muscles on chest computed tomography (CT) scans to evaluate for presence of sarcopenia.
A convolutional neural network based on the U-Net architecture was trained to perform muscle segmentation on a dataset of 1875 single slice CT images and was tested on 209 CT images of participants in the National Lung Screening Trial. Low-dose, noncontrast CT examinations were obtained at 33 clinical sites, using scanners from four manufacturers. The study participants had a mean age of 71.6 years (range, 70-74 years). Ground truth was obtained by manually segmenting the left paraspinous muscle at the level of the T12 vertebra. Muscle cross-sectional area (CSA) and muscle attenuation (MA) were recorded. Comparison between the ML algorithm and ground truth measures of muscle CSA and MA were obtained using Dice similarity coefficients and Pearson correlations.
Compared to ground truth segmentation, the ML algorithm achieved median (standard deviation) Dice scores of 0.94 (0.04) in the test set. Mean (SD) muscle CSA was 14.3 (3.6) cm for ground truth and 13.7 (3.5) cm for ML segmentation. Mean (SD) MA was 41.6 (7.6) Hounsfield units (HU) for ground truth and 43.5 (7.9) HU for ML segmentation. There was high correlation between ML algorithm and ground truth for muscle CSA (r = 0.86; p < 0.0001) and MA (r = 0.95; p < 0.0001).
The ML algorithm for measurement of paraspinous muscles compared favorably to manual ground truth measurements in the NLST. The algorithm generalized well to a heterogeneous set of low-dose CT images and may be capable of automated quantification of muscle metrics to screen for sarcopenia on routine chest CT examinations.
开发并评估一种基于自动机器学习(ML)的算法,以对胸部 CT 扫描的脊柱旁肌肉进行分割,从而评估是否存在肌肉减少症。
使用基于 U-Net 架构的卷积神经网络对包含 1875 个单切片 CT 图像的数据集进行肌肉分割训练,并在来自 4 家制造商的扫描仪在 33 个临床站点获取的 209 例国家肺癌筛查试验(NLST)参与者的 CT 图像上进行测试。研究参与者的平均年龄为 71.6 岁(范围,70-74 岁)。通过手动在 T12 椎体水平对左侧脊柱旁肌肉进行分割来获得真实值。记录肌肉横截面积(CSA)和肌肉衰减(MA)。通过 Dice 相似系数和 Pearson 相关性比较 ML 算法和真实值的肌肉 CSA 和 MA 测量值。
与真实值分割相比,在测试集中,ML 算法的中位数(标准差)Dice 评分分别为 0.94(0.04)。真实值的肌肉 CSA 平均值(SD)为 14.3(3.6)cm,ML 分割的肌肉 CSA 平均值(SD)为 13.7(3.5)cm。真实值的 MA 平均值(SD)为 41.6(7.6)HU,ML 分割的 MA 平均值(SD)为 43.5(7.9)HU。ML 算法与真实值之间肌肉 CSA(r=0.86;p<0.0001)和 MA(r=0.95;p<0.0001)具有高度相关性。
在 NLST 中,用于测量脊柱旁肌肉的 ML 算法与手动真实值测量结果相比表现良好。该算法对异质的低剂量 CT 图像具有很好的泛化能力,并且可能能够自动量化肌肉指标,从而在常规胸部 CT 检查中筛查肌肉减少症。