Hou Benjamin, Mathai Tejas Sudharshan, Liu Jianfei, Parnell Christopher, Summers Ronald M
National Institutes of Health (NIH) Clinical Center, Bethesda MD, USA.
Walter Reed National Military Medical Center, Bethesda MD, USA.
ArXiv. 2024 Apr 12:arXiv:2401.05294v2.
Body composition measurements from routine abdominal CT can yield personalized risk assessments for asymptomatic and diseased patients. In particular, attenuation and volume measures of muscle and fat are associated with important clinical outcomes, such as cardiovascular events, fractures, and death. This study evaluates the reliability of an Internal tool for the segmentation of muscle and fat (subcutaneous and visceral) as compared to the well-established public TotalSegmentator tool.
We assessed the tools across 900 CT series from the publicly available SAROS dataset, focusing on muscle, subcutaneous fat, and visceral fat. The Dice score was employed to assess accuracy in subcutaneous fat and muscle segmentation. Due to the lack of ground truth segmentations for visceral fat, Cohen's Kappa was utilized to assess segmentation agreement between the tools.
Our Internal tool achieved a 3% higher Dice (83.8 vs. 80.8) for subcutaneous fat and a 5% improvement (87.6 vs. 83.2) for muscle segmentation respectively. A Wilcoxon signed-rank test revealed that our results were statistically different with < 0.01. For visceral fat, the Cohen's kappa score of 0.856 indicated near-perfect agreement between the two tools. Our internal tool also showed very strong correlations for muscle volume (=0.99), muscle attenuation (=0.93), and subcutaneous fat volume (=0.99) with a moderate correlation for subcutaneous fat attenuation (=0.45).
Our findings indicated that our Internal tool outperformed TotalSegmentator in measuring subcutaneous fat and muscle. The high Cohen's Kappa score for visceral fat suggests a reliable level of agreement between the two tools. These results demonstrate the potential of our tool in advancing the accuracy of body composition analysis.
通过常规腹部CT进行身体成分测量可为无症状和患病患者提供个性化风险评估。特别是,肌肉和脂肪的衰减及体积测量与重要临床结局相关,如心血管事件、骨折和死亡。本研究评估了一种用于肌肉和脂肪(皮下和内脏)分割的内部工具与成熟的公共TotalSegmentator工具相比的可靠性。
我们在公开可用的SAROS数据集中的900个CT系列上评估了这些工具,重点关注肌肉、皮下脂肪和内脏脂肪。采用Dice分数评估皮下脂肪和肌肉分割的准确性。由于缺乏内脏脂肪的真实分割数据,使用Cohen's Kappa评估工具之间的分割一致性。
我们的内部工具在皮下脂肪分割方面的Dice分数分别高出3%(83.8对80.8),在肌肉分割方面提高了5%(87.6对83.2)。Wilcoxon符号秩检验显示我们的结果在统计学上有显著差异,<0.01。对于内脏脂肪,Cohen's kappa分数为0.856表明两种工具之间几乎完全一致。我们的内部工具在肌肉体积(=0.99)、肌肉衰减(=0.93)和皮下脂肪体积(=0.99)方面也显示出非常强的相关性,皮下脂肪衰减的相关性中等(=0.45)。
我们的研究结果表明,我们的内部工具在测量皮下脂肪和肌肉方面优于TotalSegmentator。内脏脂肪的高Cohen's Kappa分数表明两种工具之间具有可靠的一致性水平。这些结果证明了我们的工具在提高身体成分分析准确性方面的潜力。