Mai Dinh Van Chi, Drami Ioanna, Pring Edward T, Gould Laura E, Lung Phillip, Popuri Karteek, Chow Vincent, Beg Mirza F, Athanasiou Thanos, Jenkins John T
Department of Surgery, St Mark's Academic Institute, St Mark's Hospital, London, UK.
Department of Surgery and Cancer, Imperial College, London, UK.
J Cachexia Sarcopenia Muscle. 2023 Oct;14(5):1973-1986. doi: 10.1002/jcsm.13310. Epub 2023 Aug 10.
Automated computed tomography (CT) scan segmentation (labelling of pixels according to tissue type) is now possible. This technique is being adapted to achieve three-dimensional (3D) segmentation of CT scans, opposed to single L3-slice alone. This systematic review evaluates feasibility and accuracy of automated segmentation of 3D CT scans for volumetric body composition (BC) analysis, as well as current limitations and pitfalls clinicians and researchers should be aware of. OVID Medline, Embase and grey literature databases up to October 2021 were searched. Original studies investigating automated skeletal muscle, visceral and subcutaneous AT segmentation from CT were included. Seven of the 92 studies met inclusion criteria. Variation existed in expertise and numbers of humans performing ground-truth segmentations used to train algorithms. There was heterogeneity in patient characteristics, pathology and CT phases that segmentation algorithms were developed upon. Reporting of anatomical CT coverage varied, with confusing terminology. Six studies covered volumetric regional slabs rather than the whole body. One study stated the use of whole-body CT, but it was not clear whether this truly meant head-to-fingertip-to-toe. Two studies used conventional computer algorithms. The latter five used deep learning (DL), an artificial intelligence technique where algorithms are similarly organized to brain neuronal pathways. Six of seven reported excellent segmentation performance (Dice similarity coefficients > 0.9 per tissue). Internal testing on unseen scans was performed for only four of seven algorithms, whilst only three were tested externally. Trained DL algorithms achieved full CT segmentation in 12 to 75 s versus 25 min for non-DL techniques. DL enables opportunistic, rapid and automated volumetric BC analysis of CT performed for clinical indications. However, most CT scans do not cover head-to-fingertip-to-toe; further research must validate using common CT regions to estimate true whole-body BC, with direct comparison to single lumbar slice. Due to successes of DL, we expect progressive numbers of algorithms to materialize in addition to the seven discussed in this paper. Researchers and clinicians in the field of BC must therefore be aware of pitfalls. High Dice similarity coefficients do not inform the degree to which BC tissues may be under- or overestimated and nor does it inform on algorithm precision. Consensus is needed to define accuracy and precision standards for ground-truth labelling. Creation of a large international, multicentre common CT dataset with BC ground-truth labels from multiple experts could be a robust solution.
现在,自动计算机断层扫描(CT)扫描分割(根据组织类型对像素进行标记)已成为可能。这项技术正在进行调整,以实现CT扫描的三维(3D)分割,而不是仅对单个L3切片进行分割。本系统评价评估了用于体成分(BC)分析的3D CT扫描自动分割的可行性和准确性,以及临床医生和研究人员应注意的当前局限性和陷阱。检索了截至2021年10月的OVID Medline、Embase和灰色文献数据库。纳入了调查从CT自动分割骨骼肌、内脏和皮下脂肪组织的原始研究。92项研究中有7项符合纳入标准。在用于训练算法的进行真值分割的人员专业知识和数量方面存在差异。分割算法所基于的患者特征、病理和CT期存在异质性。解剖学CT覆盖范围的报告各不相同,术语也令人困惑。六项研究涵盖了体积区域切片而非全身。一项研究称使用了全身CT,但不清楚这是否真的意味着从头到指尖再到脚趾。两项研究使用了传统计算机算法。后五项使用了深度学习(DL),这是一种人工智能技术,算法的组织方式类似于大脑神经元通路。七项研究中有六项报告了出色的分割性能(每个组织的骰子相似系数>0.9)。七项算法中只有四项对未见过的扫描进行了内部测试,而只有三项进行了外部测试。经过训练的DL算法实现全CT分割需要12至75秒,而非DL技术则需要25分钟。DL能够对因临床指征而进行的CT进行机会性、快速和自动的体成分BC分析。然而,大多数CT扫描并不覆盖从头到指尖再到脚趾的范围;进一步的研究必须验证使用常见的CT区域来估计真正的全身BC,并与单个腰椎切片进行直接比较。由于DL取得的成功,除了本文讨论的七种算法外,我们预计还会有越来越多的算法出现。因此,BC领域的研究人员和临床医生必须注意其中的陷阱。高骰子相似系数并不能说明BC组织可能被低估或高估的程度,也不能说明算法的精度。需要就真值标记的准确性和精度标准达成共识。创建一个由多个专家提供BC真值标签的大型国际多中心通用CT数据集可能是一个有力的解决方案。