Weisman Amy J, Huff Daniel T, Govindan Rajkumar Munian, Chen Song, Perk Timothy G
AIQ Solutions, Madison, WI, United States of America.
Department of Nuclear Medicine, The First Hospital of China Medical University, Shenyang, Liaoning, People's Republic of China.
Biomed Phys Eng Express. 2023 Oct 18;9(6). doi: 10.1088/2057-1976/acfb06.
. Automated organ segmentation on CT images can enable the clinical use of advanced quantitative software devices, but model performance sensitivities must be understood before widespread adoption can occur. The goal of this study was to investigate performance differences between Convolutional Neural Networks (CNNs) trained to segment one (single-class) versus multiple (multi-class) organs, and between CNNs trained on scans from a single manufacturer versus multiple manufacturers.. The multi-class CNN was trained on CT images obtained from 455 whole-body PET/CT scans (413 for training, 42 for testing) taken with Siemens, GE, and Phillips PET/CT scanners where 16 organs were segmented. The multi-class CNN was compared to 16 smaller single-class CNNs trained using the same data, but with segmentations of only one organ per model. In addition, CNNs trained on Siemens-only (N = 186) and GE-only (N = 219) scans (manufacturer-specific) were compared with CNNs trained on data from both Siemens and GE scanners (manufacturer-mixed). Segmentation performance was quantified using five performance metrics, including the Dice Similarity Coefficient (DSC).. The multi-class CNN performed well compared to previous studies, even in organs usually considered difficult auto-segmentation targets (e.g., pancreas, bowel). Segmentations from the multi-class CNN were significantly superior to those from smaller single-class CNNs in most organs, and the 16 single-class models took, on average, six times longer to segment all 16 organs compared to the single multi-class model. The manufacturer-mixed approach achieved minimally higher performance over the manufacturer-specific approach.. A CNN trained on contours of multiple organs and CT data from multiple manufacturers yielded high-quality segmentations. Such a model is an essential enabler of image processing in a software device that quantifies and analyzes such data to determine a patient's treatment response. To date, this activity of whole organ segmentation has not been adopted due to the intense manual workload and time required.
CT图像上的自动器官分割能够使先进的定量软件设备应用于临床,但在广泛采用之前必须了解模型性能的敏感性。本研究的目的是调查训练用于分割单个(单类)器官与多个(多类)器官的卷积神经网络(CNN)之间,以及在来自单一制造商与多个制造商的扫描数据上训练的CNN之间的性能差异。多类CNN是在从西门子、GE和飞利浦PET/CT扫描仪获取的455例全身PET/CT扫描(413例用于训练,42例用于测试)的CT图像上进行训练的,其中分割了16个器官。将多类CNN与使用相同数据训练的16个较小的单类CNN进行比较,但每个模型仅分割一个器官。此外,将仅在西门子(N = 186)和仅在GE(N = 219)扫描数据上训练的CNN(特定制造商)与在西门子和GE扫描仪的数据上训练的CNN(混合制造商)进行比较。使用包括骰子相似系数(DSC)在内的五个性能指标对分割性能进行量化。与先前的研究相比,多类CNN表现良好,即使在通常被认为是自动分割困难目标的器官(如胰腺、肠道)中也是如此。在大多数器官中,多类CNN的分割明显优于较小的单类CNN,并且16个单类模型分割所有16个器官的平均时间是单个多类模型的六倍。混合制造商方法比特定制造商方法的性能略高。在多个器官的轮廓和来自多个制造商的CT数据上训练的CNN产生了高质量的分割。这样的模型是软件设备中图像处理的关键促成因素,该软件设备对这些数据进行量化和分析以确定患者的治疗反应。迄今为止,由于繁重的人工工作量和所需时间,全器官分割这项工作尚未得到应用。