Xu Yuchen, Tang Olivia, Tang Yucheng, Lee Ho Hin, Chen Yunqiang, Gao Dashan, Han Shizhong, Gao Riqiang, Savona Michael R, Abramson Richard G, Huo Yuankai, Landman Bennett A
Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA 37212.
12 Sigma Technology, San Diego, CA, USA 92130.
Proc SPIE Int Soc Opt Eng. 2020;11313. doi: 10.1117/12.2549365. Epub 2020 Mar 10.
Abdominal multi-organ segmentation of computed tomography (CT) images has been the subject of extensive research interest. It presents a substantial challenge in medical image processing, as the shape and distribution of abdominal organs can vary greatly among the population and within an individual over time. While continuous integration of novel datasets into the training set provides potential for better segmentation performance, collection of data at scale is not only costly, but also impractical in some contexts. Moreover, it remains unclear what marginal value additional data have to offer. Herein, we propose a single-pass active learning method through human quality assurance (QA). We built on a pre-trained 3D U-Net model for abdominal multi-organ segmentation and augmented the dataset either with outlier data (e.g., exemplars for which the baseline algorithm failed) or inliers (e.g., exemplars for which the baseline algorithm worked). The new models were trained using the augmented datasets with 5-fold cross-validation (for outlier data) and withheld outlier samples (for inlier data). Manual labeling of outliers increased Dice scores with outliers by 0.130, compared to an increase of 0.067 with inliers (p<0.001, two-tailed paired t-test). By adding 5 to 37 inliers or outliers to training, we find that the marginal value of adding outliers is higher than that of adding inliers. In summary, improvement on single-organ performance was obtained without diminishing multi-organ performance or significantly increasing training time. Hence, identification and correction of baseline failure cases present an effective and efficient method of selecting training data to improve algorithm performance.
计算机断层扫描(CT)图像的腹部多器官分割一直是广泛研究的热点。它在医学图像处理中提出了重大挑战,因为腹部器官的形状和分布在人群中以及个体内部随时间可能有很大差异。虽然将新数据集持续整合到训练集中为提高分割性能提供了潜力,但大规模收集数据不仅成本高昂,而且在某些情况下也不切实际。此外,额外数据能提供的边际价值仍不明确。在此,我们提出一种通过人工质量保证(QA)的单通道主动学习方法。我们基于预训练的用于腹部多器官分割的3D U-Net模型,并使用离群数据(例如基线算法失败的样本)或内群数据(例如基线算法成功的样本)扩充数据集。使用扩充后的数据集通过5折交叉验证(用于离群数据)和保留离群样本(用于内群数据)训练新模型。与内群数据增加0.067相比,人工标记离群数据使离群数据的Dice分数提高了0.130(p<0.001,双尾配对t检验)。通过向训练中添加5到37个内群或离群数据,我们发现添加离群数据的边际价值高于添加内群数据。总之,在不降低多器官性能或显著增加训练时间的情况下,单器官性能得到了提升。因此,识别和纠正基线失败案例是选择训练数据以提高算法性能的一种有效且高效的方法。