Hoang Duc Albert K, Eminowicz Gemma, Mendes Ruheena, Wong Swee-Ling, McClelland Jamie, Modat Marc, Cardoso M Jorge, Mendelson Alex F, Veiga Catarina, Kadir Timor, D'Souza Derek, Ourselin Sebastien
Center for Medical Image Computing, University College London, London WC1E 6BT, United Kingdom.
Radiotherapy Department, University College London Hospitals, 235 Euston Road, London NW1 2BU, United Kingdom.
Med Phys. 2015 Sep;42(9):5027-34. doi: 10.1118/1.4927567.
The aim of this study was to assess whether clinically acceptable segmentations of organs at risk (OARs) in head and neck cancer can be obtained automatically and efficiently using the novel "similarity and truth estimation for propagated segmentations" (STEPS) compared to the traditional "simultaneous truth and performance level estimation" (STAPLE) algorithm.
First, 6 OARs were contoured by 2 radiation oncologists in a dataset of 100 patients with head and neck cancer on planning computed tomography images. Each image in the dataset was then automatically segmented with STAPLE and STEPS using those manual contours. Dice similarity coefficient (DSC) was then used to compare the accuracy of these automatic methods. Second, in a blind experiment, three separate and distinct trained physicians graded manual and automatic segmentations into one of the following three grades: clinically acceptable as determined by universal delineation guidelines (grade A), reasonably acceptable for clinical practice upon manual editing (grade B), and not acceptable (grade C). Finally, STEPS segmentations graded B were selected and one of the physicians manually edited them to grade A. Editing time was recorded.
Significant improvements in DSC can be seen when using the STEPS algorithm on large structures such as the brainstem, spinal canal, and left/right parotid compared to the STAPLE algorithm (all p < 0.001). In addition, across all three trained physicians, manual and STEPS segmentation grades were not significantly different for the brainstem, spinal canal, parotid (right/left), and optic chiasm (all p > 0.100). In contrast, STEPS segmentation grades were lower for the eyes (p < 0.001). Across all OARs and all physicians, STEPS produced segmentations graded as well as manual contouring at a rate of 83%, giving a lower bound on this rate of 80% with 95% confidence. Reduction in manual interaction time was on average 61% and 93% when automatic segmentations did and did not, respectively, require manual editing.
The STEPS algorithm showed better performance than the STAPLE algorithm in segmenting OARs for radiotherapy of the head and neck. It can automatically produce clinically acceptable segmentation of OARs, with results as relevant as manual contouring for the brainstem, spinal canal, the parotids (left/right), and optic chiasm. A substantial reduction in manual labor was achieved when using STEPS even when manual editing was necessary.
本研究旨在评估与传统的“同时真相与性能水平估计”(STAPLE)算法相比,使用新型的“传播分割的相似性与真相估计”(STEPS)算法能否自动、高效地获得头颈部癌中危及器官(OARs)的临床可接受分割。
首先,2名放射肿瘤学家在100名头颈部癌患者的计划计算机断层扫描图像数据集上勾勒出6个OARs。然后,使用这些手动轮廓,通过STAPLE和STEPS对数据集中的每个图像进行自动分割。接着,使用骰子相似系数(DSC)比较这些自动方法的准确性。其次,在一项盲法实验中,3名经过单独且不同培训的医生将手动和自动分割分为以下三个等级之一:根据通用轮廓指南确定为临床可接受(A级)、经手动编辑后在临床实践中合理可接受(B级)、不可接受(C级)。最后,选择STEPS分割等级为B的图像,由一名医生将其手动编辑为A级,并记录编辑时间。
与STAPLE算法相比,在脑干、脊髓和左/右腮腺等大结构上使用STEPS算法时,DSC有显著提高(所有p<0.001)。此外,在所有3名经过培训的医生中,脑干、脊髓、腮腺(右/左)和视交叉的手动和STEPS分割等级无显著差异(所有p>0.100)。相比之下,眼睛的STEPS分割等级较低(p<0.001)。在所有OARs和所有医生中,STEPS产生的分割等级与手动轮廓的等级相同,比例为83%,在95%置信度下该比例的下限为80%。当自动分割需要和不需要手动编辑时,手动交互时间平均分别减少61%和93%。
在对头颈部放疗的OARs进行分割时,STEPS算法表现优于STAPLE算法。它可以自动生成临床可接受的OARs分割,对于脑干、脊髓、腮腺(左/右)和视交叉,其结果与手动轮廓相当。即使需要手动编辑,使用STEPS时也能大幅减少人工劳动。