放射治疗中多部位轮廓自动分割工具的临床应用与评估
Clinical Implementation and Evaluation of Auto-Segmentation Tools for Multi-Site Contouring in Radiotherapy.
作者信息
Heilemann Gerd, Buschmann Martin, Lechner Wolfgang, Dick Vincent, Eckert Franziska, Heilmann Martin, Herrmann Harald, Moll Matthias, Knoth Johannes, Konrad Stefan, Simek Inga-Malin, Thiele Christopher, Zaharie Alexandru, Georg Dietmar, Widder Joachim, Trnkova Petra
机构信息
Department of Radiation Oncology, Comprehensive Cancer Center Vienna, Medical University Vienna, Vienna, Austria.
出版信息
Phys Imaging Radiat Oncol. 2023 Nov 17;28:100515. doi: 10.1016/j.phro.2023.100515. eCollection 2023 Oct.
BACKGROUND AND PURPOSE
Tools for auto-segmentation in radiotherapy are widely available, but guidelines for clinical implementation are missing. The goal was to develop a workflow for performance evaluation of three commercial auto-segmentation tools to select one candidate for clinical implementation.
MATERIALS AND METHODS
One hundred patients with six treatment sites (brain, head-and-neck, thorax, abdomen, and pelvis) were included. Three sets of AI-based contours for organs-at-risk (OAR) generated by three software tools and manually drawn expert contours were blindly rated for contouring accuracy. The dice similarity coefficient (DSC), the Hausdorff distance, and a dose/volume evaluation based on the recalculation of the original treatment plan were assessed. Statistically significant differences were tested using the Kruskal-Wallis test and the post-hoc Dunn Test with Bonferroni correction.
RESULTS
The mean DSC scores compared to expert contours for all OARs combined were 0.80 ± 0.10, 0.75 ± 0.10, and 0.74 ± 0.11 for the three software tools. Physicians' rating identified equivalent or superior performance of some AI-based contours in head (eye, lens, optic nerve, brain, chiasm), thorax (e.g., heart and lungs), and pelvis and abdomen (e.g., kidney, femoral head) compared to manual contours. For some OARs, the AI models provided results requiring only minor corrections. Bowel-bag and stomach were not fit for direct use. During the interdisciplinary discussion, the physicians' rating was considered the most relevant.
CONCLUSION
A comprehensive method for evaluation and clinical implementation of commercially available auto-segmentation software was developed. The in-depth analysis yielded clear instructions for clinical use within the radiotherapy department.
背景与目的
放疗中的自动分割工具广泛可得,但缺乏临床实施指南。目标是开发一种工作流程,用于对三种商用自动分割工具进行性能评估,以选择一种用于临床实施的候选工具。
材料与方法
纳入了100例患有六个治疗部位(脑、头颈部、胸部、腹部和骨盆)的患者。由三种软件工具生成的三组基于人工智能的危及器官(OAR)轮廓以及手动绘制的专家轮廓被进行盲法评分以评估轮廓准确性。评估了骰子相似系数(DSC)、豪斯多夫距离以及基于原始治疗计划重新计算的剂量/体积评估。使用Kruskal-Wallis检验和带有Bonferroni校正的事后Dunn检验来检验统计学上的显著差异。
结果
对于所有合并的OAR,三种软件工具与专家轮廓相比的平均DSC分数分别为0.80±0.10、0.75±0.10和0.74±0.11。医生的评分表明,与手动轮廓相比,某些基于人工智能的轮廓在头部(眼睛、晶状体、视神经、脑、视交叉)、胸部(例如心脏和肺)以及骨盆和腹部(例如肾脏、股骨头)具有同等或更优的性能。对于一些OAR,人工智能模型提供的结果仅需进行 minor corrections。肠袋和胃不适合直接使用。在跨学科讨论中,医生的评分被认为是最相关的。
结论
开发了一种用于评估和临床实施商用自动分割软件的综合方法。深入分析为放疗科的临床使用提供了明确的指导。