Brooks Jamison, Tryggestad Erik, Anand Aman, Beltran Chris, Foote Robert, Lucido J John, Laack Nadia N, Routman David, Patel Samir H, Seetamsetty Srinivas, Moseley Douglas
Department of Radiation Oncology, Mayo Clinic Rochester, Rochester, MN, United States.
Department of Radiation Oncology, Mayo Clinic Arizona, Phoenix, AZ, United States.
Front Oncol. 2024 Feb 29;14:1295251. doi: 10.3389/fonc.2024.1295251. eCollection 2024.
Manual review of organ at risk (OAR) contours is crucial for creating safe radiotherapy plans but can be time-consuming and error prone. Statistical and deep learning models show the potential to automatically detect improper contours by identifying outliers using large sets of acceptable data (knowledge-based outlier detection) and may be able to assist human reviewers during review of OAR contours.
This study developed an automated knowledge-based outlier detection method and assessed its ability to detect erroneous contours for all common head and neck (HN) OAR types used clinically at our institution. We utilized 490 accurate CT-based HN structure sets from unique patients, each with forty-two HN OAR contours when anatomically present. The structure sets were distributed as 80% for training, 10% for validation, and 10% for testing. In addition, 190 and 37 simulated contours containing errors were added to the validation and test sets, respectively. Single-contour features, including location, shape, orientation, volume, and CT number, were used to train three single-contour feature models (z-score, Mahalanobis distance [MD], and autoencoder [AE]). Additionally, a novel contour-to-contour relationship (CCR) model was trained using the minimum distance and volumetric overlap between pairs of OAR contours to quantify overlap and separation. Inferences from single-contour feature models were combined with the CCR model inferences and inferences evaluating the number of disconnected parts in a single contour and then compared.
In the test dataset, before combination with the CCR model, the area under the curve values were 0.922/0.939/0.939 for the z-score, MD, and AE models respectively for all contours. After combination with CCR model inferences, the z-score, MD, and AE had sensitivities of 0.838/0.892/0.865, specificities of 0.922/0.907/0.887, and balanced accuracies (BA) of 0.880/0.900/0.876 respectively. In the validation dataset, with similar overall performance and no signs of overfitting, model performance for individual OAR types was assessed. The combined AE model demonstrated minimum, median, and maximum BAs of 0.729, 0.908, and 0.980 across OAR types.
Our novel knowledge-based method combines models utilizing single-contour and CCR features to effectively detect erroneous OAR contours across a comprehensive set of 42 clinically used OAR types for HN radiotherapy.
对危及器官(OAR)轮廓进行人工检查对于制定安全的放射治疗计划至关重要,但可能耗时且容易出错。统计和深度学习模型显示出通过使用大量可接受数据识别异常值(基于知识的异常值检测)来自动检测不当轮廓的潜力,并且可能能够在OAR轮廓检查期间协助人工审阅者。
本研究开发了一种基于知识的自动异常值检测方法,并评估了其检测本机构临床使用的所有常见头颈部(HN)OAR类型错误轮廓的能力。我们使用了来自独特患者的490个基于CT的准确HN结构集,每个结构集在解剖学上存在时具有42个HN OAR轮廓。这些结构集按80%用于训练、10%用于验证、10%用于测试进行分配。此外,分别向验证集和测试集添加了190个和37个包含错误的模拟轮廓。使用包括位置、形状、方向、体积和CT值在内的单轮廓特征来训练三个单轮廓特征模型(z分数、马氏距离[MD]和自动编码器[AE])。此外,使用OAR轮廓对之间的最小距离和体积重叠训练了一种新颖的轮廓到轮廓关系(CCR)模型,以量化重叠和分离。将单轮廓特征模型的推理与CCR模型的推理以及评估单个轮廓中不连续部分数量的推理相结合,然后进行比较。
在测试数据集中,在与CCR模型组合之前,z分数、MD和AE模型对于所有轮廓的曲线下面积值分别为0.922/0.939/0.939。与CCR模型推理组合后,z分数、MD和AE的灵敏度分别为0.838/0.892/0.865,特异性分别为0.922/0.907/0.887,平衡准确率(BA)分别为0.880/0.900/0.876。在验证数据集中,在具有相似的整体性能且没有过拟合迹象的情况下,评估了单个OAR类型的模型性能。组合后的AE模型在所有OAR类型中的最小、中位数和最大BA分别为0.729、0.908和0.980。
我们新颖的基于知识的方法结合了利用单轮廓和CCR特征的模型,以有效检测HN放射治疗临床使用的42种OAR类型中的错误OAR轮廓。