Department of Radiation Oncology, Niigata University Medical and Dental Hospital, 1-754 Asahimachi-dori, Chuo-ku, Niigata, 951-8520, Japan.
Radiation Therapy Section, Department of Clinical Support, Hiroshima University Hospital, 1-2-3 Kasumi, Minami-ku, Hiroshima, 734-8551, Japan.
Med Phys. 2021 Mar;48(3):991-1002. doi: 10.1002/mp.14699. Epub 2021 Jan 27.
We sought to develop machine learning models to detect multileaf collimator (MLC) modeling errors with the use of radiomic features of fluence maps measured in patient-specific quality assurance (QA) for intensity-modulated radiation therapy (IMRT) with an electric portal imaging device (EPID).
Fluence maps measured with EPID for 38 beams from 19 clinical IMRT plans were assessed. Plans with various degrees of error in MLC modeling parameters [i.e., MLC transmission factor (TF) and dosimetric leaf gap (DLG)] and plans with an MLC positional error for comparison were created. For a total of 152 error plans for each type of error, we calculated fluence difference maps for each beam by subtracting the calculated maps from the measured maps. A total of 837 radiomic features were extracted from each fluence difference map, and we determined the number of features used for the training dataset in the machine learning models by using random forest regression. Machine learning models using the five typical algorithms [decision tree, k-nearest neighbor (kNN), support vector machine (SVM), logistic regression, and random forest] for binary classification between the error-free plan and the plan with the corresponding error for each type of error were developed. We used part of the total dataset to perform fourfold cross-validation to tune the models, and we used the remaining test dataset to evaluate the performance of the developed models. A gamma analysis was also performed between the measured and calculated fluence maps with the criteria of 3%/2 and 2%/2 mm for all of the types of error.
The radiomic features and its optimal number were similar for the models for the TF and the DLG error detection, which was different from the MLC positional error. The highest sensitivity was obtained as 0.913 for the TF error with SVM and logistic regression, 0.978 for the DLG error with kNN and SVM, and 1.000 for the MLC positional error with kNN, SVM, and random forest. The highest specificity was obtained as 1.000 for the TF error with a decision tree, SVM, and logistic regression, 1.000 for the DLG error with a decision tree, logistic regression, and random forest, and 0.909 for the MLC positional error with a decision tree and logistic regression. The gamma analysis showed the poorest performance in which sensitivities were 0.737 for the TF error and the DLG error and 0.882 for the MLC positional error for 3%/2 mm. The addition of another type of error to fluence maps significantly reduced the sensitivity for the TF and the DLG error, whereas no effect was observed for the MLC positional error detection.
Compared to the conventional gamma analysis, the radiomics-based machine learning models showed higher sensitivity and specificity in detecting a single type of the MLC modeling error and the MLC positional error. Although the developed models need further improvement for detecting multiple types of error, radiomics-based IMRT QA was shown to be a promising approach for detecting the MLC modeling error.
我们旨在利用电子射野影像装置(EPID)在调强放疗(IMRT)中测量的通量图的放射组学特征,开发一种机器学习模型来检测多叶准直器(MLC)建模误差。
评估了 19 个临床 IMRT 计划中 38 束的 EPID 测量的通量图。为了创建具有不同程度 MLC 建模参数误差(即 MLC 透射因子(TF)和剂量学叶片间隙(DLG))的计划和具有 MLC 位置误差的计划,我们为每种类型的误差计算了 152 个误差计划。对于每个通量差异图,我们通过从测量的通量图中减去计算的通量图来计算通量差图。从每个通量差图中提取了 837 个放射组学特征,并使用随机森林回归确定了机器学习模型中用于训练数据集的特征数量。使用五种典型算法(决策树、k-近邻(kNN)、支持向量机(SVM)、逻辑回归和随机森林)为每种类型的误差开发了用于二元分类的错误计划和具有相应误差的计划的机器学习模型。我们使用部分总数据集进行四轮交叉验证来调整模型,并使用剩余的测试数据集来评估开发模型的性能。对于所有类型的误差,我们还使用 3%/2 和 2%/2mm 的标准对测量和计算的通量图进行了伽马分析。
对于 TF 和 DLG 误差检测的模型,放射组学特征及其最佳数量相似,而对于 MLC 位置误差则不同。对于 TF 误差,SVM 和逻辑回归的最高灵敏度为 0.913,kNN 和 SVM 对于 DLG 误差的最高灵敏度为 0.978,kNN、SVM 和随机森林对于 MLC 位置误差的最高灵敏度为 1.000。对于 TF 误差,决策树、SVM 和逻辑回归的最高特异性为 1.000,对于 DLG 误差,决策树、逻辑回归和随机森林的最高特异性为 1.000,对于 MLC 位置误差,决策树和逻辑回归的最高特异性为 0.909。伽马分析显示,在 3%/2mm 时,TF 误差和 DLG 误差的灵敏度分别为 0.737 和 0.882,MLC 位置误差的灵敏度为 0.882,性能最差。在通量图中添加另一种类型的误差会显著降低 TF 和 DLG 误差的灵敏度,而对 MLC 位置误差检测则没有影响。
与传统的伽马分析相比,基于放射组学的机器学习模型在检测单一类型的 MLC 建模误差和 MLC 位置误差方面具有更高的灵敏度和特异性。尽管开发的模型需要进一步改进以检测多种类型的误差,但基于放射组学的 IMRT QA 被证明是检测 MLC 建模误差的一种很有前途的方法。