Li Xinyi, Wu Q Jackie, Wu Qiuwen, Wang Chunhao, Sheng Yang, Wang Wentao, Stephens Hunter, Yin Fang-Fang, Ge Yaorong
Duke University Medical Center, United States of America.
The University of North Carolina at Chapel Hill, United States of America.
Phys Med Biol. 2021 Nov 26;66(23). doi: 10.1088/1361-6560/ac3841.
We have previously reported an artificial intelligence (AI) agent that automatically generates intensity-modulated radiation therapy (IMRT) plans via fluence map prediction, by-passing inverse planning. This AI agent achieved clinically comparable quality for prostate cases, but its performance on head-and-neck patients leaves room for improvement. This study aims to collect insights of the deep-learning-based (DL-based) fluence map prediction model by systematically analyzing its prediction errors.From the modeling perspective, the DL model's output is the fluence maps of IMRT plans. However, from the clinical planning perspective, the plan quality evaluation should be based on the clinical dosimetric criteria such as dose-volume histograms. To account for the complex and non-intuitive relationships between fluence map prediction errors and the corresponding dose distribution changes, we propose a novel error analysis approach that systematically examines plan dosimetric changes that are induced by varying amounts of fluence prediction errors. We investigated four decomposition modes of model prediction errors. The two spatial domain decompositions are based on fluence intensity and fluence gradient. The two frequency domain decompositions are based on Fourier-space banded frequency rings and Fourier-space truncated low-frequency disks. The decomposed error was analyzed for its impact on the resulting plans' dosimetric metrics. The analysis was conducted on 15 test cases spared from the 200 training and 16 validation cases used to train the model.Most planning target volume metrics were significantly correlated with most error decompositions. The Fourier space disk radii had the largest Spearman's coefficients. The low-frequency region within a disk of ∼20% Fourier space contained most of errors that impact overall plan quality.This study demonstrates the feasibility of using fluence map prediction error analysis to understand the AI agent's performance. Such insights will help fine-tune the DL models in architecture design and loss function selection.
我们之前报道过一种人工智能(AI)代理,它通过通量图预测自动生成调强放射治疗(IMRT)计划,绕过逆向计划。该AI代理在前列腺病例中实现了临床可比的质量,但其在头颈患者中的表现仍有改进空间。本研究旨在通过系统分析基于深度学习(DL)的通量图预测模型的预测误差,收集相关见解。从建模角度来看,DL模型的输出是IMRT计划的通量图。然而,从临床计划角度来看,计划质量评估应基于诸如剂量体积直方图等临床剂量学标准。为了考虑通量图预测误差与相应剂量分布变化之间复杂且不直观的关系,我们提出了一种新颖的误差分析方法,该方法系统地检查由不同程度的通量预测误差引起的计划剂量学变化。我们研究了模型预测误差的四种分解模式。两种空间域分解基于通量强度和通量梯度。两种频域分解基于傅里叶空间带状频率环和傅里叶空间截断低频盘。分析分解后的误差对所得计划的剂量学指标的影响。分析是在用于训练模型的200个训练案例和16个验证案例之外留出的15个测试案例上进行的。大多数计划靶区体积指标与大多数误差分解显著相关。傅里叶空间盘半径具有最大的斯皮尔曼系数。在约20%傅里叶空间的盘内低频区域包含了影响总体计划质量的大部分误差。本研究证明了使用通量图预测误差分析来理解AI代理性能的可行性。这些见解将有助于在架构设计和损失函数选择中对DL模型进行微调。