Petragallo Rachel, Bertram Pascal, Halvorsen Per, Iftimia Ileana, Low Daniel A, Morin Olivier, Narayanasamy Ganesh, Saenz Daniel L, Sukumar Kevinraj N, Valdes Gilmer, Weinstein Lauren, Wells Michelle C, Ziemer Benjamin P, Lamb James M
Department of Radiation Oncology, University of California Los Angeles, Los Angeles, California, USA.
Brainlab AG, Munich, Germany.
Med Phys. 2023 May;50(5):2662-2671. doi: 10.1002/mp.16359. Epub 2023 Mar 21.
Misalignment to the incorrect vertebral body remains a rare but serious patient safety risk in image-guided radiotherapy (IGRT).
Our group has proposed that an automated image-review algorithm be inserted into the IGRT process as an interlock to detect off-by-one vertebral body errors. This study presents the development and multi-institutional validation of a convolutional neural network (CNN)-based approach for such an algorithm using patient image data from a planar stereoscopic x-ray IGRT system.
X-rays and digitally reconstructed radiographs (DRRs) were collected from 429 spine radiotherapy patients (1592 treatment fractions) treated at six institutions using a stereoscopic x-ray image guidance system. Clinically-applied, physician approved, alignments were used for true-negative, "no-error" cases. "Off-by-one vertebral body" errors were simulated by translating DRRs along the spinal column using a semi-automated method. A leave-one-institution-out approach was used to estimate model accuracy on data from unseen institutions as follows: All of the images from five of the institutions were used to train a CNN model from scratch using a fixed network architecture and hyper-parameters. The size of this training set ranged from 5700 to 9372 images, depending on exactly which five institutions were contributing data. The training set was randomized and split using a 75/25 split into the final training/ validation sets. X-ray/ DRR image pairs and the associated binary labels of "no-error" or "shift" were used as the model input. Model accuracy was evaluated using images from the sixth institution, which were left out of the training phase entirely. This test set ranged from 180 to 3852 images, again depending on which institution had been left out of the training phase. The trained model was used to classify the images from the test set as either "no-error" or "shifted", and the model predictions were compared to the ground truth labels to assess the model accuracy. This process was repeated until each institution's images had been used as the testing dataset.
When the six models were used to classify unseen image pairs from the institution left out during training, the resulting receiver operating characteristic area under the curve values ranged from 0.976 to 0.998. With the specificity fixed at 99%, the corresponding sensitivities ranged from 61.9% to 99.2% (mean: 77.6%). With the specificity fixed at 95%, sensitivities ranged from 85.5% to 99.8% (mean: 92.9%).
This study demonstrated the CNN-based vertebral body misalignment model is robust when applied to previously unseen test data from an outside institution, indicating that this proposed additional safeguard against misalignment is feasible.
在图像引导放射治疗(IGRT)中,与错误的椎体未对准仍然是一种罕见但严重的患者安全风险。
我们团队提出在IGRT流程中插入一种自动图像审查算法作为联锁装置,以检测椎体偏差一个的错误。本研究介绍了一种基于卷积神经网络(CNN)的方法的开发和多机构验证,该方法使用来自平面立体X射线IGRT系统的患者图像数据来实现这种算法。
从六个机构使用立体X射线图像引导系统治疗的429例脊柱放射治疗患者(1592个治疗分次)中收集X射线和数字重建放射影像(DRR)。临床上应用的、经医生批准的对准用于真阴性“无错误”病例。通过使用半自动方法沿脊柱平移DRR来模拟“椎体偏差一个”错误。采用留一机构法来估计在来自未见过的机构的数据上的模型准确性,具体如下:使用来自五个机构的所有图像,使用固定的网络架构和超参数从头开始训练一个CNN模型。根据具体哪五个机构提供数据,该训练集的大小在5700到9372张图像之间。训练集随机化并以75/25的比例划分为最终的训练/验证集。X射线/DRR图像对以及“无错误”或“移位”的相关二元标签用作模型输入。使用完全排除在训练阶段之外的第六个机构的图像来评估模型准确性。这个测试集的大小在180到3852张图像之间,同样取决于哪个机构被排除在训练阶段之外。使用训练好的模型将测试集的图像分类为“无错误”或“移位”,并将模型预测与地面真值标签进行比较以评估模型准确性。重复这个过程,直到每个机构的图像都被用作测试数据集。
当使用这六个模型对训练期间排除的机构的未见图像对进行分类时,得到的曲线下面积的受试者操作特征值范围为0.976至0.998。当特异性固定为99%时,相应的灵敏度范围为61.9%至99.2%(平均值:77.6%)。当特异性固定为95%时,灵敏度范围为85.5%至99.8%(平均值:92.9%)。
本研究表明,基于CNN的椎体未对准模型应用于来自外部机构的先前未见的测试数据时具有鲁棒性,表明所提出的针对未对准的额外保障措施是可行的。