Hindley Nicholas, Keall Paul J
Image X Institute, School of Health Sciences, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia.
Med Phys. 2025 Jul;52(7):e18015. doi: 10.1002/mp.18015.
Real-time image-guided radiation therapy (IGRT) was first clinically implemented more than 25 years ago but is yet to find widespread adoption. Existing approaches to real-time IGRT require dedicated or specialized equipment that is not available in most treatment centers and most techniques focus exclusively on targets without tracking the surrounding organs-at-risk (OARs).
To address the need for inexpensive real-time IGRT, we developed Voxelmap, a deep learning framework that achieves 3D respiratory motion estimation and volumetric imaging using the data and resources already available in standard clinical settings. This framework can also be adapted to other imaging modalities such as MRI-Linacs. In contrast with existing approaches, which constrain the solution space with linear priors, Voxelmap encourages diffeomorphic mappings that are topology-preserving and invertible.
Deformable image registration and forward-projection or slice extraction were used to generate patient-specific training datasets of 3D deformation vector fields (DVFs) and 2D images (or k-space data) from pretreatment 4D-CT or 4D-MRI scans. The XCAT and CoMBAT digital phantoms and SPARE Grand Challenge Dataset provided synthetic and patient data, respectively. Five network architectures were used to predict 3D DVFs from 2D imaging data. Networks A-C were trained on x-ray images, Network D was trained on MR images and Network E was trained on k-space data. Using Voxelmap, network-generated 3D DVFs were used to warp both structures contoured on the peak-exhale pretreatment image and the image itself to enable simultaneous target and OAR tracking and volumetric imaging. Using the standard-of-care approach, contours were expanded to internal target volumes.
Validating on digital phantom data for x-ray guided treatments of cardiac arrhythmia, mean Dice similarity between predicted and ground-truth target and OAR contours for Networks A-C ranged from 0.81 ± 0.05 to 0.82 ± 0.05 and 0.78 ± 0.04 to 0.81 ± 0.04, respectively, while target centroid error ranged from 2.0 ± 0.5 to 2.3 ± 0.9 mm. For MRI-based digital phantom data, mean Dice similarity for target and OAR contours was 0.91 ± 0.06 and 0.90 ± 0.02 for both Networks D and E, while target centroid error ranged from 1.7 ± 0.8 to 1.8 ± 0.8 mm. For x-ray-based lung cancer patient data, mean Dice similarity for target and OAR contours for Networks A-C ranged from 0.86 ± 0.05 to 0.89 ± 0.04 and 0.94 ± 0.01 to 0.97 ± 0.01, respectively. However, in terms of target centroid error, only Network A outperformed an ITV-based approach at 1.8 ± 0.7 mm while Networks B and C exhibited large errors of 2.7 ± 1.2 to 3.5 ± 1.4 mm, respectively. Target volumes dynamically shifted using Voxelmap were 31 % smaller than the standard-of-care.
Voxelmap provides a generalized, open-source tool for intrafraction respiratory motion monitoring and volumetric imaging. Comparing tracking errors across synthetic and patient data revealed that certain network architectures are more robust to the scatter and noise profiles encountered in typical clinical settings. These learnings will inform future developments in real-time motion tracking. Our code is available at https://github.com/Image-X-Institute/Voxelmap .
实时图像引导放射治疗(IGRT)在25多年前首次临床应用,但尚未得到广泛采用。现有的实时IGRT方法需要专用或特殊设备,大多数治疗中心并不具备,而且大多数技术仅专注于靶区,而不追踪周围的危及器官(OARs)。
为满足对廉价实时IGRT的需求,我们开发了Voxelmap,这是一个深度学习框架,可利用标准临床环境中已有的数据和资源实现三维呼吸运动估计和容积成像。该框架还可适用于其他成像模态,如MRI直线加速器。与现有方法通过线性先验约束解空间不同,Voxelmap鼓励进行拓扑保持和可逆的微分同胚映射。
使用可变形图像配准和前向投影或切片提取,从预处理的四维CT或四维MRI扫描中生成患者特异性的三维变形矢量场(DVFs)和二维图像(或k空间数据)训练数据集。XCAT和CoMBAT数字体模以及SPARE重大挑战数据集分别提供了合成数据和患者数据。使用五种网络架构从二维成像数据预测三维DVFs。网络A - C在X射线图像上训练,网络D在MR图像上训练,网络E在k空间数据上训练。使用Voxelmap,网络生成的三维DVFs用于对呼气末预处理图像上勾勒的结构以及图像本身进行变形,以实现同时追踪靶区和OARs以及容积成像。使用标准治疗方法,将轮廓扩展为内部靶区体积。
在用于心律失常X射线引导治疗的数字体模数据上进行验证时,网络A - C预测的靶区和OAR轮廓与真实轮廓之间的平均骰子相似系数分别在0.81±0.05至0.82±0.05以及0.78±0.04至0.81±0.04之间,而靶区质心误差在2.0±0.5至2.3±0.9毫米之间。对于基于MRI的数字体模数据,网络D和E的靶区和OAR轮廓的平均骰子相似系数分别为0.91±0.06和0.90±0.02,而靶区质心误差在1.7±0.8至1.8±0.8毫米之间。对于基于X射线的肺癌患者数据,网络A - C的靶区和OAR轮廓的平均骰子相似系数分别在0.86±0.05至0.89±(此处原文有误,应为0.89±0.04)以及0.94±0.01至0.97±0.01之间。然而,在靶区质心误差方面,只有网络A的表现优于基于内部靶区体积(ITV)的方法,误差为1.8±0.7毫米,而网络B和C分别表现出2.7±1.2至3.5±1.4毫米的较大误差。使用Voxelmap动态移动的靶区体积比标准治疗方法小31%。
Voxelmap为分次内呼吸运动监测和容积成像提供了一个通用的开源工具。比较合成数据和患者数据的追踪误差表明,某些网络架构对典型临床环境中遇到的散射和噪声分布更具鲁棒性。这些经验将为实时运动追踪的未来发展提供参考。我们的代码可在https://github.com/Image-X-Institute/Voxelmap获取。