
Single-View Fluoroscopic X-Ray Pose Estimation: A Comparison of Alternative Loss Functions and Volumetric Scene Representations.

Author Information

Zhou Chaochao, Faruqui Syed Hasib Akhter, An Dayeong, Patel Abhinav, Abdalla Ramez N, Hurley Michael C, Shaibani Ali, Potts Matthew B, Jahromi Babak S, Ansari Sameer A, Cantrell Donald R

Affiliations

Department of Radiology, Northwestern Medicine, Northwestern University, Chicago, IL, USA.

Department of Neurology, Northwestern Medicine, Northwestern University, Chicago, IL, USA.

Publication Information

J Imaging Inform Med. 2024 Dec 13. doi: 10.1007/s10278-024-01354-w.

Abstract

Many tasks performed in image-guided procedures can be cast as pose estimation problems, where specific projections are chosen to reach a target in 3D space. In this study, we construct a framework for fluoroscopic pose estimation and compare alternative loss functions and volumetric scene representations. We first develop a differentiable projection (DiffProj) algorithm for the efficient computation of Digitally Reconstructed Radiographs (DRRs) from either Cone-Beam Computerized Tomography (CBCT) or neural scene representations. We introduce two innovative neural scene representations, Neural Tuned Tomography (NeTT) and masked Neural Radiance Fields (mNeRF). Pose estimation is then performed within the framework by iterative gradient descent using loss functions that quantify the image discrepancy of the synthesized DRR with respect to the ground-truth, target fluoroscopic X-ray image. We compared alternative loss functions and volumetric scene representations for pose estimation using a dataset consisting of 50 cranial tomographic X-ray sequences. We find that Mutual Information significantly outperforms alternative loss functions for pose estimation, avoiding entrapment in local optima. The alternative discrete (CBCT) and neural (NeTT and mNeRF) volumetric scene representations yield comparable performance (3D angle errors, mean ≤ 3.2° and 90% quantile ≤ 3.4°); however, the neural scene representations incur a considerable computational expense to train.
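The Mutual Information loss highlighted in the abstract can be illustrated with a minimal NumPy sketch of a joint-histogram MI estimator (the function name, bin count, and binning scheme are assumptions for illustration; the paper's implementation necessarily uses a differentiable variant, since a hard histogram does not admit gradient descent):

```python
import numpy as np

def mutual_information(img_a, img_b, bins=32):
    """Plug-in mutual information estimate between two images.

    Builds a joint intensity histogram, normalizes it to a joint
    distribution p(x, y), and sums p(x, y) * log(p(x, y) / (p(x) p(y)))
    over the non-empty bins. Higher values indicate stronger statistical
    dependence between the two images' intensities.
    """
    hist, _, _ = np.histogram2d(img_a.ravel(), img_b.ravel(), bins=bins)
    pxy = hist / hist.sum()            # joint distribution p(x, y)
    px = pxy.sum(axis=1)               # marginal p(x)
    py = pxy.sum(axis=0)               # marginal p(y)
    outer = px[:, None] * py[None, :]  # independence baseline p(x) p(y)
    nz = pxy > 0                       # skip empty bins (0 * log 0 = 0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / outer[nz])))
```

In a pose-estimation loop of the kind the abstract describes, this quantity would be computed between the synthesized DRR and the target fluoroscopic image and maximized with respect to the pose parameters; MI rewards any consistent intensity relationship rather than exact intensity agreement, which is one intuition for why it escapes local optima that pixel-wise losses fall into.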

