Ramesh Jayroop, Sankalpa Donthi, Mitra Rohan, Dhou Salam
Department of Computer Science and EngineeringAmerican University of Sharjah Sharjah 26666 UAE.
IEEE Open J Eng Med Biol. 2024 Sep 11;6:61-67. doi: 10.1109/OJEMB.2024.3459622. eCollection 2025.
Respiration-correlated cone-beam computed tomography (4D-CBCT) is an X-ray-based imaging modality that uses reconstruction algorithms to produce time-varying volumetric images of moving anatomy over a cycle of respiratory motion. The quality of the produced images is affected by the number of CBCT projections available for reconstruction. Interpolation techniques have been used to generate intermediary projections to be used, along with the original projections, for reconstruction. Transfer learning is a powerful approach that harnesses the ability to reuse pre-trained models in solving new problems. Several state-of-the-art pre-trained deep learning models, used for video frame interpolation, are utilized in this work to generate intermediary projections. Moreover, a novel regression predictive modeling approach is also proposed to achieve the same objective. Digital phantom and clinical datasets are used to evaluate the performance of the models. The results show that the Real-Time Intermediate Flow Estimation (RIFE) algorithm outperforms the others in terms of the Structural Similarity Index Method (SSIM): 0.986 [Formula: see text] 0.010, Peak Signal to Noise Ratio (PSNR): 44.13 [Formula: see text] 2.76, and Mean Square Error (MSE): 18.86 [Formula: see text] 206.90 across all datasets. Moreover, the interpolated projections were used along with the original ones to reconstruct a 4D-CBCT image that was compared to that reconstructed from the original projections only. The reconstructed image using the proposed approach was found to minimize the streaking artifacts, thereby enhancing the image quality. This work demonstrates the advantage of using general-purpose transfer learning algorithms in 4D-CBCT image enhancement.
呼吸相关锥形束计算机断层扫描(4D-CBCT)是一种基于X射线的成像模态,它使用重建算法在呼吸运动周期内生成移动解剖结构的随时间变化的体积图像。所生成图像的质量受可用于重建的CBCT投影数量的影响。插值技术已被用于生成中间投影,以便与原始投影一起用于重建。迁移学习是一种强大的方法,它利用重用预训练模型来解决新问题的能力。在这项工作中,使用了几个用于视频帧插值的最先进的预训练深度学习模型来生成中间投影。此外,还提出了一种新颖的回归预测建模方法来实现相同的目标。使用数字体模和临床数据集来评估模型的性能。结果表明,在所有数据集上,实时中间流估计(RIFE)算法在结构相似性指数方法(SSIM)方面表现优于其他算法:0.986±0.010,峰值信噪比(PSNR):44.13±2.76,均方误差(MSE):18.86±206.90。此外,将插值投影与原始投影一起用于重建4D-CBCT图像,并将其与仅从原始投影重建的图像进行比较。发现使用所提出方法重建的图像能使条纹伪影最小化,从而提高图像质量。这项工作证明了在4D-CBCT图像增强中使用通用迁移学习算法的优势。