Zhang Yuanke, Xu Fan, Zhang Rui, Guo Yanfei, Wang Hanxiang, Wei Bingbing, Ma Fei, Meng Jing, Liu Jianlei, Lu Hongbing, Chen Yang
School of Computer Science, Qufu Normal University, Rizhao 276800, People's Republic of China.
School of Biomedical Engineering, Air Force Medical University, Xi'an, Shaanxi 710032, People's Republic of China.
Phys Med Biol. 2025 Jun 6;70(11). doi: 10.1088/1361-6560/addea6.
. Low-dose computed tomography (LDCT) effectively reduces radiation exposure to patients, but introduces severe noise artifacts that affect diagnostic accuracy. Recently, Transformer-based network architectures have been widely applied to LDCT image denoising, generally achieving superior results compared to traditional convolutional methods. However, these methods are often hindered by high computational costs and struggles in capturing complex local contextual features, which negatively impact denoising performance. In this work, we propose CT-Denoimer, an efficient CT Denoising Transformer network that captures both global correlations and intricate, spatially varying local contextual details in CT images, enabling the generation of high-quality images. The core of our framework is a Transformer module that consists of two key components: the multi-Dconv head transposed attention (MDTA) and the mixed contextual feed-forward network (MCFN). The MDTA block captures global correlations in the image with linear computational complexity, while the MCFN block manages multi-scale local contextual information, both static and dynamic, through a series of Enhanced Contextual Transformer modules. In addition, we incorporate operation-wise attention layers to enable collaborative refinement in the proposed CT-Denoimer, enhancing its ability to more effectively handle complex and varying noise patterns in LDCT images. Extensive experimental validation on both the AAPM-Mayo public dataset and a real-world clinical dataset demonstrated the state-of-the-art performance of the proposed CT-Denoimer. It achieved a peak signal-to-noise ratio of 33.681 dB, a structural similarity index measure of 0.921, an information fidelity criterion of 2.857 and a visual information fidelity of 0.349. Subjective assessment by radiologists gave an average score of 4.39, confirming its clinical applicability and clear advantages over existing methods. This study presents an innovative CT denoising Transformer network that sets a new benchmark in LDCT image denoising, excelling in both noise reduction and fine structure preservation.