Wang Bin, Deng Fei, Jiang Peifan, Wang Shuang, Han Xiao, Zhang Zhixuan
College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu, 610059, Sichuan, China.
College of Geophysics, Chengdu University of Technology, Chengdu, 610059, Sichuan, China.
Sci Rep. 2024 Oct 26;14(1):25525. doi: 10.1038/s41598-024-76886-w.
Low-dose computed tomography (LDCT) has emerged as the preferred technology for diagnostic medical imaging due to the potential health risks associated with X-ray radiation and conventional computed tomography (CT) techniques. While LDCT utilizes a lower radiation dose compared to standard CT, it results in increased image noise, which can impair the accuracy of diagnoses. To mitigate this issue, advanced deep learning-based LDCT denoising algorithms have been developed. These primarily utilize Convolutional Neural Networks (CNNs) or Transformer Networks and often employ the Unet architecture, which enhances image detail by integrating feature maps from the encoder and decoder via skip connections. However, existing methods focus excessively on the optimization of the encoder and decoder structures while overlooking potential enhancements to the Unet architecture itself. This oversight can be problematic due to significant differences in feature map characteristics between the encoder and decoder, where simple fusion strategies may hinder effective image reconstruction. In this paper, we introduce WiTUnet, a novel LDCT image denoising method that utilizes nested, dense skip pathway in place of traditional skip connections to improve feature integration. Additionally, to address the high computational demands of conventional Transformers on large images, WiTUnet incorporates a windowed Transformer structure that processes images in smaller, non-overlapping segments, significantly reducing computational load. Moreover, our approach includes a Local Image Perception Enhancement (LiPe) module within both the encoder and decoder to replace the standard multi-layer perceptron (MLP) in Transformers, thereby improving the capture and representation of local image features. Through extensive experimental comparisons, WiTUnet has demonstrated superior performance over existing methods in critical metrics such as Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM), and Root Mean Square Error (RMSE), significantly enhancing noise removal and image quality. The code is available on github https://github.com/woldier/WiTUNet .
由于与X射线辐射和传统计算机断层扫描(CT)技术相关的潜在健康风险,低剂量计算机断层扫描(LDCT)已成为诊断医学成像的首选技术。虽然与标准CT相比,LDCT使用的辐射剂量较低,但它会导致图像噪声增加,这可能会损害诊断的准确性。为了缓解这个问题,已经开发了基于深度学习的先进LDCT去噪算法。这些算法主要利用卷积神经网络(CNN)或Transformer网络,并且经常采用Unet架构,该架构通过跳跃连接整合编码器和解码器的特征图来增强图像细节。然而,现有方法过度关注编码器和解码器结构的优化,而忽略了Unet架构本身的潜在改进。由于编码器和解码器之间特征图特征存在显著差异,这种疏忽可能会产生问题,简单的融合策略可能会阻碍有效的图像重建。在本文中,我们介绍了WiTUnet,一种新颖的LDCT图像去噪方法,它利用嵌套的密集跳跃路径代替传统的跳跃连接来改善特征整合。此外,为了解决传统Transformer对大图像的高计算需求,WiTUnet采用了一种窗口化Transformer结构,该结构以较小的、不重叠的片段处理图像,显著降低了计算负荷。此外,我们的方法在编码器和解码器中都包含一个局部图像感知增强(LiPe)模块,以取代Transformer中的标准多层感知器(MLP),从而改善局部图像特征的捕获和表示。通过广泛的实验比较,WiTUnet在诸如峰值信噪比(PSNR)、结构相似性(SSIM)和均方根误差(RMSE)等关键指标上表现出优于现有方法的性能,显著提高了去噪效果和图像质量。代码可在github https://github.com/woldier/WiTUNet 上获取。