WiTUnet：一种集成卷积神经网络（CNN）和Transformer的U型架构，用于改进特征对齐和局部信息融合。

WiTUnet: A U-shaped architecture integrating CNN and Transformer for improved feature alignment and local information fusion.

作者信息

Wang Bin, Deng Fei, Jiang Peifan, Wang Shuang, Han Xiao, Zhang Zhixuan

机构信息

College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu, 610059, Sichuan, China.

College of Geophysics, Chengdu University of Technology, Chengdu, 610059, Sichuan, China.

出版信息

Sci Rep. 2024 Oct 26;14(1):25525. doi: 10.1038/s41598-024-76886-w.

DOI:10.1038/s41598-024-76886-w

PMID:39462127

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11512998/

Abstract

Low-dose computed tomography (LDCT) has emerged as the preferred technology for diagnostic medical imaging due to the potential health risks associated with X-ray radiation and conventional computed tomography (CT) techniques. While LDCT utilizes a lower radiation dose compared to standard CT, it results in increased image noise, which can impair the accuracy of diagnoses. To mitigate this issue, advanced deep learning-based LDCT denoising algorithms have been developed. These primarily utilize Convolutional Neural Networks (CNNs) or Transformer Networks and often employ the Unet architecture, which enhances image detail by integrating feature maps from the encoder and decoder via skip connections. However, existing methods focus excessively on the optimization of the encoder and decoder structures while overlooking potential enhancements to the Unet architecture itself. This oversight can be problematic due to significant differences in feature map characteristics between the encoder and decoder, where simple fusion strategies may hinder effective image reconstruction. In this paper, we introduce WiTUnet, a novel LDCT image denoising method that utilizes nested, dense skip pathway in place of traditional skip connections to improve feature integration. Additionally, to address the high computational demands of conventional Transformers on large images, WiTUnet incorporates a windowed Transformer structure that processes images in smaller, non-overlapping segments, significantly reducing computational load. Moreover, our approach includes a Local Image Perception Enhancement (LiPe) module within both the encoder and decoder to replace the standard multi-layer perceptron (MLP) in Transformers, thereby improving the capture and representation of local image features. Through extensive experimental comparisons, WiTUnet has demonstrated superior performance over existing methods in critical metrics such as Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM), and Root Mean Square Error (RMSE), significantly enhancing noise removal and image quality. The code is available on github https://github.com/woldier/WiTUNet .

摘要

由于与X射线辐射和传统计算机断层扫描（CT）技术相关的潜在健康风险，低剂量计算机断层扫描（LDCT）已成为诊断医学成像的首选技术。虽然与标准CT相比，LDCT使用的辐射剂量较低，但它会导致图像噪声增加，这可能会损害诊断的准确性。为了缓解这个问题，已经开发了基于深度学习的先进LDCT去噪算法。这些算法主要利用卷积神经网络（CNN）或Transformer网络，并且经常采用Unet架构，该架构通过跳跃连接整合编码器和解码器的特征图来增强图像细节。然而，现有方法过度关注编码器和解码器结构的优化，而忽略了Unet架构本身的潜在改进。由于编码器和解码器之间特征图特征存在显著差异，这种疏忽可能会产生问题，简单的融合策略可能会阻碍有效的图像重建。在本文中，我们介绍了WiTUnet，一种新颖的LDCT图像去噪方法，它利用嵌套的密集跳跃路径代替传统的跳跃连接来改善特征整合。此外，为了解决传统Transformer对大图像的高计算需求，WiTUnet采用了一种窗口化Transformer结构，该结构以较小的、不重叠的片段处理图像，显著降低了计算负荷。此外，我们的方法在编码器和解码器中都包含一个局部图像感知增强（LiPe）模块，以取代Transformer中的标准多层感知器（MLP），从而改善局部图像特征的捕获和表示。通过广泛的实验比较，WiTUnet在诸如峰值信噪比（PSNR）、结构相似性（SSIM）和均方根误差（RMSE）等关键指标上表现出优于现有方法的性能，显著提高了去噪效果和图像质量。代码可在github https://github.com/woldier/WiTUNet 上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8996/11512998/9727b02c72b7/41598_2024_76886_Fig1_HTML.jpg

相似文献

WiTUnet: A U-shaped architecture integrating CNN and Transformer for improved feature alignment and local information fusion.WiTUnet：一种集成卷积神经网络（CNN）和Transformer的U型架构，用于改进特征对齐和局部信息融合。

Sci Rep. 2024 Oct 26;14(1):25525. doi: 10.1038/s41598-024-76886-w.

STEDNet: Swin transformer-based encoder-decoder network for noise reduction in low-dose CT.STEDNet：基于 Swin Transformer 的编解码网络，用于降低低剂量 CT 中的噪声。

Med Phys. 2023 Jul;50(7):4443-4458. doi: 10.1002/mp.16249. Epub 2023 Feb 9.

A novel denoising method for low-dose CT images based on transformer and CNN.基于Transformer 和 CNN 的低剂量 CT 图像新型去噪方法。

Comput Biol Med. 2023 Sep;163:107162. doi: 10.1016/j.compbiomed.2023.107162. Epub 2023 Jun 8.

HCformer: Hybrid CNN-Transformer for LDCT Image Denoising.HCformer：用于 LDCT 图像去噪的混合 CNN-Transformer。

J Digit Imaging. 2023 Oct;36(5):2290-2305. doi: 10.1007/s10278-023-00842-9. Epub 2023 Jun 29.

A new visual State Space Model for low-dose CT denoising.一种用于低剂量CT去噪的新型视觉状态空间模型。

Med Phys. 2024 Dec;51(12):8851-8864. doi: 10.1002/mp.17387. Epub 2024 Sep 4.

Structure-preserving low-dose computed tomography image denoising using a deep residual adaptive global context attention network.使用深度残差自适应全局上下文注意力网络的结构保留低剂量计算机断层扫描图像去噪

Quant Imaging Med Surg. 2023 Oct 1;13(10):6528-6545. doi: 10.21037/qims-23-194. Epub 2023 Sep 14.

ETUNet:Exploring efficient transformer enhanced UNet for 3D brain tumor segmentation.ETUNet：探索高效的基于Transformer 的增强型 UNet 进行 3D 脑肿瘤分割。

Comput Biol Med. 2024 Mar;171:108005. doi: 10.1016/j.compbiomed.2024.108005. Epub 2024 Jan 23.

Low-dose CT denoising with a high-level feature refinement and dynamic convolution network.基于高级特征细化和动态卷积网络的低剂量 CT 去噪。

Med Phys. 2023 Jun;50(6):3597-3611. doi: 10.1002/mp.16175. Epub 2023 Jan 7.

A Review of deep learning methods for denoising of medical low-dose CT images.深度学习方法在医学低剂量 CT 图像去噪中的研究进展。

Comput Biol Med. 2024 Mar;171:108112. doi: 10.1016/j.compbiomed.2024.108112. Epub 2024 Feb 15.

Spatial adaptive and transformer fusion network (STFNet) for low-count PET blind denoising with MRI.基于 MRI 的低计数 PET 盲去噪的空间自适应和变换融合网络（STFNet）

Med Phys. 2022 Jan;49(1):343-356. doi: 10.1002/mp.15368. Epub 2021 Dec 10.

引用本文的文献

An enhanced image restoration using deep learning and transformer based contextual optimization algorithm.一种使用深度学习和基于Transformer的上下文优化算法的增强图像恢复方法。

Sci Rep. 2025 Mar 25;15(1):10324. doi: 10.1038/s41598-025-94449-5.

本文引用的文献

CTformer: convolution-free Token2Token dilated vision transformer for low-dose CT denoising.CTformer：用于低剂量 CT 去噪的无卷积 Token2Token 扩张视觉转换器。

Phys Med Biol. 2023 Mar 15;68(6). doi: 10.1088/1361-6560/acc000.

Low-Dose CT Denoising via Sinogram Inner-Structure Transformer.基于正弦图内部结构变换器的低剂量CT去噪

IEEE Trans Med Imaging. 2023 Apr;42(4):910-921. doi: 10.1109/TMI.2022.3219856. Epub 2023 Apr 3.

UNet++: A Nested U-Net Architecture for Medical Image Segmentation.U-Net++：一种用于医学图像分割的嵌套U-Net架构。

Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. doi: 10.1007/978-3-030-00889-5_1. Epub 2018 Sep 20.

Attention-guided CNN for image denoising.注意引导卷积神经网络进行图像去噪。

Neural Netw. 2020 Apr;124:117-129. doi: 10.1016/j.neunet.2019.12.024. Epub 2020 Jan 7.

Domain Progressive 3D Residual Convolution Network to Improve Low-Dose CT Imaging.基于域渐进式 3D 残差卷积网络的低剂量 CT 成像方法。

IEEE Trans Med Imaging. 2019 Dec;38(12):2903-2913. doi: 10.1109/TMI.2019.2917258. Epub 2019 May 17.

The risk of cancer attributable to diagnostic medical radiation: Estimation for France in 2015.诊断性医疗辐射所致癌症风险：2015 年法国的估计。

Int J Cancer. 2019 Jun 15;144(12):2954-2963. doi: 10.1002/ijc.32048. Epub 2019 Jan 15.

Low-Dose CT Image Denoising Using a Generative Adversarial Network With Wasserstein Distance and Perceptual Loss.基于 Wasserstein 距离和感知损失的生成对抗网络的低剂量 CT 图像去噪

IEEE Trans Med Imaging. 2018 Jun;37(6):1348-1357. doi: 10.1109/TMI.2018.2827462.

Low-dose CT for the detection and classification of metastatic liver lesions: Results of the 2016 Low Dose CT Grand Challenge.低剂量 CT 检测和分类转移性肝病变：2016 年低剂量 CT 大挑战的结果。

Med Phys. 2017 Oct;44(10):e339-e352. doi: 10.1002/mp.12345.

Low-Dose CT With a Residual Encoder-Decoder Convolutional Neural Network.采用残差编解码器卷积神经网络的低剂量CT

IEEE Trans Med Imaging. 2017 Dec;36(12):2524-2535. doi: 10.1109/TMI.2017.2715284. Epub 2017 Jun 13.

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising.超越高斯去噪器：用于图像去噪的深度 CNN 的残差学习。

IEEE Trans Image Process. 2017 Jul;26(7):3142-3155. doi: 10.1109/TIP.2017.2662206. Epub 2017 Feb 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

WiTUnet：一种集成卷积神经网络（CNN）和Transformer的U型架构，用于改进特征对齐和局部信息融合。

WiTUnet: A U-shaped architecture integrating CNN and Transformer for improved feature alignment and local information fusion.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献