Chen Zuojun, Qin Pinle, Zeng Jianchao, Song Quanzhen, Zhao Pengcheng, Chai Rui
School of Computer Science and Technology, North University of China, Taiyuan, 030051, China.
Sci Rep. 2024 Sep 18;14(1):21760. doi: 10.1038/s41598-024-72912-z.
Transformer-based methods effectively capture global dependencies in images and have demonstrated outstanding performance in multiple visual tasks. However, existing Transformers cannot effectively denoise large noisy images captured under low-light conditions because (1) the global self-attention mechanism incurs high computational complexity in the spatial dimension, as its cost grows quadratically with the number of tokens, and (2) channel-wise self-attention cannot exploit the spatial correlations in images. We propose a local-global interaction Transformer (LGIT) that employs an adaptive strategy to select relevant patches for global interaction, achieving low computational complexity in global self-attention computation. A top-N patch cross-attention model (TPCA) is designed based on superpixel segmentation guidance: TPCA selects the N patches most similar to the target image patch and applies cross-attention to aggregate their information into the target patch, effectively enhancing the utilisation of the image's nonlocal self-similarity. A mixed-scale dual-gated feedforward network (MDGFF) is introduced for the effective extraction of multiscale local correlations. TPCA and MDGFF are combined to construct a hierarchical encoder-decoder network, LGIT, which computes self-attention within and across patches at different scales. Extensive experiments on real-world image-denoising datasets demonstrate that LGIT outperforms state-of-the-art (SOTA) convolutional neural network (CNN) and Transformer-based methods in both qualitative and quantitative results.
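To make the top-N patch cross-attention idea concrete, below is a minimal PyTorch sketch. It is not the authors' implementation: the class name TopNPatchCrossAttention, all hyperparameters, and the use of mean-pooled patch descriptors as the similarity measure (a stand-in for the paper's superpixel-segmentation guidance) are illustrative assumptions. It shows only the core mechanism: rank patches by similarity to each target patch, keep the top N, and aggregate their tokens into the target via cross-attention.

```python
import torch
import torch.nn.functional as F
from torch import nn

class TopNPatchCrossAttention(nn.Module):
    """Sketch: select the top-N most similar patches per target patch and
    aggregate their tokens into the target via cross-attention."""

    def __init__(self, dim: int, num_heads: int = 4, top_n: int = 4):
        super().__init__()
        self.top_n = top_n
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, P, T, C) -- B images, P patches, T tokens per patch, C channels
        B, P, T, C = x.shape
        # Patch descriptors via mean pooling -- a proxy for the paper's
        # superpixel-segmentation guidance, which is not reproduced here.
        desc = F.normalize(x.mean(dim=2), dim=-1)                 # (B, P, C)
        sim = desc @ desc.transpose(1, 2)                         # (B, P, P) cosine similarity
        # top_n + 1 because each patch matches itself best; drop the self match.
        idx = sim.topk(self.top_n + 1, dim=-1).indices[..., 1:]   # (B, P, top_n)
        out = torch.empty_like(x)
        for b in range(B):                       # plain loops, kept simple for clarity
            for p in range(P):
                q = x[b, p].unsqueeze(0)                  # (1, T, C) target tokens
                kv = x[b, idx[b, p]].reshape(1, -1, C)    # (1, top_n*T, C) similar patches
                y, _ = self.attn(q, kv, kv)               # cross-attention: query = target
                out[b, p] = y.squeeze(0)
        return out

# Toy usage: 2 images, 16 patches of 64 tokens each, 32-channel embeddings.
x = torch.randn(2, 16, 64, 32)
print(TopNPatchCrossAttention(dim=32)(x).shape)           # torch.Size([2, 16, 64, 32])
```

Restricting cross-attention to N retrieved patches instead of all P patches is what keeps the global interaction sub-quadratic in the number of tokens, as the abstract claims.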
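The mixed-scale dual-gated feedforward network can likewise be sketched. The wiring below (two depthwise-convolution branches at 3x3 and 5x5, each gating the other) is an assumption consistent with the abstract's description of "mixed-scale" and "dual-gated", not the paper's exact architecture; the class name MixedScaleDualGatedFFN and the expansion factor are hypothetical.

```python
import torch
import torch.nn.functional as F
from torch import nn

class MixedScaleDualGatedFFN(nn.Module):
    """Sketch: two depthwise-conv branches at different kernel sizes capture
    multiscale local correlations; each branch gates the other element-wise
    before the fused features are projected back to the input dimension."""

    def __init__(self, dim: int, expansion: int = 2):
        super().__init__()
        hidden = dim * expansion
        self.proj_in = nn.Conv2d(dim, 2 * hidden, kernel_size=1)
        # Mixed scales: 3x3 and 5x5 depthwise convolutions.
        self.dw3 = nn.Conv2d(hidden, hidden, 3, padding=1, groups=hidden)
        self.dw5 = nn.Conv2d(hidden, hidden, 5, padding=2, groups=hidden)
        self.proj_out = nn.Conv2d(hidden, dim, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, C, H, W) feature map
        a, b = self.proj_in(x).chunk(2, dim=1)   # split into two branches
        a, b = self.dw3(a), self.dw5(b)          # multiscale local features
        # Dual gating: each branch modulates the other.
        fused = F.gelu(a) * b + F.gelu(b) * a
        return self.proj_out(fused)

# Toy usage on a 32-channel feature map.
x = torch.randn(1, 32, 64, 64)
print(MixedScaleDualGatedFFN(dim=32)(x).shape)   # torch.Size([1, 32, 64, 64])
```

Depthwise convolutions keep the branch cost linear in the number of pixels, which is why such gated feedforward blocks are a common way to inject local spatial correlations that channel-wise self-attention misses.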