• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LGIT:用于低光照图像去噪的局部-全局交互变换器

LGIT: local-global interaction transformer for low-light image denoising.

作者信息

Chen Zuojun, Qin Pinle, Zeng Jianchao, Song Quanzhen, Zhao Pengcheng, Chai Rui

机构信息

School of Computer Science and Technology, North University of China, Taiyuan, 030051, China.

出版信息

Sci Rep. 2024 Sep 18;14(1):21760. doi: 10.1038/s41598-024-72912-z.

DOI:10.1038/s41598-024-72912-z
PMID:39294345
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11410926/
Abstract

Transformer-based methods effectively capture global dependencies in images, demonstrating outstanding performance in multiple visual tasks. However, existing Transformers cannot effectively denoise large noisy images captured under low-light conditions owing to (1) the global self-attention mechanism causing high computational complexity in the spatial dimension owing to a quadratic increase in computation with the number of tokens; (2) the channel-wise self-attention computation unable to optimise the spatial correlations in images. We propose a local-global interaction Transformer (LGIT) that employs an adaptive strategy to select relevant patches for global interaction, achieving low computational complexity in global self-attention computation. A top-N patch cross-attention model (TPCA) is designed based on superpixel segmentation guidance. TPCA selects top-N patches most similar to the target image patch and applies cross attention to aggregate information from them into the target patch, effectively enhancing the utilisation of the image's nonlocal self-similarity. A mixed-scale dual-gated feedforward network (MDGFF) is introduced for the effective extraction of multiscale local correlations. TPCA and MDGFF were combined to construct a hierarchical encoder-decoder network, LGIT, to compute self-attention within and across patches at different scales. Extensive experiments using real-world image-denoising datasets demonstrated that LGIT outperformed state-of-the-art (SOTA) convolutional neural network (CNN) and Transformer-based methods in qualitative and quantitative results.

摘要

基于Transformer的方法能够有效地捕捉图像中的全局依赖性,在多个视觉任务中表现出色。然而,现有的Transformer无法有效地对在低光照条件下拍摄的大尺寸噪声图像进行去噪,原因如下:(1)全局自注意力机制由于计算量随token数量呈二次方增长,导致在空间维度上计算复杂度较高;(2)通道维度的自注意力计算无法优化图像中的空间相关性。我们提出了一种局部-全局交互Transformer(LGIT),它采用自适应策略来选择用于全局交互的相关patch,在全局自注意力计算中实现了低计算复杂度。基于超像素分割引导设计了一种top-N patch交叉注意力模型(TPCA)。TPCA选择与目标图像patch最相似的top-N个patch,并应用交叉注意力将来自它们的信息聚合到目标patch中,有效地提高了图像非局部自相似性的利用率。引入了一种混合尺度双门控前馈网络(MDGFF)来有效提取多尺度局部相关性。将TPCA和MDGFF相结合,构建了一个分层编码器-解码器网络LGIT,以计算不同尺度下patch内部和跨patch的自注意力。使用真实世界图像去噪数据集进行的大量实验表明,LGIT在定性和定量结果方面均优于当前最先进的(SOTA)卷积神经网络(CNN)和基于Transformer的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/c203e75ada80/41598_2024_72912_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/3c9479ff2d8b/41598_2024_72912_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/9248ee4cedd0/41598_2024_72912_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/ef0bd17a1197/41598_2024_72912_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/c94f4a04e15f/41598_2024_72912_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/275717959ce5/41598_2024_72912_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/c37411cd08d7/41598_2024_72912_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/c203e75ada80/41598_2024_72912_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/3c9479ff2d8b/41598_2024_72912_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/9248ee4cedd0/41598_2024_72912_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/ef0bd17a1197/41598_2024_72912_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/c94f4a04e15f/41598_2024_72912_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/275717959ce5/41598_2024_72912_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/c37411cd08d7/41598_2024_72912_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/171f/11410926/c203e75ada80/41598_2024_72912_Fig7_HTML.jpg

相似文献

1
LGIT: local-global interaction transformer for low-light image denoising.LGIT:用于低光照图像去噪的局部-全局交互变换器
Sci Rep. 2024 Sep 18;14(1):21760. doi: 10.1038/s41598-024-72912-z.
2
MultiTrans: Multi-branch transformer network for medical image segmentation.多分支转换器网络在医学图像分割中的应用。
Comput Methods Programs Biomed. 2024 Sep;254:108280. doi: 10.1016/j.cmpb.2024.108280. Epub 2024 Jun 8.
3
DiagSWin: A multi-scale vision transformer with diagonal-shaped windows for object detection and segmentation.DiagSWin:一种具有对角线形状窗口的多尺度视觉转换器,用于目标检测和分割。
Neural Netw. 2024 Dec;180:106653. doi: 10.1016/j.neunet.2024.106653. Epub 2024 Aug 22.
4
Spach Transformer: Spatial and Channel-Wise Transformer Based on Local and Global Self-Attentions for PET Image Denoising.Spach 转换器:基于局部和全局自注意力的空间和通道转换器,用于 PET 图像去噪。
IEEE Trans Med Imaging. 2024 Jun;43(6):2036-2049. doi: 10.1109/TMI.2023.3336237. Epub 2024 Jun 3.
5
A 3D hierarchical cross-modality interaction network using transformers and convolutions for brain glioma segmentation in MR images.一种使用变换和卷积的 3D 层次跨模态交互网络,用于磁共振图像中的脑胶质瘤分割。
Med Phys. 2024 Nov;51(11):8371-8389. doi: 10.1002/mp.17354. Epub 2024 Aug 13.
6
MESTrans: Multi-scale embedding spatial transformer for medical image segmentation.MESTrans:用于医学图像分割的多尺度嵌入空间变换器
Comput Methods Programs Biomed. 2023 May;233:107493. doi: 10.1016/j.cmpb.2023.107493. Epub 2023 Mar 17.
7
A new visual State Space Model for low-dose CT denoising.一种用于低剂量CT去噪的新型视觉状态空间模型。
Med Phys. 2024 Dec;51(12):8851-8864. doi: 10.1002/mp.17387. Epub 2024 Sep 4.
8
Dual encoder network with transformer-CNN for multi-organ segmentation.基于 Transformer-CNN 的双编码器网络的多器官分割。
Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.
9
TSCA-Net: Transformer based spatial-channel attention segmentation network for medical images.TSCA-Net:基于Transformer 的空间-通道注意力分割网络用于医学图像。
Comput Biol Med. 2024 Mar;170:107938. doi: 10.1016/j.compbiomed.2024.107938. Epub 2024 Jan 3.
10
MMNet: A Mixing Module Network for Polyp Segmentation.MMNet:一种用于息肉分割的混合模块网络。
Sensors (Basel). 2023 Aug 18;23(16):7258. doi: 10.3390/s23167258.

引用本文的文献

1
A multi-modal deep learning solution for precise pneumonia diagnosis: the PneumoFusion-Net model.一种用于精确肺炎诊断的多模态深度学习解决方案:PneumoFusion-Net模型。
Front Physiol. 2025 Mar 12;16:1512835. doi: 10.3389/fphys.2025.1512835. eCollection 2025.

本文引用的文献

1
Single Stage Adaptive Multi-Attention Network for Image Restoration.用于图像恢复的单阶段自适应多注意力网络
IEEE Trans Image Process. 2024;33:2924-2935. doi: 10.1109/TIP.2024.3384838. Epub 2024 Apr 23.
2
A novel low light object detection method based on the YOLOv5 fusion feature enhancement.一种基于YOLOv5融合特征增强的新型低光照目标检测方法。
Sci Rep. 2024 Feb 23;14(1):4486. doi: 10.1038/s41598-024-54428-8.
3
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution.TTST:一种用于遥感图像超分辨率的Top-k令牌选择变换器
IEEE Trans Image Process. 2024;33:738-752. doi: 10.1109/TIP.2023.3349004. Epub 2024 Jan 12.
4
Learnability Enhancement for Low-Light Raw Image Denoising: A Data Perspective.
IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):370-387. doi: 10.1109/TPAMI.2023.3301502. Epub 2023 Dec 5.
5
Superpixel-guided class-level denoising for unsupervised domain adaptive fundus image segmentation without source data.基于超像素引导的类水平去噪的无源数据眼底图像分割的域自适应方法
Comput Biol Med. 2023 Aug;162:107061. doi: 10.1016/j.compbiomed.2023.107061. Epub 2023 May 26.
6
A survey on image enhancement for Low-light images.低光照图像的图像增强研究
Heliyon. 2023 Mar 16;9(4):e14558. doi: 10.1016/j.heliyon.2023.e14558. eCollection 2023 Apr.
7
Low light image enhancement using curvelet transform and iterative back projection.基于curvelet 变换和迭代反向投影的微光图像增强。
Sci Rep. 2023 Jan 17;13(1):872. doi: 10.1038/s41598-023-27838-3.
8
Progressive Joint Low-Light Enhancement and Noise Removal for Raw Images.用于原始图像的渐进式联合低光增强与去噪
IEEE Trans Image Process. 2022;31:2390-2404. doi: 10.1109/TIP.2022.3155948. Epub 2022 Mar 15.
9
Deep learning in optical metrology: a review.光学计量中的深度学习:综述
Light Sci Appl. 2022 Feb 23;11(1):39. doi: 10.1038/s41377-022-00714-x.
10
Towards Low Light Enhancement With RAW Images.利用原始图像实现低光增强
IEEE Trans Image Process. 2022;31:1391-1405. doi: 10.1109/TIP.2022.3140610. Epub 2022 Jan 25.