Weijie Huang, Detian Huang
School of Business, Huaqiao University, Quanzhou, 362021, Fujian Province, China.
College of Engineering, Huaqiao University, Quanzhou, 362021, Fujian Province, China.
Sci Rep. 2025 Jul 1;15(1):20792. doi: 10.1038/s41598-025-07650-x.
Transformers have demonstrated remarkable success in image super-resolution (SR) owing to their powerful long-range dependency modeling capability. Although increasing the sliding window size of transformer-based models (e.g., SwinIR) can improve SR performance, it weakens the learning of fine-level local features, resulting in blurry details in the reconstructed images. To address this limitation, we propose a local feature enhancement transformer for image super-resolution (LFESR) that benefits from global feature capture while enhancing local feature interaction. The basis of our LFESR is the local feature enhancement transformer (LFET), which achieves a balance between spatial processing and channel configuration in self-attention. Our LFET contains neighborhood self-attention (NSA) and a ghost head, both of which can be easily applied to existing SR networks based on window self-attention. First, NSA utilizes the Hadamard operation to implement a third-order mapping that enhances local interaction, thus providing clues for high-quality image reconstruction. Next, the novel ghost head combines attention maps with static matrices to increase the channel capacity, thereby enhancing the inference capability of local features. Finally, a ConvFFN is incorporated to further strengthen high-frequency detail information in the reconstructed images. Extensive experiments validate the proposed LFESR, which significantly outperforms state-of-the-art methods in terms of both visual quality and quantitative metrics. In particular, the proposed LFESR exceeds SwinIR in PSNR by 0.49 dB and 0.52 dB at a scaling factor of ×4 on the Urban100 and Manga109 datasets, respectively.
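The two mechanisms named in the abstract can be illustrated with a toy sketch. This is not the paper's implementation; the function name `nsa_ghost_head`, the blending weight `alpha`, and the exact placement of the Hadamard product are assumptions made for illustration. The sketch shows (a) a ghost head as a data-independent ("static") attention matrix blended with the usual dynamic attention map to add channel capacity cheaply, and (b) a Hadamard (elementwise) product with `q * k` that raises the overall mapping to third order in the input, the kind of higher-order local interaction the NSA module is described as providing.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(z - z.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def nsa_ghost_head(x, Wq, Wk, Wv, static_attn, alpha=0.5):
    """Toy neighborhood self-attention for one local window of tokens.

    x:           (n, d) tokens from a small spatial neighborhood.
    static_attn: (n, n) input-independent logits -- the "ghost head",
                 blended with the dynamic attention map (hypothetical
                 blending scheme; the paper's combination may differ).
    The final Hadamard product with (q * k) makes the output third-order
    in the input, sharpening local feature interaction.
    """
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    scale = np.sqrt(q.shape[-1])
    dyn = softmax(q @ k.T / scale)                  # data-dependent attention map
    attn = alpha * dyn + (1 - alpha) * softmax(static_attn)
    out = attn @ v                                  # standard second-order aggregation
    return out * (q * k)                            # Hadamard -> third-order term

# Usage: a 4x4 window (16 tokens) with 8 channels.
n, d = 16, 8
x = rng.standard_normal((n, d))
Wq, Wk, Wv = (rng.standard_normal((d, d)) * 0.1 for _ in range(3))
static_attn = rng.standard_normal((n, n))
y = nsa_ghost_head(x, Wq, Wk, Wv, static_attn)
print(y.shape)  # (16, 8): same token/channel layout as the input window
```

Because `static_attn` does not depend on the input, it costs no extra matrix multiplications at attention time, which is the appeal of augmenting capacity this way rather than adding a full attention head.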