Spatial and frequency information fusion transformer for image super-resolution.

Authors

Zhang Yan, Xu Fujie, Sun Yemei, Wang Jiao

Affiliation

College of Computer and Information Engineering, Tianjin Chengjian University, Tianjin 300384, China.

Publication

Neural Netw. 2025 Jul;187:107351. doi: 10.1016/j.neunet.2025.107351. Epub 2025 Mar 17.

DOI: 10.1016/j.neunet.2025.107351
PMID: 40106930

Abstract

Previous works have indicated that Transformer-based models bring impressive image reconstruction performance in single image super-resolution (SISR). However, existing Transformer-based approaches utilize self-attention within non-overlapping windows. This restriction hinders the network's ability to adopt large receptive fields, which are essential for capturing global information and establishing long-distance dependencies, especially in the early layers. To fully leverage global information and activate more pixels during the image reconstruction process, we have developed a Spatial and Frequency Information Fusion Transformer (SFFT) with an expansive receptive field. SFFT concurrently combines spatial and frequency domain information to comprehensively leverage their complementary strengths, capturing both local and global image features while integrating low and high-frequency information. Additionally, we utilize the overlapping cross-attention block (OCAB) to facilitate pixel transmission between adjacent windows, enhancing network performance. During the training stage, we incorporate the Fast Fourier Transform (FFT) loss, thereby fully leveraging the capabilities of our proposed modules and further tapping into the model's potential. Extensive quantitative and qualitative evaluations on benchmark datasets indicate that the proposed algorithm surpasses state-of-the-art methods in terms of accuracy. Specifically, our method achieves a PSNR score of 32.67 dB on the Manga109 dataset, surpassing SwinIR by 0.64 dB and HAT by 0.19 dB, respectively. The source code and pre-trained models are available at https://github.com/Xufujie/SFFT.
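
The abstract does not spell out the form of the FFT loss or how PSNR is computed. As a minimal sketch, assuming the frequency-domain term is an L1 distance between the 2-D Fourier spectra of the reconstructed and ground-truth images (a common choice, not confirmed by the source) and assuming images are float arrays scaled to [0, 1], the two quantities could be computed as below. The function names `fft_l1_loss` and `psnr` are hypothetical and are not taken from the SFFT repository.

```python
import numpy as np


def fft_l1_loss(pred, target):
    """Assumed FFT loss: mean L1 distance between 2-D spectra.

    pred, target: float arrays whose last two axes are height and width.
    """
    pred_f = np.fft.fft2(pred, axes=(-2, -1))
    target_f = np.fft.fft2(target, axes=(-2, -1))
    diff = pred_f - target_f
    # Penalize real and imaginary parts separately, then average.
    return float(np.mean(np.abs(diff.real) + np.abs(diff.imag)))


def psnr(pred, target, max_val=1.0):
    """Peak signal-to-noise ratio in dB for images with peak value max_val."""
    pred = np.asarray(pred, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    mse = np.mean((pred - target) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(max_val ** 2 / mse))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    hr = rng.random((3, 32, 32))  # stand-in ground-truth image
    sr = np.clip(hr + 0.01 * rng.standard_normal(hr.shape), 0.0, 1.0)
    print("FFT L1 loss:", fft_l1_loss(sr, hr))
    print("PSNR (dB):", psnr(sr, hr))
```

In training, a term like this would typically be added to a pixel-space loss with a weighting factor; the weight and the exact combination used by the authors are not stated in the abstract.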

Similar articles

1
Spatial and frequency information fusion transformer for image super-resolution.
Neural Netw. 2025 Jul;187:107351. doi: 10.1016/j.neunet.2025.107351. Epub 2025 Mar 17.
2
Dual-space high-frequency learning for transformer-based MRI super-resolution.
Comput Methods Programs Biomed. 2024 Jun;250:108165. doi: 10.1016/j.cmpb.2024.108165. Epub 2024 Apr 9.
3
Dual selective fusion transformer network for hyperspectral image classification.
Neural Netw. 2025 Jul;187:107311. doi: 10.1016/j.neunet.2025.107311. Epub 2025 Mar 5.
4
GlobalSR: Global context network for single image super-resolution via deformable convolution attention and fast Fourier convolution.
Neural Netw. 2024 Dec;180:106686. doi: 10.1016/j.neunet.2024.106686. Epub 2024 Aug 31.
5
A lightweight large receptive field network LrfSR for image super-resolution.
Sci Rep. 2025 Apr 11;15(1):12535. doi: 10.1038/s41598-025-96796-9.
6
Multi-attention fusion transformer for single-image super-resolution.
Sci Rep. 2024 May 3;14(1):10222. doi: 10.1038/s41598-024-60579-5.
7
MRI super-resolution using similarity distance and multi-scale receptive field based feature fusion GAN and pre-trained slice interpolation network.
Magn Reson Imaging. 2024 Jul;110:195-209. doi: 10.1016/j.mri.2024.04.021. Epub 2024 Apr 21.
8
An enhanced denoising system for mammogram images using deep transformer model with fusion of local and global features.
Sci Rep. 2025 Feb 24;15(1):6562. doi: 10.1038/s41598-025-89451-w.
9
A spatial-spectral fusion convolutional transformer network with contextual multi-head self-attention for hyperspectral image classification.
Neural Netw. 2025 Jul;187:107350. doi: 10.1016/j.neunet.2025.107350. Epub 2025 Mar 14.
10
Transforming Image Super-Resolution: A ConvFormer-Based Efficient Approach.
IEEE Trans Image Process. 2024;33:6071-6082. doi: 10.1109/TIP.2024.3477350. Epub 2024 Oct 25.