• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SVTSR:使用散射视觉Transformer的图像超分辨率

SVTSR: image super-resolution using scattering vision transformer.

作者信息

Liang Jiabao, Jin Yutao, Chen Xiaoyan, Huang Haotian, Deng Yue

机构信息

School of Electronic Information and Automation, Tianjin, China.

出版信息

Sci Rep. 2024 Dec 30;14(1):31770. doi: 10.1038/s41598-024-82650-x.

DOI:10.1038/s41598-024-82650-x
PMID:39738394
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11685942/
Abstract

Vision transformers have garnered substantial attention and attained impressive performance in image super-resolution tasks. Nevertheless, these networks face challenges associated with attention complexity and the effective capture of intricate, fine-grained details within images. These hurdles impede the efficient and scalable deployment of transformer models for image super-resolution tasks in real-world applications. In this paper, we present a novel vision transformer called Scattering Vision Transformer for Super-Resolution (SVTSR) to tackle these challenges. SVTSR integrates a spectrally scattering network to efficiently capture intricate image details. It addresses the invertibility problem commonly encountered in down-sampling operations by separating low-frequency and high-frequency components. Additionally, SVTSR introduces a novel spectral gating network that utilizes Einstein multiplication for token and channel mixing, effectively reducing complexity. Extensive experiments show the effectiveness of the proposed vision transformer for image super-resolution tasks. Our comprehensive methodology not only outperforms state-of-the-art methods in terms of the PSNR and SSIM metrics but, more significantly, entails a reduction in model parameters exceeding tenfold when compared to the baseline model. As shown in Fig. 1, the substantial decrease of parameter amount proves highly advantageous for the deployment and practical application of super-resolution models. Code is available at https://github.com/LiangJiabaoY/SVTSR.git.

摘要

视觉Transformer在图像超分辨率任务中受到了广泛关注并取得了令人瞩目的性能。然而,这些网络面临着与注意力复杂度以及有效捕捉图像中复杂的细粒度细节相关的挑战。这些障碍阻碍了Transformer模型在实际应用中高效且可扩展地部署于图像超分辨率任务。在本文中,我们提出了一种名为用于超分辨率的散射视觉Transformer(SVTSR)的新型视觉Transformer来应对这些挑战。SVTSR集成了一个频谱散射网络以有效捕捉复杂的图像细节。它通过分离低频和高频分量解决了下采样操作中常见的可逆性问题。此外,SVTSR引入了一种新颖的频谱门控网络,该网络利用爱因斯坦乘法进行令牌和通道混合,有效降低了复杂度。大量实验表明了所提出的视觉Transformer在图像超分辨率任务中的有效性。我们的综合方法不仅在PSNR和SSIM指标方面优于现有方法,更重要的是,与基线模型相比,模型参数减少了超过十倍。如图1所示,参数数量的大幅减少对于超分辨率模型的部署和实际应用非常有利。代码可在https://github.com/LiangJiabaoY/SVTSR.git获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f07/11685942/d6ff4e205abb/41598_2024_82650_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f07/11685942/e7e0d573996a/41598_2024_82650_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f07/11685942/c3337b2ea0e6/41598_2024_82650_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f07/11685942/d6ff4e205abb/41598_2024_82650_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f07/11685942/e7e0d573996a/41598_2024_82650_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f07/11685942/c3337b2ea0e6/41598_2024_82650_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f07/11685942/d6ff4e205abb/41598_2024_82650_Fig3_HTML.jpg

相似文献

1
SVTSR: image super-resolution using scattering vision transformer.SVTSR:使用散射视觉Transformer的图像超分辨率
Sci Rep. 2024 Dec 30;14(1):31770. doi: 10.1038/s41598-024-82650-x.
2
A Transformer-Based Model for Super-Resolution of Anime Image.基于Transformer 的动漫图像超分辨率模型。
Sensors (Basel). 2022 Oct 24;22(21):8126. doi: 10.3390/s22218126.
3
Parallel attention recursive generalization transformer for image super-resolution.用于图像超分辨率的并行注意力递归泛化变换器
Sci Rep. 2025 Mar 13;15(1):8669. doi: 10.1038/s41598-025-92377-y.
4
Texture transformer super-resolution for low-dose computed tomography.纹理变换超分辨率在低剂量 CT 中的应用。
Biomed Phys Eng Express. 2022 Nov 4;8(6). doi: 10.1088/2057-1976/ac9da7.
5
Dual-space high-frequency learning for transformer-based MRI super-resolution.基于变换的 MRI 超分辨率的双空间高频学习。
Comput Methods Programs Biomed. 2024 Jun;250:108165. doi: 10.1016/j.cmpb.2024.108165. Epub 2024 Apr 9.
6
Image Super-Resolution via Efficient Transformer Embedding Frequency Decomposition With Restart.通过带重启的高效变压器嵌入频率分解实现图像超分辨率
IEEE Trans Image Process. 2024;33:4670-4685. doi: 10.1109/TIP.2024.3444317. Epub 2024 Aug 28.
7
Enhancing fetal ultrasound image quality and anatomical plane recognition in low-resource settings using super-resolution models.在资源匮乏的环境中使用超分辨率模型提高胎儿超声图像质量和解剖平面识别能力。
Sci Rep. 2025 Mar 11;15(1):8376. doi: 10.1038/s41598-025-91808-0.
8
An enhanced denoising system for mammogram images using deep transformer model with fusion of local and global features.一种使用具有局部和全局特征融合的深度变压器模型的乳腺X光图像增强去噪系统。
Sci Rep. 2025 Feb 24;15(1):6562. doi: 10.1038/s41598-025-89451-w.
9
HMFT: Hyperspectral and Multispectral Image Fusion Super-Resolution Method Based on Efficient Transformer and Spatial-Spectral Attention Mechanism.基于高效Transformer 和空间-谱注意力机制的高光谱与多光谱图像融合超分辨率方法
Comput Intell Neurosci. 2023 Mar 1;2023:4725986. doi: 10.1155/2023/4725986. eCollection 2023.
10
Cross-Scope Spatial-Spectral Information Aggregation for Hyperspectral Image Super-Resolution.用于高光谱图像超分辨率的跨范围空间光谱信息聚合
IEEE Trans Image Process. 2024;33:5878-5891. doi: 10.1109/TIP.2024.3468905. Epub 2024 Oct 18.

本文引用的文献

1
VRT: A Video Restoration Transformer.VRT:一种视频恢复Transformer。
IEEE Trans Image Process. 2024;33:2171-2182. doi: 10.1109/TIP.2024.3372454. Epub 2024 Mar 22.
2
UniFormer: Unifying Convolution and Self-Attention for Visual Recognition.统一卷积与自注意力机制用于视觉识别的UniFormer
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):12581-12600. doi: 10.1109/TPAMI.2023.3282631. Epub 2023 Sep 5.
3
Glance and Focus Networks for Dynamic Visual Recognition.扫视与聚焦网络的动态视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4605-4621. doi: 10.1109/TPAMI.2022.3196959. Epub 2023 Mar 7.
4
Wavelet-Based Dual Recursive Network for Image Super-Resolution.基于小波的双递归网络用于图像超分辨率
IEEE Trans Neural Netw Learn Syst. 2022 Feb;33(2):707-720. doi: 10.1109/TNNLS.2020.3028688. Epub 2022 Feb 3.
5
Image Super-Resolution Using Deep Convolutional Networks.基于深度卷积网络的图像超分辨率重建。
IEEE Trans Pattern Anal Mach Intell. 2016 Feb;38(2):295-307. doi: 10.1109/TPAMI.2015.2439281.