• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MIMO-Uformer:一种用于车辆监控场景的基于Transformer的图像去模糊网络。

MIMO-Uformer: A Transformer-Based Image Deblurring Network for Vehicle Surveillance Scenarios.

作者信息

Zhang Jian, Cheng Baoping, Zhang Tengying, Zhao Yongsheng, Fu Tao, Wu Zijian, Tao Xiaoming

机构信息

China Mobile (Hangzhou) Information Technology Co., Ltd., Hangzhou 311100, China.

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China.

出版信息

J Imaging. 2024 Oct 31;10(11):274. doi: 10.3390/jimaging10110274.

DOI:10.3390/jimaging10110274
PMID:39590738
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11596006/
Abstract

Motion blur is a common problem in the field of surveillance scenarios, and it obstructs the acquisition of valuable information. Thanks to the success of deep learning, a sequence of CNN-based architecture has been designed for image deblurring and has made great progress. As another type of neural network, transformers have exhibited powerful deep representation learning and impressive performance based on high-level vision tasks. Transformer-based networks leverage self-attention to capture the long-range dependencies in the data, yet the computational complexity is quadratic to the spatial resolution, which makes transformers infeasible for the restoration of high-resolution images. In this article, we propose an efficient transformer-based deblurring network, named MIMO-Uformer, for vehicle-surveillance scenarios. The distinct feature of the MIMO-Uformer is that the basic-window-based multi-head self-attention (W-MSA) of the Swin transformer is employed to reduce the computational complexity and then incorporated into a multi-input and multi-output U-shaped network (MIMO-UNet). The performance can benefit from the operation of multi-scale images by MIMO-UNet. However, most deblurring networks are designed for global blur, while local blur is more common under vehicle-surveillance scenarios since the motion blur is primarily caused by local moving vehicles. Based on this observation, we further propose an Intersection over Patch (IoP) factor and a supervised morphological loss to improve the performance based on local blur. Extensive experiments on a public and a self-established dataset are carried out to verify the effectiveness. As a result, the deblurring behavior based on PSNR is improved at least 0.21 dB based on GOPRO and 0.74 dB based on the self-established datasets compared to the existing benchmarks.

摘要

运动模糊是监控场景领域中的一个常见问题,它阻碍了有价值信息的获取。得益于深度学习的成功,一系列基于卷积神经网络(CNN)的架构被设计用于图像去模糊,并取得了巨大进展。作为另一种神经网络类型,Transformer在基于高级视觉任务的深度表示学习方面展现出强大能力和令人印象深刻的性能。基于Transformer的网络利用自注意力机制来捕捉数据中的长距离依赖关系,但其计算复杂度与空间分辨率呈二次关系,这使得Transformer在高分辨率图像恢复方面不可行。在本文中,我们提出了一种用于车辆监控场景的基于Transformer的高效去模糊网络,名为MIMO-Uformer。MIMO-Uformer的独特之处在于采用了Swin Transformer的基于基本窗口的多头自注意力机制(W-MSA)来降低计算复杂度,然后将其融入多输入多输出U型网络(MIMO-UNet)。性能可以受益于MIMO-UNet对多尺度图像的操作。然而,大多数去模糊网络是为全局模糊设计的,而在车辆监控场景下局部模糊更为常见,因为运动模糊主要是由局部移动的车辆引起的。基于这一观察结果,我们进一步提出了面片交并比(IoP)因子和监督形态学损失,以提高基于局部模糊的性能。在一个公开数据集和一个自建数据集上进行了广泛实验以验证其有效性。结果表明,与现有基准相比,基于峰值信噪比(PSNR)的去模糊性能在GOPRO数据集上至少提高了0.21 dB,在自建数据集上提高了0.74 dB。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/c583b9292392/jimaging-10-00274-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/9bc436c687f9/jimaging-10-00274-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/cdef6027798f/jimaging-10-00274-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/e57fb31d188d/jimaging-10-00274-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/0cbf4f801e09/jimaging-10-00274-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/9b50d236ec6e/jimaging-10-00274-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/234262d11e6b/jimaging-10-00274-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/de0e80008677/jimaging-10-00274-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/b45decbc8e5d/jimaging-10-00274-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/34ba96b54f2b/jimaging-10-00274-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/308cc9c1a482/jimaging-10-00274-g010a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/78d9aa3a2fb8/jimaging-10-00274-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/57b3ab3d4315/jimaging-10-00274-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/c583b9292392/jimaging-10-00274-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/9bc436c687f9/jimaging-10-00274-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/cdef6027798f/jimaging-10-00274-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/e57fb31d188d/jimaging-10-00274-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/0cbf4f801e09/jimaging-10-00274-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/9b50d236ec6e/jimaging-10-00274-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/234262d11e6b/jimaging-10-00274-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/de0e80008677/jimaging-10-00274-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/b45decbc8e5d/jimaging-10-00274-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/34ba96b54f2b/jimaging-10-00274-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/308cc9c1a482/jimaging-10-00274-g010a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/78d9aa3a2fb8/jimaging-10-00274-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/57b3ab3d4315/jimaging-10-00274-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0d2/11596006/c583b9292392/jimaging-10-00274-g013.jpg

相似文献

1
MIMO-Uformer: A Transformer-Based Image Deblurring Network for Vehicle Surveillance Scenarios.MIMO-Uformer:一种用于车辆监控场景的基于Transformer的图像去模糊网络。
J Imaging. 2024 Oct 31;10(11):274. doi: 10.3390/jimaging10110274.
2
EHNet: Efficient Hybrid Network with Dual Attention for Image Deblurring.EHNet:用于图像去模糊的具有双重注意力的高效混合网络。
Sensors (Basel). 2024 Oct 10;24(20):6545. doi: 10.3390/s24206545.
3
Image Deblurring by Exploring In-Depth Properties of Transformer.通过探索Transformer的深度属性进行图像去模糊
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4652-4663. doi: 10.1109/TNNLS.2024.3359810. Epub 2025 Feb 28.
4
Image Deblurring Using Multi-Stream Bottom-Top-Bottom Attention Network and Global Information-Based Fusion and Reconstruction Network.基于多流底层-顶层-底层注意力网络和全局信息融合与重建网络的图像去模糊。
Sensors (Basel). 2020 Jul 3;20(13):3724. doi: 10.3390/s20133724.
5
MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation.MSCT-UNET:U 形网络中的多尺度对比变换用于医学图像分割。
Phys Med Biol. 2023 Dec 28;69(1). doi: 10.1088/1361-6560/ad135d.
6
Multi-Stage Network for Event-Based Video Deblurring with Residual Hint Attention.基于残差提示注意力的多阶段事件视频去模糊网络。
Sensors (Basel). 2023 Mar 7;23(6):2880. doi: 10.3390/s23062880.
7
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring.SharpFormer:用于图像去模糊的学习局部特征保持全局表示。
IEEE Trans Image Process. 2023;32:2857-2866. doi: 10.1109/TIP.2023.3251029. Epub 2023 May 26.
8
Cross-shaped windows transformer with self-supervised pretraining for clinically significant prostate cancer detection in bi-parametric MRI.用于双参数磁共振成像中具有临床意义的前列腺癌检测的带自监督预训练的十字形窗口变换器
Med Phys. 2025 Feb;52(2):993-1004. doi: 10.1002/mp.17546. Epub 2024 Nov 26.
9
Deep self-supervised spatial-variant image deblurring.深度自监督空间变分图像去模糊。
Neural Netw. 2024 Nov;179:106591. doi: 10.1016/j.neunet.2024.106591. Epub 2024 Jul 30.
10
ETUNet:Exploring efficient transformer enhanced UNet for 3D brain tumor segmentation.ETUNet:探索高效的基于Transformer 的增强型 UNet 进行 3D 脑肿瘤分割。
Comput Biol Med. 2024 Mar;171:108005. doi: 10.1016/j.compbiomed.2024.108005. Epub 2024 Jan 23.

引用本文的文献

1
DFCNet: Dual-Stage Frequency-Domain Calibration Network for Low-Light Image Enhancement.DFCNet:用于低光照图像增强的双阶段频域校准网络
J Imaging. 2025 Jul 28;11(8):253. doi: 10.3390/jimaging11080253.