• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

视频快照压缩成像的时空Transformer。

Spatial-Temporal Transformer for Video Snapshot Compressive Imaging.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):9072-9089. doi: 10.1109/TPAMI.2022.3225382. Epub 2023 Jun 5.

DOI:10.1109/TPAMI.2022.3225382
PMID:36445993
Abstract

Video snapshot compressive imaging (SCI) captures multiple sequential video frames by a single measurement using the idea of computational imaging. The underlying principle is to modulate high-speed frames through different masks and these modulated frames are summed to a single measurement captured by a low-speed 2D sensor (dubbed optical encoder); following this, algorithms are employed to reconstruct the desired high-speed frames (dubbed software decoder) if needed. In this article, we consider the reconstruction algorithm in video SCI, i.e., recovering a series of video frames from a compressed measurement. Specifically, we propose a Spatial-Temporal transFormer (STFormer) to exploit the correlation in both spatial and temporal domains. STFormer network is composed of a token generation block, a video reconstruction block, and these two blocks are connected by a series of STFormer blocks. Each STFormer block consists of a spatial self-attention branch, a temporal self-attention branch and the outputs of these two branches are integrated by a fusion network. Extensive results on both simulated and real data demonstrate the state-of-the-art performance of STFormer. The code and models are publicly available at https://github.com/ucaswangls/STFormer.

摘要

视频快照压缩成像 (SCI) 通过使用计算成像的思想,通过单次测量来捕获多个连续的视频帧。其基本原理是通过不同的掩模来调制高速帧,这些调制帧被求和到由低速 2D 传感器(称为光学编码器)捕获的单个测量值中;之后,如果需要,使用算法来重建所需的高速帧(称为软件解码器)。在本文中,我们考虑视频 SCI 中的重建算法,即从压缩测量中恢复一系列视频帧。具体来说,我们提出了一种时空变换网络 (STFormer) 来利用空间和时间域中的相关性。STFormer 网络由一个令牌生成块和一个视频重建块组成,这两个块由一系列 STFormer 块连接。每个 STFormer 块由一个空间自注意力分支和一个时间自注意力分支组成,这两个分支的输出通过一个融合网络进行集成。在模拟和真实数据上的广泛结果表明,STFormer 的性能达到了最新水平。代码和模型可在 https://github.com/ucaswangls/STFormer 上获取。

相似文献

1
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging.视频快照压缩成像的时空Transformer。
IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):9072-9089. doi: 10.1109/TPAMI.2022.3225382. Epub 2023 Jun 5.
2
Recurrent Neural Networks for Snapshot Compressive Imaging.用于快照压缩成像的递归神经网络。
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2264-2281. doi: 10.1109/TPAMI.2022.3161934. Epub 2023 Jan 6.
3
Plug-and-Play Algorithms for Video Snapshot Compressive Imaging.用于视频快照压缩成像的即插即用算法
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7093-7111. doi: 10.1109/TPAMI.2021.3099035. Epub 2022 Sep 14.
4
STformer: Spatial-Temporal Transformer for early Warning of Unplanned Extubation in ICU.STformer:用于 ICU 中计划外拔管预警的时空变换模型。
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10340923.
5
STFormer: Spatial-Temporal-Aware Transformer for Video Instance Segmentation.STFormer:用于视频实例分割的时空感知Transformer
IEEE Trans Neural Netw Learn Syst. 2025 Jul;36(7):12910-12924. doi: 10.1109/TNNLS.2024.3455551.
6
Rank Minimization for Snapshot Compressive Imaging.快照压缩成像的秩最小化
IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):2990-3006. doi: 10.1109/TPAMI.2018.2873587. Epub 2018 Oct 4.
7
Key frames assisted hybrid encoding for high-quality compressive video sensing.用于高质量压缩视频传感的关键帧辅助混合编码
Opt Express. 2022 Oct 10;30(21):39111-39128. doi: 10.1364/OE.471754.
8
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing.用于视频压缩感知的运动感知动态图神经网络
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):7850-7866. doi: 10.1109/TPAMI.2024.3395804. Epub 2024 Nov 6.
9
Snapshot temporal compressive microscopy using an iterative algorithm with untrained neural networks.使用具有未训练神经网络的迭代算法进行快照时间压缩显微镜。
Opt Lett. 2021 Apr 15;46(8):1888-1891. doi: 10.1364/OL.420139.
10
Learning a spatial-temporal texture transformer network for video inpainting.学习用于视频修复的时空纹理Transformer网络。
Front Neurorobot. 2022 Oct 13;16:1002453. doi: 10.3389/fnbot.2022.1002453. eCollection 2022.

引用本文的文献

1
In-sensor compressing via programmable optoelectronic sensors based on van der Waals heterostructures for intelligent machine vision.基于范德华异质结构的可编程光电传感器用于智能机器视觉的传感器内压缩。
Nat Commun. 2025 Apr 24;16(1):3836. doi: 10.1038/s41467-025-59104-7.
2
Tutorial on compressed ultrafast photography.压缩超快摄影教程。
J Biomed Opt. 2024 Jan;29(Suppl 1):S11524. doi: 10.1117/1.JBO.29.S1.S11524. Epub 2024 Jan 30.