视频快照压缩成像的时空Transformer。

Spatial-Temporal Transformer for Video Snapshot Compressive Imaging.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):9072-9089. doi: 10.1109/TPAMI.2022.3225382. Epub 2023 Jun 5.

DOI:10.1109/TPAMI.2022.3225382

Abstract

Video snapshot compressive imaging (SCI) captures multiple sequential video frames by a single measurement using the idea of computational imaging. The underlying principle is to modulate high-speed frames through different masks and these modulated frames are summed to a single measurement captured by a low-speed 2D sensor (dubbed optical encoder); following this, algorithms are employed to reconstruct the desired high-speed frames (dubbed software decoder) if needed. In this article, we consider the reconstruction algorithm in video SCI, i.e., recovering a series of video frames from a compressed measurement. Specifically, we propose a Spatial-Temporal transFormer (STFormer) to exploit the correlation in both spatial and temporal domains. STFormer network is composed of a token generation block, a video reconstruction block, and these two blocks are connected by a series of STFormer blocks. Each STFormer block consists of a spatial self-attention branch, a temporal self-attention branch and the outputs of these two branches are integrated by a fusion network. Extensive results on both simulated and real data demonstrate the state-of-the-art performance of STFormer. The code and models are publicly available at https://github.com/ucaswangls/STFormer.

摘要

视频快照压缩成像 (SCI) 通过使用计算成像的思想，通过单次测量来捕获多个连续的视频帧。其基本原理是通过不同的掩模来调制高速帧，这些调制帧被求和到由低速 2D 传感器（称为光学编码器）捕获的单个测量值中；之后，如果需要，使用算法来重建所需的高速帧（称为软件解码器）。在本文中，我们考虑视频 SCI 中的重建算法，即从压缩测量中恢复一系列视频帧。具体来说，我们提出了一种时空变换网络 (STFormer) 来利用空间和时间域中的相关性。STFormer 网络由一个令牌生成块和一个视频重建块组成，这两个块由一系列 STFormer 块连接。每个 STFormer 块由一个空间自注意力分支和一个时间自注意力分支组成，这两个分支的输出通过一个融合网络进行集成。在模拟和真实数据上的广泛结果表明，STFormer 的性能达到了最新水平。代码和模型可在 https://github.com/ucaswangls/STFormer 上获取。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

视频快照压缩成像的时空Transformer。

Spatial-Temporal Transformer for Video Snapshot Compressive Imaging.

出版信息

相似文献

引用本文的文献

视频快照压缩成像的时空Transformer。

Spatial-Temporal Transformer for Video Snapshot Compressive Imaging.

出版信息

相似文献

引用本文的文献