Cheng Ziheng, Chen Bo, Lu Ruiying, Wang Zhengjue, Zhang Hao, Meng Ziyi, Yuan Xin
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2264-2281. doi: 10.1109/TPAMI.2022.3161934. Epub 2023 Jan 6.
Conventional high-speed and spectral imaging systems are expensive and they usually consume a significant amount of memory and bandwidth to save and transmit the high-dimensional data. By contrast, snapshot compressive imaging (SCI), where multiple sequential frames are coded by different masks and then summed to a single measurement, is a promising idea to use a 2-dimensional camera to capture 3-dimensional scenes. In this paper, we consider the reconstruction problem in SCI, i.e., recovering a series of scenes from a compressed measurement. Specifically, the measurement and modulation masks are fed into our proposed network, dubbed BIdirectional Recurrent Neural networks with Adversarial Training (BIRNAT) to reconstruct the desired frames. BIRNAT employs a deep convolutional neural network with residual blocks and self-attention to reconstruct the first frame, based on which a bidirectional recurrent neural network is utilized to sequentially reconstruct the following frames. Moreover, we build an extended BIRNAT-color algorithm for color videos aiming at joint reconstruction and demosaicing. Extensive results on both video and spectral, simulation and real data from three SCI cameras demonstrate the superior performance of BIRNAT.
传统的高速和光谱成像系统价格昂贵,而且通常需要消耗大量内存和带宽来保存和传输高维数据。相比之下,快照压缩成像(SCI)是一种很有前景的方法,即通过不同的掩码对多个连续帧进行编码,然后将其求和为单个测量值,从而使用二维相机捕获三维场景。在本文中,我们考虑了SCI中的重建问题,即从压缩测量中恢复一系列场景。具体而言,将测量值和调制掩码输入到我们提出的网络中,该网络称为带有对抗训练的双向递归神经网络(BIRNAT),以重建所需的帧。BIRNAT采用带有残差块和自注意力的深度卷积神经网络来重建第一帧,并在此基础上利用双向递归神经网络顺序重建后续帧。此外,我们针对彩色视频构建了一种扩展的BIRNAT-颜色算法,旨在进行联合重建和去马赛克处理。来自三个SCI相机的视频和光谱、模拟和真实数据的大量结果证明了BIRNAT的卓越性能。