Liu Chengxu, Yang Huan, Fu Jianlong, Qian Xueming
IEEE Trans Image Process. 2023;32:4728-4741. doi: 10.1109/TIP.2023.3302990. Epub 2023 Aug 22.
Video frame interpolation (VFI) aims to synthesize an intermediate frame between two consecutive frames. State-of-the-art approaches usually adopt a two-step solution, which includes 1) generating locally-warped pixels by calculating the optical flow based on pre-defined motion patterns (e.g., uniform motion, symmetric motion), 2) blending the warped pixels to form a full frame through deep neural synthesis networks. However, for various complicated motions (e.g., non-uniform motion, turn around), such improper assumptions about pre-defined motion patterns introduce the inconsistent warping from the two consecutive frames. This leads to the warped features for new frames are usually not aligned, yielding distortion and blur, especially when large and complex motions occur. To solve this issue, in this paper we propose a novel Trajectory-aware Transformer for Video Frame Interpolation (TTVFI). In particular, we formulate the warped features with inconsistent motions as query tokens, and formulate relevant regions in a motion trajectory from two original consecutive frames into keys and values. Self-attention is learned on relevant tokens along the trajectory to blend the pristine features into intermediate frames through end-to-end training. Experimental results demonstrate that our method outperforms other state-of-the-art methods in four widely-used VFI benchmarks. Both code and pre-trained models will be released at https://github.com/ChengxuLiu/TTVFI.
视频帧插值(VFI)旨在合成两个连续帧之间的中间帧。当前的先进方法通常采用两步解决方案,其中包括:1)通过基于预定义运动模式(例如,匀速运动、对称运动)计算光流来生成局部扭曲像素;2)通过深度神经合成网络混合扭曲像素以形成完整帧。然而,对于各种复杂运动(例如,非匀速运动、转身),关于预定义运动模式的这种不当假设会导致两个连续帧产生不一致的扭曲。这使得新帧的扭曲特征通常无法对齐,从而产生失真和模糊,尤其是在发生大的复杂运动时。为了解决这个问题,在本文中我们提出了一种用于视频帧插值的新型轨迹感知Transformer(TTVFI)。具体而言,我们将具有不一致运动的扭曲特征表述为查询令牌,并将来自两个原始连续帧的运动轨迹中的相关区域表述为键和值。通过沿轨迹对相关令牌学习自注意力,以通过端到端训练将原始特征融合到中间帧中。实验结果表明,我们的方法在四个广泛使用的VFI基准测试中优于其他现有方法。代码和预训练模型都将在https://github.com/ChengxuLiu/TTVFI上发布。