Immersive and Creative Technologies Lab, Concordia University, Montreal, QC, Canada.
Sci Rep. 2022 Nov 16;12(1):19721. doi: 10.1038/s41598-022-21987-7.
Large displacement optical flow is an integral part of many computer vision tasks. Variational optical flow techniques based on a coarse-to-fine scheme interpolate sparse matches and locally optimize an energy model conditioned on colour, gradient and smoothness, making them sensitive to noise in the sparse matches, deformations, and arbitrarily large displacements. This paper addresses this problem and presents HybridFlow, a variational motion estimation framework for large displacements and deformations. A multi-scale hybrid matching approach is performed on the image pairs. Coarse-scale clusters formed by classifying pixels according to their feature descriptors are matched using the clusters' context descriptors. We apply a multi-scale graph matching on the finer-scale superpixels contained within each matched pair of coarse-scale clusters. Small clusters that cannot be further subdivided are matched using localized feature matching. Together, these initial matches form the flow, which is propagated by an edge-preserving interpolation and variational refinement. Our approach does not require training and is robust to substantial displacements and rigid and non-rigid transformations due to motion in the scene, making it ideal for large-scale imagery such as aerial imagery. More notably, HybridFlow works on directed graphs of arbitrary topology representing perceptual groups, which improves motion estimation in the presence of significant deformations. We demonstrate HybridFlow's superior performance to state-of-the-art variational techniques on two benchmark datasets and report comparable results with state-of-the-art deep-learning-based techniques.
大位移光流是许多计算机视觉任务的重要组成部分。基于由粗到精方案的变分光流技术通过插值稀疏匹配并根据颜色、梯度和平滑度对能量模型进行局部优化,从而使它们对稀疏匹配中的噪声、变形和任意大的位移敏感。本文解决了这个问题,并提出了 HybridFlow,这是一种用于大位移和变形的变分运动估计框架。在图像对上执行多尺度混合匹配方法。根据特征描述符对像素进行分类形成的粗尺度聚类使用聚类的上下文描述符进行匹配。我们在每个匹配的粗尺度聚类内的更精细尺度的超像素上应用多尺度图匹配。无法进一步细分的小聚类使用局部特征匹配进行匹配。这些初始匹配共同形成流,该流通过保边插值和变分细化进行传播。我们的方法不需要训练,并且由于场景中的运动而对大位移和刚性和非刚性变换具有鲁棒性,因此非常适合大规模图像,例如航空图像。更值得注意的是,HybridFlow 可以在任意拓扑的有向图上工作,这些有向图表示感知群组,从而在存在显著变形的情况下提高运动估计的性能。我们在两个基准数据集上展示了 HybridFlow 优于最先进的变分技术的性能,并报告了与最先进的基于深度学习的技术相当的结果。