IEEE Trans Pattern Anal Mach Intell. 2017 Sep;39(9):1866-1879. doi: 10.1109/TPAMI.2016.2616391. Epub 2016 Oct 11.
Though many tasks in computer vision can be formulated elegantly as pixel-labeling problems, a typical challenge discouraging such a discrete formulation is often due to computational efficiency. Recent studies on fast cost volume filtering based on efficient edge-aware filters provide a fast alternative to solve discrete labeling problems, with the complexity independent of the support window size. However, these methods still have to step through the entire cost volume exhaustively, which makes the solution speed scale linearly with the label space size. When the label space is huge or even infinite, which is often the case for (subpixel-accurate) stereo and optical flow estimation, their computational complexity becomes quickly unacceptable. Developed to search approximate nearest neighbors rapidly, the PatchMatch method can significantly reduce the complexity dependency on the search space size. But, its pixel-wise randomized search and fragmented data access within the 3D cost volume seriously hinder the application of efficient cost slice filtering. This paper presents a generic and fast computational framework for general multi-labeling problems called PatchMatch Filter (PMF). We explore effective and efficient strategies to weave together these two fundamental techniques developed in isolation, i.e., PatchMatch-based randomized search and efficient edge-aware image filtering. By decompositing an image into compact superpixels, we also propose superpixel-based novel search strategies that generalize and improve the original PatchMatch method. Further motivated to improve the regularization strength, we propose a simple yet effective cross-scale consistency constraint, which handles labeling estimation for large low-textured regions more reliably than a single-scale PMF algorithm. Focusing on dense correspondence field estimation in this paper, we demonstrate PMF's applications in stereo and optical flow. Our PMF methods achieve top-tier correspondence accuracy but run much faster than other related competing methods, often giving over 10-100 times speedup.
虽然计算机视觉中的许多任务都可以巧妙地表示为像素标记问题,但通常由于计算效率的原因,一个典型的挑战是不鼓励这种离散的表示形式。最近基于高效边缘感知滤波器的快速代价体滤波的研究为解决离散标记问题提供了一种快速替代方法,其复杂度与支持窗口大小无关。然而,这些方法仍然必须全面遍历整个代价体,这使得解决方案的速度与标签空间大小呈线性比例。当标签空间很大甚至是无限的时,这通常是(亚像素精确)立体和光流估计的情况,它们的计算复杂度很快就变得不可接受。为了快速搜索近似最近邻而开发的 PatchMatch 方法可以显著降低对搜索空间大小的复杂度依赖。但是,其逐像素随机搜索和在 3D 代价体中的碎片化数据访问严重阻碍了高效代价体滤波的应用。本文提出了一种通用的快速计算框架,用于一般的多标签问题,称为 PatchMatch Filter(PMF)。我们探索了将这两种独立开发的基本技术有效而高效地结合在一起的策略,即基于 PatchMatch 的随机搜索和高效的边缘感知图像滤波。通过将图像分解为紧凑的超像素,我们还提出了基于超像素的新搜索策略,该策略对原始 PatchMatch 方法进行了概括和改进。进一步受到提高正则化强度的启发,我们提出了一种简单而有效的跨尺度一致性约束,该约束比单尺度 PMF 算法更可靠地处理大纹理区域的标记估计。本文重点关注密集对应场估计,展示了 PMF 在立体和光流中的应用。我们的 PMF 方法实现了顶级的对应准确性,但运行速度比其他相关竞争方法快得多,通常可以提供 10 到 100 倍以上的加速。