Xu Yafan, Zhao Yan, Ji Mengqi
Appl Opt. 2014 Oct 10;53(29):6885-92. doi: 10.1364/AO.53.006885.
Cost aggregation is the most important step in a local stereo algorithm. In this work, a novel local stereo-matching algorithm with a cost-aggregation method based on adaptive shape support window (ASSW) is proposed. First, we compute the initial cost volume, which uses both absolute intensity difference and gradient similarity to measure dissimilarity. Second, we apply an ASSW-based cost-aggregation method to get the aggregated cost within the support window. There are two main parts: at first we construct a local support skeleton anchoring each pixel with four varying arm lengths decided on color similarity; as a result, the support window integral of multiple horizontal segments spanned by pixels in the neighboring vertical is established. Then we utilize extended implementation of guided filter to aggregate cost volume within the ASSW, which has better edge-preserving smoothing property than bilateral filter independent of the filtering kernel size. In this way, the number of bad pixels located in the incorrect depth regions can be effectively reduced through finding optimal support windows with an arbitrary shape and size adaptively. Finally, the initial disparity value of each pixel is selected using winner takes all optimization and post processing symmetrically, considering both the reference and the target image, is adopted. The experimental results demonstrate that the proposed algorithm achieves outstanding matching performance compared with other existing local algorithms on the Middlebury stereo benchmark, especially in depth discontinuities and piecewise smooth regions.
代价聚合是局部立体匹配算法中最重要的步骤。在这项工作中,提出了一种基于自适应形状支持窗口(ASSW)的代价聚合方法的新型局部立体匹配算法。首先,我们计算初始代价体,它使用绝对强度差和梯度相似性来衡量差异。其次,我们应用基于ASSW的代价聚合方法来获取支持窗口内的聚合代价。它主要有两个部分:首先,我们构建一个局部支持骨架,通过基于颜色相似性确定的四个不同臂长来锚定每个像素;结果,建立了由相邻垂直方向上的像素跨越的多个水平段的支持窗口积分。然后,我们利用引导滤波器的扩展实现来在ASSW内聚合代价体,它比双边滤波器具有更好的保边平滑特性,且与滤波核大小无关。通过这种方式,可以通过自适应地找到任意形状和大小的最优支持窗口,有效地减少位于错误深度区域的坏像素数量。最后,使用胜者全得优化方法选择每个像素的初始视差值,并对称地考虑参考图像和目标图像进行后处理。实验结果表明,与Middlebury立体基准测试中的其他现有局部算法相比,该算法具有出色的匹配性能,特别是在深度不连续和分段平滑区域。