Guo Xiaojie, Li Yu, Ma Jiayi, Ling Haibin
IEEE Trans Pattern Anal Mach Intell. 2020 Mar;42(3):694-707. doi: 10.1109/TPAMI.2018.2883553. Epub 2018 Nov 28.
Filtering images is required by numerous multimedia, computer vision and graphics tasks. Despite diverse goals of different tasks, making effective rules is key to the filtering performance. Linear translation-invariant filters with manually designed kernels have been widely used. However, their performance suffers from content-blindness. To mitigate the content-blindness, a family of filters, called joint/guided filters, have attracted a great amount of attention from the community. The main drawback of most joint/guided filters comes from the ignorance of structural inconsistency between the reference and target signals like color, infrared, and depth images captured under different conditions. Simply adopting such guidelines very likely leads to unsatisfactory results. To address the above issues, this paper designs a simple yet effective filter, named mutually guided image filter (muGIF), which jointly preserves mutual structures, avoids misleading from inconsistent structures and smooths flat regions. The proposed muGIF is very flexible, which can work in various modes including dynamic only (self-guided), static/dynamic (reference-guided) and dynamic/dynamic (mutually guided) modes. Although the objective of muGIF is in nature non-convex, by subtly decomposing the objective, we can solve it effectively and efficiently. The advantages of muGIF in effectiveness and flexibility are demonstrated over other state-of-the-art alternatives on a variety of applications. Our code is publicly available at https://sites.google.com/view/xjguo/mugif.
众多多媒体、计算机视觉和图形任务都需要对图像进行滤波处理。尽管不同任务有不同的目标,但制定有效的规则是滤波性能的关键。具有手动设计内核的线性平移不变滤波器已被广泛使用。然而,它们的性能存在内容盲目性问题。为了减轻内容盲目性,一类称为联合/引导滤波器的滤波器受到了社区的大量关注。大多数联合/引导滤波器的主要缺点在于忽略了在不同条件下捕获的参考信号和目标信号(如彩色、红外和深度图像)之间的结构不一致性。简单地采用这些准则很可能导致不令人满意的结果。为了解决上述问题,本文设计了一种简单而有效的滤波器,称为互引导图像滤波器(muGIF),它能联合保留相互结构,避免不一致结构的误导并平滑平坦区域。所提出的muGIF非常灵活,它可以在各种模式下工作,包括仅动态(自引导)、静态/动态(参考引导)和动态/动态(互引导)模式。尽管muGIF的目标本质上是非凸的,但通过巧妙地分解目标,我们可以有效地解决它。在各种应用中,muGIF在有效性和灵活性方面的优势相对于其他现有替代方案得到了证明。我们的代码可在https://sites.google.com/view/xjguo/mugif上公开获取。