基于稀疏用户交互的深度图像抠图

Deep Image Matting With Sparse User Interactions.

作者信息

Wei Tianyi, Chen Dongdong, Zhou Wenbo, Liao Jing, Zhao Hanqing, Zhang Weiming, Hua Gang, Yu Nenghai

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):881-895. doi: 10.1109/TPAMI.2023.3326693. Epub 2024 Jan 8.

DOI:10.1109/TPAMI.2023.3326693

Abstract

Image matting is a fundamental and challenging problem in computer vision and graphics. Most existing matting methods leverage a user-supplied trimap as an auxiliary input to produce good alpha matte. However, obtaining high-quality trimap itself is arduous. Recently, some hint-free methods have emerged, however, the matting quality is still far behind the trimap-based methods. The main reason is that, some hints for removing semantic ambiguity and improving matting quality are essential. Apparently, there is a trade-off between interaction cost and matting quality. To balance performance and user-friendliness, we propose an improved deep image matting framework which is trimap-free and only needs sparse user click or scribble interaction to minimize the needed auxiliary constraints while still allowing interactivity. Moreover, we introduce uncertainty estimation that predicts which parts need polishing and conduct uncertainty-guided refinement. To trade off runtime against refinement quality, users can also choose different refinement modes. Experimental results show that our method performs better than existing trimap-free methods and comparably to state-of-the-art trimap-based methods with minimal user effort. Finally, we demonstrate the extensibility of our framework to video human matting without any structure modification, by adding optical flow-based sparse hint propagation and temporal consistency regularization imposed on the single frame.

摘要

图像抠图是计算机视觉和图形学中的一个基本且具有挑战性的问题。大多数现有的抠图方法利用用户提供的三值图作为辅助输入来生成良好的alpha遮罩。然而，获得高质量的三值图本身很艰巨。最近，一些无需提示的方法已经出现，但是，抠图质量仍然远远落后于基于三值图的方法。主要原因是，一些用于消除语义模糊和提高抠图质量的提示是必不可少的。显然，在交互成本和抠图质量之间存在权衡。为了平衡性能和用户友好性，我们提出了一种改进的深度图像抠图框架，该框架无需三值图，只需要稀疏的用户点击或涂鸦交互，以最小化所需的辅助约束，同时仍允许交互性。此外，我们引入了不确定性估计，以预测哪些部分需要优化，并进行不确定性引导的细化。为了在运行时和细化质量之间进行权衡，用户还可以选择不同的细化模式。实验结果表明，我们的方法比现有的无需三值图的方法表现更好，并且在用户工作量最小的情况下与基于三值图的最先进方法相当。最后，我们通过添加基于光流的稀疏提示传播和施加在单帧上的时间一致性正则化，展示了我们的框架在不进行任何结构修改的情况下扩展到视频人体抠图的能力。

相似文献

Deep Image Matting With Sparse User Interactions.基于稀疏用户交互的深度图像抠图

IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):881-895. doi: 10.1109/TPAMI.2023.3326693. Epub 2024 Jan 8.

User-Guided Deep Human Image Matting Using Arbitrary Trimaps.基于用户引导的任意 Trimaps 的深度人体图像抠图。

IEEE Trans Image Process. 2022;31:2040-2052. doi: 10.1109/TIP.2022.3150295. Epub 2022 Feb 25.

Deep Interactive Image Matting With Feature Propagation.基于特征传播的深度交互式图像抠图

IEEE Trans Image Process. 2022;31:2421-2432. doi: 10.1109/TIP.2022.3155958. Epub 2022 Mar 15.

Medical matting: Medical image segmentation with uncertainty from the matting perspective.医学抠图：从抠图角度看具有不确定性的医学图像分割

Comput Biol Med. 2023 May;158:106714. doi: 10.1016/j.compbiomed.2023.106714. Epub 2023 Feb 28.

A Hierarchical Image Matting Model for Blood Vessel Segmentation in Fundus Images.一种用于眼底图像中血管分割的分层图像抠图模型。

IEEE Trans Image Process. 2018 Dec 17. doi: 10.1109/TIP.2018.2885495.

Real-Time Multi-Person Video Synthesis with Controllable Prior-Guided Matting.基于可控先验引导抠图的实时多人视频合成

Sensors (Basel). 2024 Apr 27;24(9):2795. doi: 10.3390/s24092795.

Sparse Coding for Alpha Matting.Alpha 抠图的稀疏编码。

IEEE Trans Image Process. 2016 Jul;25(7):3032-3043. doi: 10.1109/TIP.2016.2555705.

Unsupervised Video Matting via Sparse and Low-Rank Representation.基于稀疏和低秩表示的无监督视频抠图

IEEE Trans Pattern Anal Mach Intell. 2020 Jun;42(6):1501-1514. doi: 10.1109/TPAMI.2019.2895331. Epub 2019 Jan 25.

3D matting: A benchmark study on soft segmentation method for pulmonary nodules applied in computed tomography.3D抠图：计算机断层扫描中应用的肺结节软分割方法的基准研究。

Comput Biol Med. 2022 Nov;150:106153. doi: 10.1016/j.compbiomed.2022.106153. Epub 2022 Oct 5.

Targeting accurate object extraction from an image: a comprehensive study of natural image matting.从图像中准确提取目标：自然图像抠图的综合研究。

IEEE Trans Neural Netw Learn Syst. 2015 Feb;26(2):185-207. doi: 10.1109/TNNLS.2014.2369426. Epub 2014 Nov 20.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于稀疏用户交互的深度图像抠图

Deep Image Matting With Sparse User Interactions.

作者信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献