Li Haoran, Feng Chun-Mei, Xu Yong, Zhou Tao, Yao Lina, Chang Xiaojun
IEEE Trans Image Process. 2023;32:5126-5137. doi: 10.1109/TIP.2023.3308295. Epub 2023 Sep 12.
The goal of Camouflaged object detection (COD) is to detect objects that are visually embedded in their surroundings. Existing COD methods only focus on detecting camouflaged objects from seen classes, while they suffer from performance degradation to detect unseen classes. However, in a real-world scenario, collecting sufficient data for seen classes is extremely difficult and labeling them requires high professional skills, thereby making these COD methods not applicable. In this paper, we propose a new zero-shot COD framework (termed as ZSCOD), which can effectively detect the never unseen classes. Specifically, our framework includes a Dynamic Graph Searching Network (DGSNet) and a Camouflaged Visual Reasoning Generator (CVRG). In details, DGSNet is proposed to adaptively capture more edge details for boosting the COD performance. CVRG is utilized to produce pseudo-features that are closer to the real features of the seen camouflaged objects, which can transfer knowledge from seen classes to unseen classes to help detect unseen objects. Besides, our graph reasoning is built on a dynamic searching strategy, which can pay more attention to the boundaries of objects for reducing the influences of background. More importantly, we construct the first zero-shot COD benchmark based on the COD10K dataset. Experimental results on public datasets show that our ZSCOD not only detects the camouflaged object of unseen classes but also achieves state-of-the-art performance in detecting seen classes.
伪装物体检测(COD)的目标是检测视觉上嵌入其周围环境中的物体。现有的COD方法仅专注于从已见类别中检测伪装物体,而在检测未见类别时性能会下降。然而,在现实场景中,为已见类别收集足够的数据极其困难,并且对其进行标注需要很高的专业技能,从而使得这些COD方法不适用。在本文中,我们提出了一种新的零样本COD框架(称为ZSCOD),它可以有效地检测从未见过的类别。具体来说,我们的框架包括一个动态图搜索网络(DGSNet)和一个伪装视觉推理生成器(CVRG)。详细而言,提出DGSNet是为了自适应地捕捉更多边缘细节以提升COD性能。CVRG用于生成更接近已见伪装物体真实特征的伪特征,它可以将知识从已见类别转移到未见类别以帮助检测未见物体。此外,我们的图推理基于动态搜索策略构建,该策略可以更多地关注物体边界以减少背景的影响。更重要的是,我们基于COD10K数据集构建了首个零样本COD基准。在公共数据集上的实验结果表明,我们的ZSCOD不仅能检测未见类别的伪装物体,而且在检测已见类别时也取得了领先的性能。