Liu Fang, Deng Xiaoming, Zou Changqing, Lai Yu-Kun, Chen Keqi, Zuo Ran, Ma Cuixia, Liu Yong-Jin, Wang Hongan
IEEE Trans Image Process. 2022;31:3737-3751. doi: 10.1109/TIP.2022.3175403. Epub 2022 May 26.
Sketch-based image retrieval (SBIR) is a long-standing research topic in computer vision. Existing methods mainly focus on category-level or instance-level image retrieval. This paper investigates the fine-grained scene-level SBIR problem, where a free-hand sketch depicting a scene is used to retrieve desired images. This problem is useful yet challenging mainly because of two entangled facts: 1) achieving an effective representation of the input query data and scene-level images is difficult, as it requires modeling information across multiple modalities such as object layout, relative size, and visual appearance, and 2) there is a large domain gap between the query sketch input and target images. We present SceneSketcher-v2, a Graph Convolutional Network (GCN) based architecture to address these challenges. SceneSketcher-v2 employs a carefully designed graph convolutional network to fuse the multi-modality information in the query sketch and target images, and uses a triplet training process in an end-to-end manner to alleviate the domain gap. Extensive experiments demonstrate that SceneSketcher-v2 outperforms state-of-the-art scene-level SBIR models by a significant margin.
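The abstract does not specify the exact form of the triplet objective. As a rough illustration only, a standard triplet hinge loss over sketch and image embeddings, which pulls a sketch embedding toward its matching image and pushes it away from a non-matching one, can be sketched as follows (the function name, margin value, and toy embeddings are all assumptions, not details from the paper):

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.2):
    """Hinge-style triplet loss: encourage the sketch embedding (anchor)
    to be closer to the matching image (positive) than to a non-matching
    image (negative) by at least `margin`. Hypothetical helper for
    illustration; not the paper's exact formulation."""
    d_pos = np.linalg.norm(anchor - positive)  # sketch-to-match distance
    d_neg = np.linalg.norm(anchor - negative)  # sketch-to-non-match distance
    return max(0.0, d_pos - d_neg + margin)

# Toy 2-D embeddings: the positive is much closer to the anchor.
a = np.array([1.0, 0.0])
p = np.array([0.9, 0.1])
n = np.array([-1.0, 0.0])
print(triplet_loss(a, p, n))  # 0.0: the margin is already satisfied
```

When the triplet is well-separated the loss is zero, so training gradients come only from triplets that still violate the margin.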