Gopu Venkata Rama Muni Kumar, Dunna Madhavi
Department of Electrical, Electronics and Communication Engineering (EECE), Gitam School of Technology, Gitam Deemed to be University, Rushikonda, Visakhapatnam 530045, India.
J Imaging. 2024 Mar 27;10(4):79. doi: 10.3390/jimaging10040079.
Sketch-based image retrieval (SBIR) refers to a sub-class of content-based image retrieval problems where the input queries are ambiguous sketches and the retrieval repository is a database of natural images. In the zero-shot setup of SBIR, the query sketches are drawn from classes that do not match any of those that were used in model building. The SBIR task is extremely challenging as it is a cross-domain retrieval problem, unlike content-based image retrieval problems because sketches and images have a huge domain gap. In this work, we propose an elegant retrieval methodology, StyleGen, for generating fake candidate images that match the domain of the repository images, thus reducing the domain gap for retrieval tasks. The retrieval methodology makes use of a two-stage neural network architecture known as the stacked Siamese network, which is known to provide outstanding retrieval performance without losing the generalizability of the approach. Experimental studies on the image sketch datasets TU-Berlin Extended and Sketchy Extended, evaluated using the mean average precision (mAP) metric, demonstrate a marked performance improvement compared to the current state-of-the-art approaches in the domain.
基于草图的图像检索(SBIR)指的是基于内容的图像检索问题的一个子类,其中输入查询是模糊的草图,检索库是自然图像数据库。在SBIR的零样本设置中,查询草图是从与模型构建中使用的任何类都不匹配的类中绘制的。SBIR任务极具挑战性,因为它是一个跨域检索问题,与基于内容的图像检索问题不同,因为草图和图像存在巨大的域差距。在这项工作中,我们提出了一种优雅的检索方法StyleGen,用于生成与库图像域匹配的虚假候选图像,从而减少检索任务的域差距。该检索方法利用了一种称为堆叠连体网络的两阶段神经网络架构,已知该架构在不损失方法通用性的情况下能提供出色的检索性能。使用平均精度均值(mAP)指标在图像草图数据集TU-Berlin Extended和Sketchy Extended上进行的实验研究表明,与该领域当前的最先进方法相比,性能有显著提升。