Zhang Zhongyan, Wang Lei, Wang Yang, Zhou Luping, Zhang Jianjia, Chen Fang
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):247-263. doi: 10.1109/TPAMI.2022.3141433. Epub 2022 Dec 5.
Instance image retrieval could greatly benefit from discovering objects in the image dataset. This not only helps produce more reliable feature representation but also better informs users by delineating query-matched object regions. However, object classes are usually not predefined in a retrieval dataset and class label information is generally unavailable in image retrieval. This situation makes object discovery a challenging task. To address this, we propose a novel dataset-driven unsupervised object discovery framework. By utilizing deep feature representation and weakly-supervised object detection, we explore supervisory information from within an image dataset, construct class-wise object detectors, and assign multiple detectors to each image for detection. To efficiently construct object detectors for large image datasets, we propose a novel "base-detector repository" and derive a fast way to generate the base detectors. In addition, the whole framework is designed to work in a self-boosting manner to iteratively refine object discovery. Compared with existing unsupervised object detection methods, our framework produces more accurate object discovery results. Different from supervised detection, we need neither manual annotation nor auxiliary datasets to train object detectors. Experimental study demonstrates the effectiveness of the proposed framework and the improved performance for region-based instance image retrieval.
实例图像检索可以从在图像数据集中发现对象中大大受益。这不仅有助于生成更可靠的特征表示,还能通过描绘查询匹配的对象区域更好地为用户提供信息。然而,在检索数据集中通常没有预定义对象类别,并且在图像检索中通常无法获得类别标签信息。这种情况使得对象发现成为一项具有挑战性的任务。为了解决这个问题,我们提出了一种新颖的数据集驱动的无监督对象发现框架。通过利用深度特征表示和弱监督对象检测,我们从图像数据集中探索监督信息,构建按类别划分的对象检测器,并为每个图像分配多个检测器进行检测。为了有效地为大型图像数据集构建对象检测器,我们提出了一种新颖的“基础检测器存储库”,并得出了一种生成基础检测器的快速方法。此外,整个框架设计为以自增强的方式工作,以迭代地优化对象发现。与现有的无监督对象检测方法相比,我们的框架产生了更准确的对象发现结果。与监督检测不同,我们既不需要手动注释也不需要辅助数据集来训练对象检测器。实验研究证明了所提出框架的有效性以及基于区域的实例图像检索的性能提升。