迭代量化：一种用于大规模图像检索的学习二进制代码的普罗克汝斯忒斯方法。

Iterative quantization: a Procrustean approach to learning binary codes for large-scale image retrieval.

机构信息

University of Carolina at Chapel Hill, Chapel Hill.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2013 Dec;35(12):2916-29. doi: 10.1109/TPAMI.2012.193.

DOI:10.1109/TPAMI.2012.193

Abstract

This paper addresses the problem of learning similarity-preserving binary codes for efficient similarity search in large-scale image collections. We formulate this problem in terms of finding a rotation of zero-centered data so as to minimize the quantization error of mapping this data to the vertices of a zero-centered binary hypercube, and propose a simple and efficient alternating minimization algorithm to accomplish this task. This algorithm, dubbed iterative quantization (ITQ), has connections to multiclass spectral clustering and to the orthogonal Procrustes problem, and it can be used both with unsupervised data embeddings such as PCA and supervised embeddings such as canonical correlation analysis (CCA). The resulting binary codes significantly outperform several other state-of-the-art methods. We also show that further performance improvements can result from transforming the data with a nonlinear kernel mapping prior to PCA or CCA. Finally, we demonstrate an application of ITQ to learning binary attributes or "classemes" on the ImageNet data set.

摘要

本文针对在大规模图像集合中进行高效相似性搜索的问题，研究了学习保相似性二进制代码的问题。我们将这个问题表述为寻找零中心数据的旋转，以最小化将该数据映射到零中心二进制超立方体顶点的量化误差，并提出了一种简单而有效的交替最小化算法来完成这项任务。这种算法被称为迭代量化（ITQ），它与多类谱聚类和正交 Procrustes 问题有关，既可以与无监督数据嵌入（如 PCA）一起使用，也可以与监督嵌入（如典型相关分析（CCA））一起使用。生成的二进制代码明显优于其他几种最先进的方法。我们还表明，在 PCA 或 CCA 之前使用非线性核映射对数据进行转换可以进一步提高性能。最后，我们展示了 ITQ 在学习 ImageNet 数据集上的二进制属性或“类内词”的应用。

相似文献

Iterative quantization: a Procrustean approach to learning binary codes for large-scale image retrieval.

IEEE Trans Pattern Anal Mach Intell. 2013 Dec;35(12):2916-29. doi: 10.1109/TPAMI.2012.193.

Hashing on nonlinear manifolds.

IEEE Trans Image Process. 2015 Jun;24(6):1839-51. doi: 10.1109/TIP.2015.2405340.

A Fast Optimization Method for General Binary Code Learning.

IEEE Trans Image Process. 2016 Dec;25(12):5610-5621. doi: 10.1109/TIP.2016.2612883. Epub 2016 Sep 22.

Asymmetric distances for binary embeddings.

IEEE Trans Pattern Anal Mach Intell. 2014 Jan;36(1):33-47. doi: 10.1109/TPAMI.2013.101.

Sub-Selective Quantization for Learning Binary Codes in Large-Scale Image Search.

IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1526-1532. doi: 10.1109/TPAMI.2017.2710186.

Cross-indexing of binary SIFT codes for large-scale image search.

IEEE Trans Image Process. 2014 May;23(5):2047-57. doi: 10.1109/TIP.2014.2312283.

Scalable Feature Matching by Dual Cascaded Scalar Quantization for Image Retrieval.

IEEE Trans Pattern Anal Mach Intell. 2016 Jan;38(1):159-71. doi: 10.1109/TPAMI.2015.2430329.

Hashing with Angular Reconstructive Embeddings.

IEEE Trans Image Process. 2018 Feb;27(2):545-555. doi: 10.1109/TIP.2017.2749147. Epub 2017 Sep 4.

Distributed Adaptive Binary Quantization for Fast Nearest Neighbor Search.

IEEE Trans Image Process. 2017 Nov;26(11):5324-5336. doi: 10.1109/TIP.2017.2729896. Epub 2017 Jul 24.

Supervised Learning of Semantics-Preserving Hash via Deep Convolutional Neural Networks.

IEEE Trans Pattern Anal Mach Intell. 2018 Feb;40(2):437-451. doi: 10.1109/TPAMI.2017.2666812. Epub 2017 Feb 9.

引用本文的文献

Enhancing image retrieval through optimal barcode representation.

Sci Rep. 2025 Aug 7;15(1):28847. doi: 10.1038/s41598-025-14576-x.

Enhanced Image Retrieval Using Multiscale Deep Feature Fusion in Supervised Hashing.

J Imaging. 2025 Jan 12;11(1):20. doi: 10.3390/jimaging11010020.

An Innovative Attention-based Triplet Deep Hashing Approach to Retrieve Histopathology Images.

J Imaging Inform Med. 2024 Nov 11. doi: 10.1007/s10278-024-01310-8.

An Efficient Supervised Deep Hashing Method for Image Retrieval.

Entropy (Basel). 2022 Oct 7;24(10):1425. doi: 10.3390/e24101425.

An Online Hashing Algorithm for Image Retrieval Based on Optical-Sensor Network.

Sensors (Basel). 2023 Feb 25;23(5):2576. doi: 10.3390/s23052576.

An Efficient Retrieval System Framework for Fabrics Based on Fine-Grained Similarity.

Entropy (Basel). 2022 Sep 19;24(9):1319. doi: 10.3390/e24091319.

Discrete Infomax Codes for Supervised Representation Learning.

Entropy (Basel). 2022 Apr 2;24(4):501. doi: 10.3390/e24040501.

Deep Disentangled Hashing with Momentum Triplets for Neuroimage Search.

Med Image Comput Comput Assist Interv. 2020;12261:191-201. doi: 10.1007/978-3-030-59710-8_19. Epub 2020 Sep 29.

Discriminative Codebook Hashing for Supervised Video Retrieval.

Comput Intell Neurosci. 2021 Aug 25;2021:5845094. doi: 10.1155/2021/5845094. eCollection 2021.

A Detection Method of Operated Fake-Images Using Robust Hashing.

J Imaging. 2021 Aug 5;7(8):134. doi: 10.3390/jimaging7080134.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

迭代量化：一种用于大规模图像检索的学习二进制代码的普罗克汝斯忒斯方法。

Iterative quantization: a Procrustean approach to learning binary codes for large-scale image retrieval.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献