IEEE Trans Pattern Anal Mach Intell. 2015 Nov;37(11):2304-16. doi: 10.1109/TPAMI.2015.2408363.
Many binary code embedding schemes have been actively studied recently, since they can provide efficient similarity search, and compact data representations suitable for handling large scale image databases. Existing binary code embedding techniques encode high-dimensional data by using hyperplane-based hashing functions. In this paper we propose a novel hypersphere-based hashing function, spherical hashing, to map more spatially coherent data points into a binary code compared to hyperplane-based hashing functions. We also propose a new binary code distance function, spherical Hamming distance, tailored for our hypersphere-based binary coding scheme, and design an efficient iterative optimization process to achieve both balanced partitioning for each hash function and independence between hashing functions. Furthermore, we generalize spherical hashing to support various similarity measures defined by kernel functions. Our extensive experiments show that our spherical hashing technique significantly outperforms state-of-the-art techniques based on hyperplanes across various benchmarks with sizes ranging from one to 75 million of GIST, BoW and VLAD descriptors. The performance gains are consistent and large, up to 100 percent improvements over the second best method among tested methods. These results confirm the unique merits of using hyperspheres to encode proximity regions in high-dimensional spaces. Finally, our method is intuitive and easy to implement.
最近,许多二进制码嵌入方案得到了积极研究,因为它们可以提供高效的相似性搜索和紧凑的数据表示,适用于处理大规模图像数据库。现有的二进制码嵌入技术通过使用基于超平面的哈希函数对高维数据进行编码。在本文中,我们提出了一种新颖的基于超球的哈希函数,即球哈希,与基于超平面的哈希函数相比,它可以将更具有空间一致性的数据点映射到二进制码中。我们还提出了一种新的二进制码距离函数,即球汉明距离,专门针对我们的基于超球的二进制编码方案,并设计了一种高效的迭代优化过程,以实现每个哈希函数的平衡分区和哈希函数之间的独立性。此外,我们将球哈希推广到支持由核函数定义的各种相似性度量。我们的广泛实验表明,我们的球哈希技术在各种基准测试中,无论是在大小为 1 到 7500 万的 GIST、BoW 和 VLAD 描述符中,都明显优于基于超平面的最新技术。性能提升是一致的,在测试方法中,与排名第二的方法相比,最高可达 100%的提升。这些结果证实了使用超球在高维空间中编码邻近区域的独特优势。最后,我们的方法直观且易于实现。