Wu Gengshen, Lin Zijia, Ding Guiguang, Ni Qiang, Han Jungong
IEEE Trans Image Process. 2020 Sep 25;PP. doi: 10.1109/TIP.2020.3025437.
Despite the thrilling success achieved by existing binary descriptors, most of them are still in the mire of three limitations: 1) vulnerable to the geometric transformations; 2) incapable of preserving the manifold structure when learning binary codes; 3) NO guarantee to find the true match if multiple candidates happen to have the same Hamming distance to a given query. All these together make the binary descriptor less effective, given large-scale visual recognition tasks. In this paper, we propose a novel learning-based feature descriptor, namely Unsupervised Deep Binary Descriptor (UDBD), which learns transformation invariant binary descriptors via projecting the original data and their transformed sets into a joint binary space. Moreover, we involve a ℓ2,1-norm loss term in the binary embedding process to gain simultaneously the robustness against data noises and less probability of mistakenly flipping bits of the binary descriptor, on top of it, a graph constraint is used to preserve the original manifold structure in the binary space. Furthermore, a weak bit mechanism is adopted to find the real match from candidates sharing the same minimum Hamming distance, thus enhancing matching performance. Extensive experimental results on public datasets show the superiority of UDBD in terms of matching and retrieval accuracy over state-of-the-arts.
尽管现有二进制描述符取得了令人兴奋的成功,但它们中的大多数仍深陷三个局限之中:1)易受几何变换影响;2)在学习二进制编码时无法保留流形结构;3)如果多个候选与给定查询的汉明距离恰好相同,则无法保证找到真正匹配项。考虑到大规模视觉识别任务,所有这些因素共同导致二进制描述符的有效性降低。在本文中,我们提出了一种新颖的基于学习的特征描述符,即无监督深度二进制描述符(UDBD),它通过将原始数据及其变换后的集合投影到联合二进制空间中来学习变换不变的二进制描述符。此外,我们在二进制嵌入过程中引入了一个ℓ2,1范数损失项,以同时获得对数据噪声的鲁棒性,并降低二进制描述符误翻转位的概率,在此基础上,使用图约束来保留二进制空间中的原始流形结构。此外采用弱位机制从具有相同最小汉明距离的候选中找到真正匹配项,从而提高匹配性能。在公共数据集上的大量实验结果表明,UDBD在匹配和检索准确性方面优于现有技术。