IEEE Trans Pattern Anal Mach Intell. 2018 May;40(5):1059-1071. doi: 10.1109/TPAMI.2016.2645565. Epub 2016 Dec 28.
Recently, cross-modal search has attracted considerable attention but remains a very challenging task because of the integration complexity and heterogeneity of the multi-modal data. To address both challenges, in this paper, we propose a novel method termed hetero-manifold regularisation (HMR) to supervise the learning of hash functions for efficient cross-modal search. A hetero-manifold integrates multiple sub-manifolds defined by homogeneous data with the help of cross-modal supervision information. Taking advantages of the hetero-manifold, the similarity between each pair of heterogeneous data could be naturally measured by three order random walks on this hetero-manifold. Furthermore, a novel cumulative distance inequality defined on the hetero-manifold is introduced to avoid the computational difficulty induced by the discreteness of hash codes. By using the inequality, cross-modal hashing is transformed into a problem of hetero-manifold regularised support vector learning. Therefore, the performance of cross-modal search can be significantly improved by seamlessly combining the integrated information of the hetero-manifold and the strong generalisation of the support vector machine. Comprehensive experiments show that the proposed HMR achieve advantageous results over the state-of-the-art methods in several challenging cross-modal tasks.
最近,跨模态搜索引起了相当多的关注,但由于多模态数据的集成复杂性和异构性,仍然是一项极具挑战性的任务。为了解决这两个挑战,在本文中,我们提出了一种称为异质流形正则化(HMR)的新方法,以监督哈希函数的学习,从而实现高效的跨模态搜索。异质流形通过跨模态监督信息整合由同质地数据定义的多个子流形。利用异质流形,通过在该异质流形上进行三次随机游走,可以自然地测量每对异类数据之间的相似性。此外,我们还引入了一种新的异质流形上的累积距离不等式,以避免哈希码离散性引起的计算困难。通过使用该不等式,跨模态散列被转化为异质流形正则化支持向量学习的问题。因此,通过无缝结合异质流形的集成信息和支持向量机的强大泛化能力,可以显著提高跨模态搜索的性能。综合实验表明,所提出的 HMR 在几个具有挑战性的跨模态任务中优于最先进的方法。