Liang Meiyu, Du Junping, Yang Congxian, Xue Zhe, Li Haisheng, Kou Feifei, Geng Yue
IEEE Trans Neural Netw Learn Syst. 2020 Sep;31(9):3634-3648. doi: 10.1109/TNNLS.2019.2945567. Epub 2019 Dec 11.
Cross-media search from large-scale social network big data has become increasingly valuable in our daily life because it can support querying different data modalities. Deep hash networks have shown high potential in achieving efficient and effective cross-media search performance. However, due to the fact that social network data often exhibit text sparsity, diversity, and noise characteristics, the search performance of existing methods often degrades when dealing with this data. In order to address this problem, this article proposes a novel end-to-end cross-media semantic correlation learning model based on a deep hash network and semantic expansion for social network cross-media search (DHNS). The approach combines deep network feature learning and hash-code quantization learning for multimodal data into a unified optimization architecture, which successfully preserves both intramedia similarity and intermedia correlation, by minimizing both cross-media correlation loss and binary hash quantization loss. In addition, our approach realizes semantic relationship expansion by constructing the image-word relation graph and mining the potential semantic relationship between images and words, and obtaining the semantic embedding based on both internal graph deep walk and an external knowledge base. Experimental results demonstrate that DHNS yields better cross-media search performance on standard benchmarks.
跨媒体搜索大规模社交网络大数据在我们日常生活中变得越来越有价值,因为它可以支持查询不同的数据模态。深度哈希网络在实现高效且有效的跨媒体搜索性能方面已显示出巨大潜力。然而,由于社交网络数据通常表现出文本稀疏性、多样性和噪声特征,现有方法在处理此类数据时搜索性能往往会下降。为了解决这个问题,本文提出了一种基于深度哈希网络和语义扩展的新颖端到端跨媒体语义关联学习模型,用于社交网络跨媒体搜索(DHNS)。该方法将多模态数据的深度网络特征学习和哈希码量化学习结合到一个统一的优化架构中,通过最小化跨媒体关联损失和二进制哈希量化损失,成功保留了媒体内相似性和媒体间相关性。此外,我们的方法通过构建图像 - 词关系图并挖掘图像与词之间的潜在语义关系,以及基于内部图深度游走和外部知识库获取语义嵌入,实现了语义关系扩展。实验结果表明,DHNS在标准基准上产生了更好的跨媒体搜索性能。