Suppr超能文献

学习为联合图像-文本检索嵌入语义相似度

Learning to Embed Semantic Similarity for Joint Image-Text Retrieval.

作者信息

Malali Noam, Keller Yosi

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):10252-10260. doi: 10.1109/TPAMI.2021.3132163. Epub 2022 Nov 7.

Abstract

We present a deep learning approach for learning the joint semantic embeddings of images and captions in a euclidean space, such that the semantic similarity is approximated by the L distances in the embedding space. For that, we introduce a metric learning scheme that utilizes multitask learning to learn the embedding of identical semantic concepts using a center loss. By introducing a differentiable quantization scheme into the end-to-end trainable network, we derive a semantic embedding of semantically similar concepts in euclidean space. We also propose a novel metric learning formulation using an adaptive margin hinge loss, that is refined during the training phase. The proposed scheme was applied to the MS-COCO, Flicke30K and Flickr8K datasets, and was shown to compare favorably with contemporary state-of-the-art approaches.

摘要

我们提出了一种深度学习方法,用于在欧几里得空间中学习图像和标题的联合语义嵌入,使得语义相似度可以通过嵌入空间中的L距离来近似。为此,我们引入了一种度量学习方案,该方案利用多任务学习通过中心损失来学习相同语义概念的嵌入。通过将可微量化方案引入到端到端可训练网络中,我们在欧几里得空间中得到了语义相似概念的语义嵌入。我们还提出了一种使用自适应边际铰链损失的新颖度量学习公式,该公式在训练阶段进行了优化。所提出的方案应用于MS-COCO、Flicke30K和Flickr8K数据集,并被证明与当代最先进的方法相比具有优势。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验