• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有嵌入共识的跨模态语义自动编码器。

Cross-modal semantic autoencoder with embedding consensus.

作者信息

Sun Shengzi, Guo Binghui, Mi Zhilong, Zheng Zhiming

机构信息

Beijing Advanced Innovation Center for Big Data and Brain Computing and NLSDE, Beihang University, Beijing, 100191, China.

Peng Cheng Laboratory, Shenzhen, 518055, Guangdong Province, China.

出版信息

Sci Rep. 2021 Oct 13;11(1):20319. doi: 10.1038/s41598-021-92750-7.

DOI:10.1038/s41598-021-92750-7
PMID:34645836
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8514517/
Abstract

Cross-modal retrieval has become a topic of popularity, since multi-data is heterogeneous and the similarities between different forms of information are worthy of attention. Traditional single-modal methods reconstruct the original information and lack of considering the semantic similarity between different data. In this work, a cross-modal semantic autoencoder with embedding consensus (CSAEC) is proposed, mapping the original data to a low-dimensional shared space to retain semantic information. Considering the similarity between the modalities, an automatic encoder is utilized to associate the feature projection to the semantic code vector. In addition, regularization and sparse constraints are applied to low-dimensional matrices to balance reconstruction errors. The high dimensional data is transformed into semantic code vector. Different models are constrained by parameters to achieve denoising. The experiments on four multi-modal data sets show that the query results are improved and effective cross-modal retrieval is achieved. Further, CSAEC can also be applied to fields related to computer and network such as deep and subspace learning. The model breaks through the obstacles in traditional methods, using deep learning methods innovatively to convert multi-modal data into abstract expression, which can get better accuracy and achieve better results in recognition.

摘要

跨模态检索已成为一个热门话题,因为多数据具有异构性,不同形式信息之间的相似性值得关注。传统的单模态方法会重建原始信息,且缺乏对不同数据之间语义相似性的考虑。在这项工作中,提出了一种具有嵌入一致性的跨模态语义自动编码器(CSAEC),将原始数据映射到低维共享空间以保留语义信息。考虑到模态之间的相似性,利用自动编码器将特征投影与语义代码向量相关联。此外,对低维矩阵应用正则化和稀疏约束以平衡重建误差。高维数据被转换为语义代码向量。不同模型通过参数进行约束以实现去噪。在四个多模态数据集上的实验表明,查询结果得到了改善,实现了有效的跨模态检索。此外,CSAEC还可应用于计算机和网络相关领域,如深度学习和子空间学习。该模型突破了传统方法中的障碍,创新性地使用深度学习方法将多模态数据转换为抽象表达,在识别中能够获得更好的准确性并取得更好的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/5c0368b0cdb4/41598_2021_92750_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/382c7ab0073e/41598_2021_92750_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/bf6e07e4a49d/41598_2021_92750_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/78317d92e057/41598_2021_92750_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/5c0368b0cdb4/41598_2021_92750_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/382c7ab0073e/41598_2021_92750_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/bf6e07e4a49d/41598_2021_92750_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/78317d92e057/41598_2021_92750_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a01/8514517/5c0368b0cdb4/41598_2021_92750_Fig4_HTML.jpg

相似文献

1
Cross-modal semantic autoencoder with embedding consensus.具有嵌入共识的跨模态语义自动编码器。
Sci Rep. 2021 Oct 13;11(1):20319. doi: 10.1038/s41598-021-92750-7.
2
Joint Specifics and Consistency Hash Learning for Large-Scale Cross-Modal Retrieval.用于大规模跨模态检索的关节细节与一致性哈希学习
IEEE Trans Image Process. 2022;31:5343-5358. doi: 10.1109/TIP.2022.3195059. Epub 2022 Aug 16.
3
Scalable Discrete Matrix Factorization and Semantic Autoencoder for Cross-Media Retrieval.用于跨媒体检索的可扩展离散矩阵分解与语义自动编码器
IEEE Trans Cybern. 2022 Jul;52(7):5947-5960. doi: 10.1109/TCYB.2020.3032017. Epub 2022 Jul 4.
4
Graph Convolutional Multi-Label Hashing for Cross-Modal Retrieval.用于跨模态检索的图卷积多标签哈希
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):7997-8009. doi: 10.1109/TNNLS.2024.3421583. Epub 2025 May 2.
5
Deep Semantic-Preserving Reconstruction Hashing for Unsupervised Cross-Modal Retrieval.用于无监督跨模态检索的深度语义保持重构哈希
Entropy (Basel). 2020 Nov 7;22(11):1266. doi: 10.3390/e22111266.
6
Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval.基于层次语义交互的深度哈希网络用于跨模态检索。
PeerJ Comput Sci. 2021 May 25;7:e552. doi: 10.7717/peerj-cs.552. eCollection 2021.
7
Fine-Grained Cross-Modal Semantic Consistency in Natural Conservation Image Data from a Multi-Task Perspective.从多任务视角看自然保护图像数据中的细粒度跨模态语义一致性
Sensors (Basel). 2024 May 14;24(10):3130. doi: 10.3390/s24103130.
8
Keyword-Based Diverse Image Retrieval With Variational Multiple Instance Graph.基于变分多实例图的基于关键词的多样图像检索
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10528-10537. doi: 10.1109/TNNLS.2022.3168431. Epub 2023 Nov 30.
9
Bridging multimedia heterogeneity gap via Graph Representation Learning for cross-modal retrieval.通过图表示学习弥合多媒体异质鸿沟进行跨模态检索。
Neural Netw. 2021 Feb;134:143-162. doi: 10.1016/j.neunet.2020.11.011. Epub 2020 Nov 28.
10
Triplet-Based Deep Hashing Network for Cross-Modal Retrieval.用于跨模态检索的基于三元组的深度哈希网络。
IEEE Trans Image Process. 2018 Aug;27(8):3893-3903. doi: 10.1109/TIP.2018.2821921. Epub 2018 Apr 4.

本文引用的文献

1
Joint Feature Selection and Subspace Learning for Cross-Modal Retrieval.跨模态检索的联合特征选择与子空间学习。
IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2010-23. doi: 10.1109/TPAMI.2015.2505311. Epub 2015 Dec 3.
2
Multimodal Similarity-Preserving Hashing.多模态相似保持哈希。
IEEE Trans Pattern Anal Mach Intell. 2014 Apr;36(4):824-30. doi: 10.1109/TPAMI.2013.225.
3
On the role of correlation and abstraction in cross-modal multimedia retrieval.在跨模态多媒体检索中的相关性和抽象性的作用。
IEEE Trans Pattern Anal Mach Intell. 2014 Mar;36(3):521-35. doi: 10.1109/TPAMI.2013.142.
4
Canonical correlation analysis: an overview with application to learning methods.典型相关分析:概述及其在学习方法中的应用
Neural Comput. 2004 Dec;16(12):2639-64. doi: 10.1162/0899766042321814.