Gong Biao, Yan Chenggang, Bai Junjie, Zou Changqing, Gao Yue
IEEE Trans Image Process. 2020 Aug 5;PP. doi: 10.1109/TIP.2020.3013138.
Three-dimensional multi-modal data represent real-world 3D objects in different ways. Features extracted separately from each modality are often poorly correlated. Recent solutions that leverage the attention mechanism to learn a joint network for fusing multi-modality features generalize poorly. In this paper, we propose a Hamming embedding sensitivity network to address the problem of effectively fusing multi-modality features. The proposed network, called HamNet, is the first end-to-end framework that can, in principle, integrate data from all modalities within a unified architecture for 3D shape representation, and it can be used for 3D shape retrieval and recognition. HamNet uses a feature concealment module to achieve effective deep feature fusion. The basic idea of the concealment module is to re-weight the features from each modality at an early stage using the Hamming embeddings of these modalities. The Hamming embedding also provides an effective solution for fast retrieval on large-scale datasets. We evaluated the proposed method on the large-scale ModelNet40 dataset for 3D shape classification, single-modality retrieval, and cross-modality retrieval. Comprehensive experiments and comparisons with state-of-the-art methods demonstrate that the proposed approach achieves superior performance.
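To make the re-weighting idea concrete, the following is a minimal sketch, not the paper's actual architecture: it assumes each modality's feature is binarized into a Hamming code via a shared random projection, and that the Hamming agreement between modality codes drives a softmax weight used to fuse the features. All names (`hamming_embed`, `feat_view`, `feat_point`) and the specific weighting rule are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def hamming_embed(feat, proj):
    """Binarize a feature via a linear projection: sign -> {0, 1} code (assumed scheme)."""
    return (feat @ proj > 0).astype(np.uint8)

def hamming_similarity(a, b):
    """Fraction of matching bits between two binary codes."""
    return 1.0 - np.mean(a != b)

# Two modality features for one 3D shape (e.g. view-based and point-based).
d, bits = 128, 64
feat_view = rng.normal(size=d)
feat_point = rng.normal(size=d)

proj = rng.normal(size=(d, bits))   # shared projection so codes are comparable
code_view = hamming_embed(feat_view, proj)
code_point = hamming_embed(feat_point, proj)

# Re-weight each modality by its Hamming agreement with the other modality,
# so poorly correlated features contribute less to the fused representation.
sim = hamming_similarity(code_view, code_point)
w = np.exp([sim, 1.0 - sim])
w = w / w.sum()                     # softmax over the two modality weights

fused = w[0] * feat_view + w[1] * feat_point
print(fused.shape)
```

The compact binary codes also hint at why Hamming embeddings help large-scale retrieval: comparing 64-bit codes is far cheaper than comparing dense float features.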