• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态数据融合的有效技术:比较分析。

Effective Techniques for Multimodal Data Fusion: A Comparative Analysis.

机构信息

Faculty of Mathematics and Information Science, Warsaw University of Technology, Koszykowa Street 75, 00-662 Warsaw, Poland.

WeSub, Adama Branickiego Street 17, 02-972 Warsaw, Poland.

出版信息

Sensors (Basel). 2023 Feb 21;23(5):2381. doi: 10.3390/s23052381.

DOI:10.3390/s23052381
PMID:36904585
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10007548/
Abstract

Data processing in robotics is currently challenged by the effective building of multimodal and common representations. Tremendous volumes of raw data are available and their smart management is the core concept of multimodal learning in a new paradigm for data fusion. Although several techniques for building multimodal representations have been proven successful, they have not yet been analyzed and compared in a given production setting. This paper explored three of the most common techniques, (1) the late fusion, (2) the early fusion, and (3) the sketch, and compared them in classification tasks. Our paper explored different types of data (modalities) that could be gathered by sensors serving a wide range of sensor applications. Our experiments were conducted on Amazon Reviews, MovieLens25M, and Movie-Lens1M datasets. Their outcomes allowed us to confirm that the choice of fusion technique for building multimodal representation is crucial to obtain the highest possible model performance resulting from the proper modality combination. Consequently, we designed criteria for choosing this optimal data fusion technique.

摘要

机器人的数据处理目前面临着有效构建多模态和通用表示的挑战。大量的原始数据可用,其智能管理是数据融合新范例中多模态学习的核心概念。尽管已经证明了几种构建多模态表示的技术是成功的,但它们尚未在给定的生产环境中进行分析和比较。本文探讨了三种最常见的技术,(1)晚期融合,(2)早期融合,和(3)草图,并在分类任务中对它们进行了比较。我们的论文探讨了可以由服务于各种传感器应用的传感器收集的不同类型的数据(模态)。我们的实验是在亚马逊评论、MovieLens25M 和 Movie-Lens1M 数据集上进行的。它们的结果使我们能够确认,选择融合技术来构建多模态表示对于获得最佳的模型性能是至关重要的,这是来自于适当的模态组合。因此,我们设计了选择这种最优数据融合技术的标准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/5a00285e99a9/sensors-23-02381-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/e5f14be28ec5/sensors-23-02381-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/c130e0070d77/sensors-23-02381-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/9d39305b8f40/sensors-23-02381-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/49727ad71e43/sensors-23-02381-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/5a00285e99a9/sensors-23-02381-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/e5f14be28ec5/sensors-23-02381-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/c130e0070d77/sensors-23-02381-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/9d39305b8f40/sensors-23-02381-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/49727ad71e43/sensors-23-02381-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b1/10007548/5a00285e99a9/sensors-23-02381-g005.jpg

相似文献

1
Effective Techniques for Multimodal Data Fusion: A Comparative Analysis.多模态数据融合的有效技术:比较分析。
Sensors (Basel). 2023 Feb 21;23(5):2381. doi: 10.3390/s23052381.
2
Development of Multimodal Fusion Technology for Tomato Maturity Assessment.用于番茄成熟度评估的多模态融合技术的开发
Sensors (Basel). 2024 Apr 11;24(8):2467. doi: 10.3390/s24082467.
3
Artificial intelligence-based methods for fusion of electronic health records and imaging data.基于人工智能的电子健康记录与医学影像数据融合方法。
Sci Rep. 2022 Oct 26;12(1):17981. doi: 10.1038/s41598-022-22514-4.
4
Multimodal information bottleneck for deep reinforcement learning with multiple sensors.多模态信息瓶颈用于多传感器的深度强化学习。
Neural Netw. 2024 Aug;176:106347. doi: 10.1016/j.neunet.2024.106347. Epub 2024 Apr 27.
5
Reducing Annotation Burden Through Multimodal Learning.通过多模态学习减轻标注负担。
Front Big Data. 2020 Jun 2;3:19. doi: 10.3389/fdata.2020.00019. eCollection 2020.
6
Sensor-Fusion for Smartphone Location Tracking Using Hybrid Multimodal Deep Neural Networks.使用混合多模态深度神经网络的智能手机位置跟踪传感器融合。
Sensors (Basel). 2021 Nov 11;21(22):7488. doi: 10.3390/s21227488.
7
MolPROP: Molecular Property prediction with multimodal language and graph fusion.MolPROP:通过多模态语言与图形融合进行分子属性预测。
J Cheminform. 2024 May 22;16(1):56. doi: 10.1186/s13321-024-00846-9.
8
A review of deep learning-based information fusion techniques for multimodal medical image classification.深度学习在多模态医学图像分类中的信息融合技术综述。
Comput Biol Med. 2024 Jul;177:108635. doi: 10.1016/j.compbiomed.2024.108635. Epub 2024 May 22.
9
End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis.端到端使用深度神经网络进行多模态临床抑郁症识别:比较分析。
Comput Methods Programs Biomed. 2021 Nov;211:106433. doi: 10.1016/j.cmpb.2021.106433. Epub 2021 Sep 28.
10
Multimodal Sentiment Analysis Based on Cross-Modal Attention and Gated Cyclic Hierarchical Fusion Networks.基于跨模态注意力和门控循环层次融合网络的多模态情感分析。
Comput Intell Neurosci. 2022 Aug 9;2022:4767437. doi: 10.1155/2022/4767437. eCollection 2022.

引用本文的文献

1
Intelligent sensing devices and systems for personalized mental health.用于个性化心理健康的智能传感设备与系统。
Med X. 2025 Dec;3(1). doi: 10.1007/s44258-025-00057-3. Epub 2025 Apr 2.
2
High-Accuracy Recognition Method for Diseased Chicken Feces Based on Image and Text Information Fusion.基于图像与文本信息融合的病鸡粪便高精度识别方法
Animals (Basel). 2025 Jul 22;15(15):2158. doi: 10.3390/ani15152158.
3
Multimodal fusion with relational learning for molecular property prediction.用于分子性质预测的基于关系学习的多模态融合

本文引用的文献

1
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning.多模态基准测试:用于多模态表示学习的多尺度基准测试
Adv Neural Inf Process Syst. 2021 Dec;2021(DB1):1-20.
2
A Review of Multisensor Data Fusion Solutions in Smart Manufacturing: Systems and Trends.多传感器数据融合解决方案在智能制造中的综述:系统与趋势。
Sensors (Basel). 2022 Feb 23;22(5):1734. doi: 10.3390/s22051734.
3
Multimodal deep learning for biomedical data fusion: a review.多模态深度学习在生物医学数据融合中的应用综述。
Commun Chem. 2025 Jul 5;8(1):200. doi: 10.1038/s42004-025-01586-z.
4
GPS: Harnessing data fusion strategies to improve the accuracy of machine learning-based genomic and phenotypic selection.GPS:利用数据融合策略提高基于机器学习的基因组和表型选择的准确性。
Plant Commun. 2025 Aug 11;6(8):101416. doi: 10.1016/j.xplc.2025.101416. Epub 2025 Jun 11.
5
Multimodal malware classification using proposed ensemble deep neural network framework.使用所提出的集成深度神经网络框架进行多模态恶意软件分类。
Sci Rep. 2025 May 23;15(1):18006. doi: 10.1038/s41598-025-96203-3.
6
Recent Advances in Vehicle Driver Health Monitoring Systems.车辆驾驶员健康监测系统的最新进展
Sensors (Basel). 2025 Mar 14;25(6):1812. doi: 10.3390/s25061812.
7
A multi-modal deep learning solution for precise pneumonia diagnosis: the PneumoFusion-Net model.一种用于精确肺炎诊断的多模态深度学习解决方案:PneumoFusion-Net模型。
Front Physiol. 2025 Mar 12;16:1512835. doi: 10.3389/fphys.2025.1512835. eCollection 2025.
8
Fault diagnosis in electric motors using multi-mode time series and ensemble transformers network.基于多模式时间序列和集成变压器网络的电动机故障诊断
Sci Rep. 2025 Mar 6;15(1):7834. doi: 10.1038/s41598-025-89695-6.
9
Advancing Emotionally Aware Child-Robot Interaction with Biophysical Data and Insight-Driven Affective Computing.借助生物物理数据和洞察驱动的情感计算推进情感感知型儿童与机器人的互动
Sensors (Basel). 2025 Feb 14;25(4):1161. doi: 10.3390/s25041161.
10
Ultrasonic Weld Quality Inspection Involving Strength Prediction and Defect Detection in Data-Constrained Training Environments.数据受限训练环境下涉及强度预测和缺陷检测的超声波焊接质量检测
Sensors (Basel). 2024 Oct 11;24(20):6553. doi: 10.3390/s24206553.
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab569.
4
A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets.计算机视觉深度多模态学习综述:进展、趋势、应用及数据集
Vis Comput. 2022;38(8):2939-2970. doi: 10.1007/s00371-021-02166-7. Epub 2021 Jun 10.
5
Advances in multimodal data fusion in neuroimaging: Overview, challenges, and novel orientation.神经影像学中多模态数据融合的进展:概述、挑战及新方向。
Inf Fusion. 2020 Dec;64:149-187. doi: 10.1016/j.inffus.2020.07.006. Epub 2020 Jul 17.
6
A Survey on Deep Learning for Multimodal Data Fusion.深度学习在多模态数据融合中的研究综述。
Neural Comput. 2020 May;32(5):829-864. doi: 10.1162/neco_a_01273. Epub 2020 Mar 18.
7
Multimodal Machine Learning: A Survey and Taxonomy.多模态机器学习:一项综述与分类法
IEEE Trans Pattern Anal Mach Intell. 2019 Feb;41(2):423-443. doi: 10.1109/TPAMI.2018.2798607. Epub 2018 Jan 25.
8
Representation learning: a review and new perspectives.表示学习:综述与新视角。
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1798-828. doi: 10.1109/TPAMI.2013.50.