• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态联邦学习:一项综述。

Multimodal Federated Learning: A Survey.

作者信息

Che Liwei, Wang Jiaqi, Zhou Yao, Ma Fenglong

机构信息

College of Information Sciences and Technology, Pennsylvania State University, University Park, PA 16802, USA.

Instacart, San Francisco, CA 94105, USA.

出版信息

Sensors (Basel). 2023 Aug 6;23(15):6986. doi: 10.3390/s23156986.

DOI:10.3390/s23156986
PMID:37571768
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10422520/
Abstract

Federated learning (FL), which provides a collaborative training scheme for distributed data sources with privacy concerns, has become a burgeoning and attractive research area. Most existing FL studies focus on taking unimodal data, such as image and text, as the model input and resolving the heterogeneity challenge, i.e., the challenge of non-identical distribution (non-IID) caused by a data distribution imbalance related to data labels and data amount. In real-world applications, data are usually described by multiple modalities. However, to the best of our knowledge, only a handful of studies have been conducted to improve system performance utilizing multimodal data. In this survey paper, we identify the significance of this emerging research topic of multimodal federated learning (MFL) and present a literature review on the state-of-art MFL methods. Furthermore, we categorize multimodal federated learning into congruent and incongruent multimodal federated learning based on whether all clients possess the same modal combinations. We investigate the feasible application tasks and related benchmarks for MFL. Lastly, we summarize the promising directions and fundamental challenges in this field for future research.

摘要

联邦学习(FL)为有隐私顾虑的分布式数据源提供了一种协作训练方案,已成为一个新兴且有吸引力的研究领域。大多数现有的联邦学习研究专注于将单模态数据(如图像和文本)作为模型输入,并解决异质性挑战,即由与数据标签和数据量相关的数据分布不平衡导致的非相同分布(非IID)挑战。在实际应用中,数据通常由多种模态描述。然而,据我们所知,仅有少数研究利用多模态数据来提高系统性能。在这篇综述论文中,我们确定了多模态联邦学习(MFL)这一新兴研究主题的重要性,并对当前最先进的多模态联邦学习方法进行文献综述。此外,我们根据所有客户端是否拥有相同的模态组合,将多模态联邦学习分为一致多模态联邦学习和不一致多模态联邦学习。我们研究了多模态联邦学习的可行应用任务和相关基准。最后,我们总结了该领域未来研究的有前景的方向和基本挑战。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/40c24e2baeab/sensors-23-06986-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/d23a80eca1d7/sensors-23-06986-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/164be9851203/sensors-23-06986-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/ae69f056aa29/sensors-23-06986-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/d22721b62bd9/sensors-23-06986-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/40c24e2baeab/sensors-23-06986-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/d23a80eca1d7/sensors-23-06986-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/164be9851203/sensors-23-06986-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/ae69f056aa29/sensors-23-06986-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/d22721b62bd9/sensors-23-06986-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ff/10422520/40c24e2baeab/sensors-23-06986-g005.jpg

相似文献

1
Multimodal Federated Learning: A Survey.多模态联邦学习:一项综述。
Sensors (Basel). 2023 Aug 6;23(15):6986. doi: 10.3390/s23156986.
2
A Survey on Heterogeneity Taxonomy, Security and Privacy Preservation in the Integration of IoT, Wireless Sensor Networks and Federated Learning.物联网、无线传感器网络与联邦学习集成中的异构性分类、安全与隐私保护研究
Sensors (Basel). 2024 Feb 1;24(3):968. doi: 10.3390/s24030968.
3
Cross-Silo, Privacy-Preserving, and Lightweight Federated Multimodal System for the Identification of Major Depressive Disorder Using Audio and Electroencephalogram.用于使用音频和脑电图识别重度抑郁症的跨孤岛、隐私保护且轻量级的联邦多模态系统
Diagnostics (Basel). 2023 Dec 25;14(1):43. doi: 10.3390/diagnostics14010043.
4
Privacy preserving federated learning for full heterogeneity.针对完全异构性的隐私保护联邦学习。
ISA Trans. 2023 Oct;141:73-83. doi: 10.1016/j.isatra.2023.04.020. Epub 2023 Apr 20.
5
Federated Learning for Vehicular Internet of Things: Recent Advances and Open Issues.车联网的联邦学习:最新进展与开放问题
IEEE Comput Graph Appl. 2020 May 5. doi: 10.1109/OJCS.2020.2992630.
6
Combined Federated and Split Learning in Edge Computing for Ubiquitous Intelligence in Internet of Things: State-of-the-Art and Future Directions.边缘计算中联合联邦和分割学习在物联网普遍智能中的应用:现状和未来方向。
Sensors (Basel). 2022 Aug 10;22(16):5983. doi: 10.3390/s22165983.
7
A survey on federated learning: challenges and applications.联邦学习综述:挑战与应用
Int J Mach Learn Cybern. 2023;14(2):513-535. doi: 10.1007/s13042-022-01647-y. Epub 2022 Nov 11.
8
Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data.通过减少非独立同分布数据的类别不平衡来提高联邦学习中的性能提升。
Sensors (Basel). 2023 Jan 19;23(3):1152. doi: 10.3390/s23031152.
9
Secure and decentralized federated learning framework with non-IID data based on blockchain.基于区块链的具有非独立同分布数据的安全且去中心化联邦学习框架。
Heliyon. 2024 Feb 29;10(5):e27176. doi: 10.1016/j.heliyon.2024.e27176. eCollection 2024 Mar 15.
10
A Comprehensive Overview of IoT-Based Federated Learning: Focusing on Client Selection Methods.基于物联网的联邦学习综述:聚焦客户端选择方法
Sensors (Basel). 2023 Aug 17;23(16):7235. doi: 10.3390/s23167235.

引用本文的文献

1
A Fair Contribution Measurement Method for Federated Learning.一种用于联邦学习的公平贡献度量方法。
Sensors (Basel). 2024 Jul 31;24(15):4967. doi: 10.3390/s24154967.
2
Federated Multi-Label Learning (FMLL): Innovative Method for Classification Tasks in Animal Science.联邦多标签学习(FMLL):动物科学分类任务的创新方法。
Animals (Basel). 2024 Jul 9;14(14):2021. doi: 10.3390/ani14142021.
3
Addressing 6 challenges in generative AI for digital health: A scoping review.应对数字健康领域生成式人工智能的六大挑战:一项范围综述

本文引用的文献

1
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning.多模态基准测试:用于多模态表示学习的多尺度基准测试
Adv Neural Inf Process Syst. 2021 Dec;2021(DB1):1-20.
2
MIMIC-IV, a freely accessible electronic health record dataset.MIMIC-IV,一个可自由访问的电子健康记录数据集。
Sci Data. 2023 Jan 3;10(1):1. doi: 10.1038/s41597-022-01899-x.
3
Reducing variations in multi-center Alzheimer's disease classification with convolutional adversarial autoencoder.使用卷积对抗自动编码器减少多中心阿尔茨海默病分类中的变异性。
PLOS Digit Health. 2024 May 23;3(5):e0000503. doi: 10.1371/journal.pdig.0000503. eCollection 2024 May.
4
Learning across diverse biomedical data modalities and cohorts: Challenges and opportunities for innovation.跨多种生物医学数据模式和队列的学习:创新面临的挑战与机遇
Patterns (N Y). 2024 Jan 17;5(2):100913. doi: 10.1016/j.patter.2023.100913. eCollection 2024 Feb 9.
Med Image Anal. 2022 Nov;82:102585. doi: 10.1016/j.media.2022.102585. Epub 2022 Aug 28.
4
Towards Personalized Federated Learning.迈向个性化联邦学习。
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):9587-9603. doi: 10.1109/TNNLS.2022.3160699. Epub 2023 Nov 30.
5
The future of digital health with federated learning.联合学习助力数字健康的未来。
NPJ Digit Med. 2020 Sep 14;3:119. doi: 10.1038/s41746-020-00323-1. eCollection 2020.
6
Multi-attention Recurrent Network for Human Communication Comprehension.用于人类交流理解的多注意力循环网络。
Proc AAAI Conf Artif Intell. 2018 Feb;2018:5642-5649.
7
Robust and Communication-Efficient Federated Learning From Non-i.i.d. Data.来自非独立同分布数据的稳健且通信高效的联邦学习
IEEE Trans Neural Netw Learn Syst. 2020 Sep;31(9):3400-3413. doi: 10.1109/TNNLS.2019.2944481. Epub 2019 Nov 1.
8
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding.NTU RGB+D 120:用于三维人体活动理解的大规模基准测试。
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2684-2701. doi: 10.1109/TPAMI.2019.2916873. Epub 2019 May 14.
9
Multimodal Machine Learning: A Survey and Taxonomy.多模态机器学习:一项综述与分类法
IEEE Trans Pattern Anal Mach Intell. 2019 Feb;41(2):423-443. doi: 10.1109/TPAMI.2018.2798607. Epub 2018 Jan 25.
10
PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals.生理信号库、生理信号处理工具包和生理信号网络:复杂生理信号新研究资源的组成部分。
Circulation. 2000 Jun 13;101(23):E215-20. doi: 10.1161/01.cir.101.23.e215.