Li Xingyu, Peng Lu, Wang Yu-Ping, Zhang Weihua
Department of Computer Science, Tulane University, New Orleans, LA, USA.
Department of Biomedical Engineering, Tulane University, New Orleans, LA, USA.
BioData Min. 2025 Jan 4;18(1):2. doi: 10.1186/s13040-024-00414-9.
This survey explores the transformative impact of foundation models (FMs) in artificial intelligence, focusing on their integration with federated learning (FL) in biomedical research. Foundation models such as ChatGPT, LLaMa, and CLIP, which are trained on vast datasets through methods including unsupervised pretraining, self-supervised learning, instructed fine-tuning, and reinforcement learning from human feedback, represent significant advancements in machine learning. These models, with their ability to generate coherent text and realistic images, are crucial for biomedical applications that require processing diverse data forms such as clinical reports, diagnostic images, and multimodal patient interactions. The incorporation of FL with these sophisticated models presents a promising strategy to harness their analytical power while safeguarding the privacy of sensitive medical data. This approach not only enhances the capabilities of FMs in medical diagnostics and personalized treatment but also addresses critical concerns about data privacy and security in healthcare. This survey reviews the current applications of FMs in federated settings, underscores the challenges, and identifies future research directions including scaling FMs, managing data diversity, and enhancing communication efficiency within FL frameworks. The objective is to encourage further research into the combined potential of FMs and FL, laying the groundwork for healthcare innovations.
本次调查探讨了基础模型(FMs)在人工智能领域的变革性影响,重点关注其在生物医学研究中与联邦学习(FL)的整合。诸如ChatGPT、LLaMa和CLIP等基础模型,通过无监督预训练、自监督学习、指令微调以及基于人类反馈的强化学习等方法在海量数据集上进行训练,代表了机器学习领域的重大进展。这些模型能够生成连贯的文本和逼真的图像,对于需要处理临床报告、诊断图像和多模态患者交互等多种数据形式的生物医学应用至关重要。将联邦学习与这些复杂模型相结合,是一种颇具前景的策略,既能利用其分析能力,又能保护敏感医疗数据的隐私。这种方法不仅增强了基础模型在医学诊断和个性化治疗方面的能力,还解决了医疗保健领域对数据隐私和安全的关键担忧。本次调查回顾了基础模型在联邦环境中的当前应用,强调了挑战,并确定了未来的研究方向,包括扩展基础模型、管理数据多样性以及提高联邦学习框架内的通信效率。目的是鼓励对基础模型和联邦学习的联合潜力进行进一步研究,为医疗保健创新奠定基础。