Deng Zhipeng, Yang Yuqiao, Suzuki Kenji
Biomedical Artificial Intelligence Research Unit (BMAI), Institute of Innovative Research, Tokyo Institute of Technology, Tokyo, Japan; Department of Information and Communications Engineering, School of Engineering, Tokyo Institute of Technology, Tokyo, Japan.
Biomedical Artificial Intelligence Research Unit (BMAI), Institute of Innovative Research, Tokyo Institute of Technology, Tokyo, Japan; Department of Information and Communications Engineering, School of Engineering, Tokyo Institute of Technology, Tokyo, Japan.
J Invest Dermatol. 2025 Feb;145(2):303-311. doi: 10.1016/j.jid.2024.05.023. Epub 2024 Jun 22.
Federated learning (FL) enables multiple institutes to train models collaboratively without sharing private data. Current FL research focuses on communication efficiency, privacy protection, and personalization and assumes that the data of FL have already been ideally collected. However, in medical scenarios, data annotation demands both expertise and intensive labor, which is a critical problem in FL. Active learning (AL) has shown promising performance in reducing the number of data annotations in medical image analysis. We propose a federated AL framework in which AL is executed periodically and interactively under FL. We exploit a local model in each hospital and a global model acquired from FL to construct an ensemble. We use ensemble entropy-based AL as an efficient data-annotation strategy in FL. Therefore, our federated AL framework can decrease the amount of annotated data and preserve patient privacy while maintaining the performance of FL. To our knowledge, this federated AL framework applied to medical images has not been previously reported. We validated our framework on real-world dermoscopic datasets. Using only 50% of samples, our framework was able to achieve state-of-the-art performance on a skin-lesion classification task. Our framework performed better than several state-of-the-art AL methods under FL and achieved comparable performance with full-data FL.
联邦学习(FL)使多个机构能够在不共享私有数据的情况下协作训练模型。当前的联邦学习研究集中在通信效率、隐私保护和个性化方面,并假设联邦学习的数据已经得到了理想的收集。然而,在医疗场景中,数据标注既需要专业知识又需要大量人力,这是联邦学习中的一个关键问题。主动学习(AL)在减少医学图像分析中的数据标注数量方面已显示出良好的性能。我们提出了一种联邦主动学习框架,其中主动学习在联邦学习下定期且交互式地执行。我们利用每个医院的本地模型和从联邦学习中获得的全局模型来构建一个集成模型。我们将基于集成熵的主动学习用作联邦学习中一种高效的数据标注策略。因此,我们的联邦主动学习框架可以减少标注数据的数量并保护患者隐私,同时保持联邦学习的性能。据我们所知,此前尚未报道过将这种联邦主动学习框架应用于医学图像的情况。我们在真实世界的皮肤镜数据集上验证了我们的框架。仅使用50%的样本,我们的框架就在皮肤病变分类任务上能够实现领先的性能。我们的框架在联邦学习下比几种领先的主动学习方法表现更好,并且与全数据联邦学习取得了相当的性能。