Haripriya Rahul, Khare Nilay, Pandey Manish
Department of Computer Science and Engineering, Maulana Azad National Institute of Technology, Bhopal, 462003, India.
Sci Rep. 2025 Apr 11;15(1):12482. doi: 10.1038/s41598-025-97565-4.
Ensuring data privacy in medical image classification is a critical challenge in healthcare, especially with the increasing reliance on AI-driven diagnostics. In fact, over 30% of healthcare organizations globally have experienced a data breach in the last year, highlighting the need for secure solutions. This study investigates the integration of transfer learning and federated learning for privacy-preserving medical image classification using GoogLeNet and VGG16 as baseline models to evaluate the generalizability of the proposed framework. Pre-trained on ImageNet and fine-tuned on three specialized medical datasets for TB chest X-rays, brain tumor MRI scans, and diabetic retinopathy images, these models achieved high classification accuracy across various aggregation methods. Additionally, the proposed dynamic aggregation method was further analyzed using modern architectures, EfficientNetV2 and ResNet-RS, to assess the scalability and robustness of the model. A key contribution is the introduction of a novel adaptive aggregation method, which dynamically alternates between Federated Averaging (FedAvg) and Federated Stochastic Gradient Descent (FedSGD), based on data divergence during communication rounds. This approach optimizes model convergence while preserving privacy in collaborative settings. The results demonstrate that transfer learning, when combined with federated learning, offers a scalable, robust, and secure solution for real-world medical diagnostics, enabling healthcare institutions to train highly accurate models without compromising sensitive patient data.
在医疗保健领域,确保医学图像分类中的数据隐私是一项关键挑战,尤其是在对人工智能驱动的诊断依赖日益增加的情况下。事实上,全球超过30%的医疗保健组织在去年经历过数据泄露事件,这凸显了对安全解决方案的需求。本研究使用GoogLeNet和VGG16作为基线模型,研究迁移学习和联邦学习在隐私保护医学图像分类中的整合,以评估所提出框架的通用性。这些模型在ImageNet上进行预训练,并在用于肺结核胸部X光、脑肿瘤MRI扫描和糖尿病视网膜病变图像的三个专门医学数据集上进行微调,在各种聚合方法下均取得了较高的分类准确率。此外,使用现代架构EfficientNetV2和ResNet-RS对所提出的动态聚合方法进行了进一步分析,以评估模型的可扩展性和鲁棒性。一个关键贡献是引入了一种新颖的自适应聚合方法,该方法在通信轮次中根据数据差异在联邦平均(FedAvg)和联邦随机梯度下降(FedSGD)之间动态交替。这种方法在协作环境中优化模型收敛的同时保护隐私。结果表明,迁移学习与联邦学习相结合,为现实世界的医学诊断提供了一种可扩展、稳健且安全的解决方案,使医疗机构能够在不泄露敏感患者数据的情况下训练高精度模型。