Fiorentino Maria Chiara, Migliorelli Giovanna, Villani Francesca Pia, Frontoni Emanuele, Moccia Sara
Department of Information Engineering, Università Politecnica delle Marche, Ancona, Italy.
Department of Law, Università degli Studi di Macerata, Macerata, Italy.
Int J Comput Assist Radiol Surg. 2025 May 21. doi: 10.1007/s11548-025-03400-6.
This study aims to improve federated learning (FL) for ultrasound fetal standard plane detection by addressing noisy labels and data size variability across decentralized clients. We propose a federated denoising framework leveraging prototypes from the largest dataset in the federation to refine noisy labels and enhance predictions in all clients while preserving privacy.
The proposed framework consists of two main steps. First, contrastive learning (SimCLR) is applied to the images of the largest client, generating robust embeddings. These embeddings are used to refine noisy labels in the same client by leveraging the latent space structure using a threshold-based k-nearest neighbors re-labeling strategy. As a second step, image prototypes, computed from the embeddings with noise-free labels, along with SimCLR trained backbone, are shared with the smallest client to guide the FL process effectively, without requiring the use of labels from the smallest client. To address possible image distribution shifts, an ensemble strategy is introduced, which uses a majority voting scheme to optimize label refinement in the smallest dataset while minimizing image discard.
Our framework showed improved performance compared to traditional FL approaches in standard plane detection, achieving the highest mean F1-score across planes.
The proposed strategy effectively improves fetal standard plane detection by leveraging high-quality prototypes, enabling robust performance even with noisy and heterogeneous data size across clients, while preserving privacy.
本研究旨在通过解决分散客户端之间的噪声标签和数据大小变异性问题,改进用于超声胎儿标准平面检测的联邦学习(FL)。我们提出了一个联邦去噪框架,利用联邦中最大数据集的原型来细化噪声标签,并在保护隐私的同时增强所有客户端的预测。
所提出的框架包括两个主要步骤。首先,将对比学习(SimCLR)应用于最大客户端的图像,生成鲁棒的嵌入。通过使用基于阈值的k近邻重新标记策略利用潜在空间结构,这些嵌入用于在同一客户端中细化噪声标签。作为第二步,从具有无噪声标签的嵌入中计算出的图像原型,连同经过SimCLR训练的主干,被共享给最小的客户端,以有效地指导联邦学习过程,而无需使用最小客户端的标签。为了解决可能的图像分布偏移问题,引入了一种集成策略,该策略使用多数投票方案来优化最小数据集中的标签细化,同时最小化图像丢弃。
与传统的联邦学习方法相比,我们的框架在标准平面检测中表现出更好的性能,在所有平面上实现了最高的平均F1分数。
所提出的策略通过利用高质量的原型有效地改进了胎儿标准平面检测,即使在客户端之间存在噪声和异构数据大小的情况下也能实现稳健的性能,同时保护了隐私。