Suppr超能文献

通过伪标签去噪实现医学图像分割的联邦半监督学习

Federated Semi-Supervised Learning for Medical Image Segmentation via Pseudo-Label Denoising.

作者信息

Qiu Liang, Cheng Jierong, Gao Huxin, Xiong Wei, Ren Hongliang

出版信息

IEEE J Biomed Health Inform. 2023 Oct;27(10):4672-4683. doi: 10.1109/JBHI.2023.3274498. Epub 2023 Oct 5.

Abstract

Distributed big data and digital healthcare technologies have great potential to promote medical services, but challenges arise when it comes to learning predictive model from diverse and complex e-health datasets. Federated Learning (FL), as a collaborative machine learning technique, aims to address the challenges by learning a joint predictive model across multi-site clients, especially for distributed medical institutions or hospitals. However, most existing FL methods assume that clients possess fully labeled data for training, which is often not the case in e-health datasets due to high labeling costs or expertise requirement. Therefore, this work proposes a novel and feasible approach to learn a Federated Semi-Supervised Learning (FSSL) model from distributed medical image domains, where a federated pseudo-labeling strategy for unlabeled clients is developed based on the embedded knowledge learned from labeled clients. This greatly mitigates the annotation deficiency at unlabeled clients and leads to a cost-effective and efficient medical image analysis tool. We demonstrated the effectiveness of our method by achieving significant improvements compared to the state-of-the-art in both fundus image and prostate MRI segmentation tasks, resulting in the highest Dice scores of 89.23% and 91.95% respectively even with only a few labeled clients participating in model training. This reveals the superiority of our method for practical deployment, ultimately facilitating the wider use of FL in healthcare and leading to better patient outcomes.

摘要

分布式大数据和数字医疗技术在促进医疗服务方面具有巨大潜力,但在从多样且复杂的电子健康数据集中学习预测模型时会面临挑战。联邦学习(FL)作为一种协作式机器学习技术,旨在通过跨多站点客户端学习联合预测模型来应对这些挑战,特别是针对分布式医疗机构或医院。然而,现有的大多数联邦学习方法都假定客户端拥有用于训练的完全标注数据,而在电子健康数据集中,由于标注成本高或需要专业知识,情况往往并非如此。因此,这项工作提出了一种新颖且可行的方法,用于从分布式医学图像领域学习联邦半监督学习(FSSL)模型,其中基于从已标注客户端学到的嵌入知识,为未标注客户端开发了一种联邦伪标注策略。这极大地缓解了未标注客户端的标注不足问题,并产生了一种经济高效的医学图像分析工具。我们通过在眼底图像和前列腺MRI分割任务中与现有最先进方法相比取得显著改进,证明了我们方法的有效性,即使只有少数已标注客户端参与模型训练,也分别获得了高达89.23%和91.95%的最高Dice分数。这揭示了我们的方法在实际部署中的优越性,最终促进了联邦学习在医疗保健中的更广泛应用,并带来更好的患者治疗效果。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验