Handling data heterogeneity with generative replay in collaborative learning for medical imaging.

Affiliations

Department of Biomedical Data Science at Stanford University, Stanford, CA 94305, USA.

Department of Biomedical Data Science and Department of Radiology at Stanford University, Stanford, CA 94305, USA.

Publication information

Med Image Anal. 2022 May;78:102424. doi: 10.1016/j.media.2022.102424. Epub 2022 Mar 22.

DOI:10.1016/j.media.2022.102424

PMID:35390737

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9814954/

Abstract

Collaborative learning, which enables collaborative and decentralized training of deep neural networks at multiple institutions in a privacy-preserving manner, is rapidly emerging as a valuable technique in healthcare applications. However, its distributed nature often leads to significant heterogeneity in data distributions across institutions. In this paper, we present a novel generative replay strategy to address the challenge of data heterogeneity in collaborative learning methods. Unlike traditional methods that directly aggregate the model parameters, we leverage generative adversarial learning to aggregate the knowledge from all the local institutions. Specifically, instead of directly training a model for task performance, we develop a novel dual model architecture: a primary model learns the desired task, and an auxiliary "generative replay model" aggregates knowledge from the heterogeneous clients. The auxiliary model is then broadcast to the central server to regulate the training of the primary model with an unbiased target distribution. Experimental results demonstrate the capability of the proposed method in handling heterogeneous data across institutions. On highly heterogeneous data partitions, our model achieves a ∼4.88% improvement in prediction accuracy on a diabetic retinopathy classification dataset and a ∼49.8% reduction in mean absolute error on a bone age prediction dataset, compared to state-of-the-art collaborative learning methods.
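
The replay idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: a per-class Gaussian fit stands in for the GAN-based generative replay model, a one-dimensional logistic regression stands in for the primary model, and every name below (`make_client_data`, `GaussianReplay`, `train_logreg`, `run_demo`) is hypothetical. The sketch assumes an extreme label skew in which each client holds a single class, and it only shows the data flow: clients share a generative model instead of raw data, and the server trains on samples drawn from all of them.

```python
import numpy as np


def make_client_data(label, n, rng):
    """Hypothetical client that holds only one class (extreme label skew)."""
    mean = -2.0 if label == 0 else 2.0
    return rng.normal(mean, 1.0, size=n), np.full(n, label)


class GaussianReplay:
    """Toy stand-in for a generative replay model (the paper uses a GAN)."""

    def fit(self, x, y):
        # Store a (mean, std) pair for every class seen at this client.
        self.stats = {
            int(c): (float(x[y == c].mean()), float(x[y == c].std()))
            for c in np.unique(y)
        }
        return self

    def sample(self, n, rng):
        # Draw n synthetic points per class; no raw client data is reused.
        xs = [rng.normal(mu, sd, size=n) for mu, sd in self.stats.values()]
        ys = [np.full(n, c) for c in self.stats]
        return np.concatenate(xs), np.concatenate(ys)


def train_logreg(x, y, steps=500, lr=0.1):
    """Plain gradient descent on a 1-D logistic regression (primary model)."""
    w, b = 0.0, 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(w * x + b)))
        w -= lr * np.mean((p - y) * x)
        b -= lr * np.mean(p - y)
    return w, b


def run_demo(seed=0):
    rng = np.random.default_rng(seed)
    # Two clients, each seeing a single class.
    clients = [make_client_data(0, 200, rng), make_client_data(1, 200, rng)]
    # Each client fits its own replay model locally and shares only that.
    replay_models = [GaussianReplay().fit(x, y) for x, y in clients]
    # The server samples from every replay model to approximate the
    # pooled (unbiased) distribution, then trains the primary model on it.
    synthetic = [m.sample(200, rng) for m in replay_models]
    x_syn = np.concatenate([x for x, _ in synthetic])
    y_syn = np.concatenate([y for _, y in synthetic])
    w, b = train_logreg(x_syn, y_syn)
    # Evaluate on a balanced held-out set drawn from the true distributions.
    test = [make_client_data(0, 500, rng), make_client_data(1, 500, rng)]
    x_test = np.concatenate([x for x, _ in test])
    y_test = np.concatenate([y for _, y in test])
    return float(np.mean(((w * x_test + b) > 0) == (y_test == 1)))


if __name__ == "__main__":
    print(f"accuracy with replayed samples: {run_demo():.3f}")
```

In this toy setting the primary model never sees real client data, only samples from the shared generative models, which is the property the abstract attributes to the auxiliary model.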

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ca3/9814954/fa962f185ee8/nihms-1858604-f0003.jpg
