Department of Biomedical Data Science at Stanford University, Stanford, CA 94305, USA.
Department of Biomedical Data Science and Department of Radiology at Stanford University, Stanford, CA 94305, USA.
Med Image Anal. 2022 May;78:102424. doi: 10.1016/j.media.2022.102424. Epub 2022 Mar 22.
Collaborative learning, which enables collaborative and decentralized training of deep neural networks at multiple institutions in a privacy-preserving manner, is rapidly emerging as a valuable technique in healthcare applications. However, its distributed nature often leads to significant heterogeneity in data distributions across institutions. In this paper, we present a novel generative replay strategy to address the challenge of data heterogeneity in collaborative learning methods. Different from traditional methods that directly aggregating the model parameters, we leverage generative adversarial learning to aggregate the knowledge from all the local institutions. Specifically, instead of directly training a model for task performance, we develop a novel dual model architecture: a primary model learns the desired task, and an auxiliary "generative replay model" allows aggregating knowledge from the heterogenous clients. The auxiliary model is then broadcasted to the central sever, to regulate the training of primary model with an unbiased target distribution. Experimental results demonstrate the capability of the proposed method in handling heterogeneous data across institutions. On highly heterogeneous data partitions, our model achieves ∼4.88% improvement in the prediction accuracy on a diabetic retinopathy classification dataset, and ∼49.8% reduction of mean absolution value on a Bone Age prediction dataset, respectively, compared to the state-of-the art collaborative learning methods.
协作学习是一种在保护隐私的情况下,使多个机构能够协作和分散式地训练深度神经网络的方法,它在医疗保健应用中迅速成为一种有价值的技术。然而,它的分布式本质通常会导致机构之间的数据分布存在显著的异质性。在本文中,我们提出了一种新的生成重放策略,以解决协作学习方法中数据异质性的挑战。与直接聚合模型参数的传统方法不同,我们利用生成对抗学习来聚合来自所有本地机构的知识。具体来说,我们不是直接为任务性能训练模型,而是开发了一种新的双模型架构:主模型学习所需的任务,辅助“生成重放模型”允许从异构客户端聚合知识。然后将辅助模型广播到中央服务器,使用无偏目标分布来调节主模型的训练。实验结果证明了该方法在处理机构间异质数据方面的能力。在高度异质的数据分区上,与最先进的协作学习方法相比,我们的模型在糖尿病视网膜病变分类数据集上的预测精度提高了约 4.88%,在骨龄预测数据集上的平均绝对误差值降低了约 49.8%。