Brain and Mind Centre, The University of Sydney, NSW 2050, Australia; School of Electrical and Information Engineering, The University of Sydney, NSW 2006, Australia.
Brain and Mind Centre, The University of Sydney, NSW 2050, Australia; Sydney Neuroimaging Analysis Centre, 94 Mallett Street, NSW 2050, Australia.
Artif Intell Med. 2024 Jun;152:102872. doi: 10.1016/j.artmed.2024.102872. Epub 2024 Apr 17.
Accurately measuring the evolution of Multiple Sclerosis (MS) with magnetic resonance imaging (MRI) critically informs understanding of disease progression and helps to direct therapeutic strategy. Deep learning models have shown promise for automatically segmenting MS lesions, but the scarcity of accurately annotated data hinders progress in this area. Obtaining sufficient data from a single clinical site is challenging and does not address the heterogeneous need for model robustness. Conversely, the collection of data from multiple sites introduces data privacy concerns and potential label noise due to varying annotation standards. To address this dilemma, we explore the use of the federated learning framework while considering label noise. Our approach enables collaboration among multiple clinical sites without compromising data privacy under a federated learning paradigm that incorporates a noise-robust training strategy based on label correction. Specifically, we introduce a Decoupled Hard Label Correction (DHLC) strategy that considers the imbalanced distribution and fuzzy boundaries of MS lesions, enabling the correction of false annotations based on prediction confidence. We also introduce a Centrally Enhanced Label Correction (CELC) strategy, which leverages the aggregated central model as a correction teacher for all sites, enhancing the reliability of the correction process. Extensive experiments conducted on two multi-site datasets demonstrate the effectiveness and robustness of our proposed methods, indicating their potential for clinical applications in multi-site collaborations to train better deep learning models with lower cost in data collection and annotation.
准确地使用磁共振成像(MRI)测量多发性硬化症(MS)的演变对于理解疾病进展至关重要,并有助于指导治疗策略。深度学习模型在自动分割 MS 病变方面显示出了潜力,但由于缺乏准确标注的数据,这一领域的进展受到了阻碍。从单个临床站点获取足够的数据具有挑战性,并且不能解决模型稳健性的异质性需求。相反,从多个站点收集数据会引入数据隐私问题和潜在的标签噪声,这是由于标注标准的不同。为了解决这个困境,我们在考虑标签噪声的情况下探索使用联邦学习框架。我们的方法允许在不损害数据隐私的情况下,在联邦学习范例下,通过基于标签校正的稳健训练策略,在多个临床站点之间进行协作。具体来说,我们引入了一种解耦硬标签校正(DHLC)策略,该策略考虑了 MS 病变的不平衡分布和模糊边界,能够根据预测置信度纠正错误的注释。我们还引入了一种集中增强标签校正(CELC)策略,该策略利用聚合的中央模型作为所有站点的校正教师,增强了校正过程的可靠性。在两个多站点数据集上进行的广泛实验证明了我们提出的方法的有效性和稳健性,表明它们有可能在多站点合作中用于临床应用,以较低的数据收集和注释成本训练更好的深度学习模型。