Islam Humayera, Mosa Abu
Institute for Data Science and Informatics.
Center for Biomedical Informatics; University of Missouri School of Medicine, Columbia, Missouri.
AMIA Annu Symp Proc. 2022 Feb 21;2021:556-564. eCollection 2021.
Chronic diabetes can lead to microvascular complications, including diabetic eye disease, diabetic kidney disease, and diabetic neuropathy. However, the long-term complications often remain undetected at the early stages of diagnosis. Developing a machine learning model to identify the patients at high risk of developing diabetes-related complications can help design better treatment interventions. Building robust machine learning models require large datasets which further requires sharing data among different healthcare systems, hence, involving privacy and confidentiality concerns. The main of this study is to design a decentralized privacy-protected federated learning architecture that can deliver comparable performance to centralized learning. We demonstrate the potential of adopting federated learning to address the challenges such as class-imbalance in using real-world clinical data. In all our experiments, federated learning showed comparable performance to the gold-standard of centralized learning, and applying class balancing techniques improved performance across all cohorts.
慢性糖尿病会导致微血管并发症,包括糖尿病眼病、糖尿病肾病和糖尿病神经病变。然而,这些长期并发症在诊断早期往往仍未被发现。开发一种机器学习模型来识别有发生糖尿病相关并发症高风险的患者,有助于设计更好的治疗干预措施。构建强大的机器学习模型需要大型数据集,而这进一步要求在不同医疗系统之间共享数据,因此涉及隐私和保密问题。本研究的主要目的是设计一种去中心化的隐私保护联邦学习架构,其性能可与集中式学习相媲美。我们展示了采用联邦学习来应对使用真实世界临床数据时出现的诸如类别不平衡等挑战的潜力。在我们所有的实验中,联邦学习表现出与集中式学习的金标准相当的性能,并且应用类别平衡技术提高了所有队列的性能。