Rjoub Gaith, Wahab Omar Abdel, Bentahar Jamal, Cohen Robin, Bataineh Ahmed Saleh
Concordia Institute for Information Systems Engineering, Concordia University, 1455 De Maisonneuve Blvd. W.2, Montreal, H3G 1M8 Quebec Canada.
Department of Computer Science and Engineering, Université du Québec en Outaouais, 101, Saint-Jean-Bosco, C.P. 1250, succursale Hull, Gatineau, J8X 3X7 Quebec Canada.
Inf Syst Front. 2022 Jul 18:1-18. doi: 10.1007/s10796-022-10307-z.
In the context of distributed machine learning, the concept of federated learning (FL) has emerged as a solution to the privacy concerns that users have about sharing their own data with a third-party server. FL allows a group of users (often referred to as ) to locally train a single machine learning model on their devices without sharing their raw data. One of the main challenges in FL is how to select the most appropriate clients to participate in the training of a certain task. In this paper, we address this challenge and propose a trust-based deep reinforcement learning approach to select the most adequate clients in terms of resource consumption and training time. On top of the client selection mechanism, we embed a transfer learning approach to handle the scarcity of data in some regions and compensate potential lack of learning at some servers. We apply our solution in the healthcare domain in a COVID-19 detection scenario over IoT devices. In the considered scenario, edge servers collaborate with IoT devices to train a COVID-19 detection model using FL without having to share any raw confidential data. Experiments conducted on a real-world COVID-19 dataset reveal that our solution achieves a good trade-off between detection accuracy and model execution time compared to existing approaches.
在分布式机器学习的背景下,联邦学习(FL)的概念已成为一种解决方案,用于解决用户在与第三方服务器共享自己的数据时所面临的隐私问题。联邦学习允许一组用户(通常称为 )在其设备上本地训练单个机器学习模型,而无需共享其原始数据。联邦学习的主要挑战之一是如何选择最合适的客户端来参与特定任务的训练。在本文中,我们解决了这一挑战,并提出了一种基于信任的深度强化学习方法,以便在资源消耗和训练时间方面选择最合适的客户端。在客户端选择机制之上,我们嵌入了一种迁移学习方法,以处理某些区域数据稀缺的问题,并弥补某些服务器上潜在的学习不足。我们将我们的解决方案应用于医疗保健领域,在物联网设备上的新冠病毒检测场景中。在所考虑的场景中,边缘服务器与物联网设备协作,使用联邦学习训练新冠病毒检测模型,而无需共享任何原始机密数据。在真实世界的新冠病毒数据集上进行的实验表明,与现有方法相比,我们的解决方案在检测准确性和模型执行时间之间实现了良好的权衡。