Topaloglu Mustafa Y, Morrell Elisabeth M, Rajendran Suraj, Topaloglu Umit
Wake Forest University, Winston Salem, NC, United States.
Wake Forest School of Medicine, Winston Salem, NC, United States.
Front Artif Intell. 2021 Oct 6;4:746497. doi: 10.3389/frai.2021.746497. eCollection 2021.
Artificial Intelligence and its subdomain, Machine Learning (ML), have shown the potential to make an unprecedented impact in healthcare. Federated Learning (FL) has been introduced to alleviate some of the limitations of ML, particularly the capability to train on larger datasets for improved performance, which is usually cumbersome for an inter-institutional collaboration due to existing patient protection laws and regulations. Moreover, FL may also play a crucial role in circumventing ML's exigent bias problem by accessing underrepresented groups' data spanning geographically distributed locations. In this paper, we have discussed three FL challenges, namely: privacy of the model exchange, ethical perspectives, and legal considerations. Lastly, we have proposed a model that could aide in assessing data contributions of a FL implementation. In light of the expediency and adaptability of using the Sørensen-Dice Coefficient over the more limited (e.g., horizontal FL) and computationally expensive Shapley Values, we sought to demonstrate a new paradigm that we hope, will become invaluable for sharing any profit and responsibilities that may accompany a FL endeavor.
人工智能及其子领域机器学习(ML)已显示出在医疗保健领域产生前所未有的影响的潜力。联邦学习(FL)已被引入以缓解ML的一些局限性,特别是在更大数据集上进行训练以提高性能的能力,由于现有的患者保护法律法规,这对于机构间合作通常很麻烦。此外,联邦学习还可以通过访问地理分布位置的代表性不足群体的数据,在规避机器学习严重的偏差问题方面发挥关键作用。在本文中,我们讨论了联邦学习的三个挑战,即:模型交换的隐私、伦理观点和法律考量。最后,我们提出了一个可以帮助评估联邦学习实施中数据贡献的模型。鉴于使用 Sørensen-Dice 系数比更有限的(例如水平联邦学习)和计算成本高昂的 Shapley 值更具便利性和适应性,我们试图展示一种新的范式,我们希望这种范式对于分享联邦学习努力可能带来的任何收益和责任将变得非常宝贵。