Commonwealth Scientific and Industrial Research Organisation, Data61, Sydney 2122, Australia.
School of Chemical Engineering, The University of New South Wales, Sydney 2052, Australia.
Sensors (Basel). 2023 Apr 27;23(9):4346. doi: 10.3390/s23094346.
The use of artificial intelligence (AI) to detect phishing emails is primarily dependent on large-scale centralized datasets, which has opened it up to a myriad of privacy, trust, and legal issues. Moreover, organizations have been loath to share emails, given the risk of leaking commercially sensitive information. Consequently, it has been difficult to obtain sufficient emails to train a global AI model efficiently. Accordingly, privacy-preserving distributed and collaborative machine learning, particularly federated learning (FL), is a desideratum. As it is already prevalent in the healthcare sector, questions remain regarding the effectiveness and efficacy of FL-based phishing detection within the context of multi-organization collaborations. To the best of our knowledge, the work herein was the first to investigate the use of FL in phishing email detection. This study focused on building upon a deep neural network model, particularly recurrent convolutional neural network (RNN) and bidirectional encoder representations from transformers (BERT), for phishing email detection. We analyzed the FL-entangled learning performance in various settings, including (i) a balanced and asymmetrical data distribution among organizations and (ii) scalability. Our results corroborated the comparable performance statistics of FL in phishing email detection to centralized learning for balanced datasets and low organizational counts. Moreover, we observed a variation in performance when increasing the organizational counts. For a fixed total email dataset, the global RNN-based model had a 1.8% accuracy decrease when the organizational counts were increased from 2 to 10. In contrast, BERT accuracy increased by 0.6% when increasing organizational counts from 2 to 5. However, if we increased the overall email dataset by introducing new organizations in the FL framework, the organizational level performance improved by achieving a faster convergence speed. In addition, FL suffered in its overall global model performance due to highly unstable outputs if the email dataset distribution was highly asymmetric.
人工智能(AI)用于检测网络钓鱼电子邮件主要依赖于大规模集中式数据集,这使其面临着众多隐私、信任和法律问题。此外,由于担心泄露商业敏感信息,组织不愿共享电子邮件。因此,很难获得足够的电子邮件来有效地训练全球 AI 模型。因此,需要使用隐私保护的分布式和协作机器学习,特别是联邦学习(FL)。由于它已经在医疗保健领域得到广泛应用,因此在多组织协作的背景下,基于 FL 的网络钓鱼检测的有效性和功效仍然存在疑问。据我们所知,本文首次研究了在网络钓鱼电子邮件检测中使用 FL。本研究侧重于在深度神经网络模型(特别是递归卷积神经网络(RNN)和来自转换器的双向编码器表示(BERT))的基础上构建,用于网络钓鱼电子邮件检测。我们分析了 FL 纠缠学习在各种设置中的性能,包括(i)组织之间的平衡和非对称数据分布,以及(ii)可扩展性。我们的结果证实了 FL 在网络钓鱼电子邮件检测中的性能与集中式学习在平衡数据集和低组织计数方面的可比统计数据。此外,我们观察到随着组织数量的增加,性能会发生变化。对于固定的总电子邮件数据集,当组织数量从 2 增加到 10 时,基于 RNN 的全局模型的准确性下降了 1.8%。相比之下,当组织数量从 2 增加到 5 时,BERT 的准确性增加了 0.6%。但是,如果我们在 FL 框架中引入新的组织来增加总体电子邮件数据集,则可以通过实现更快的收敛速度来提高组织级别的性能。此外,如果电子邮件数据集分布高度不对称,FL 的整体全局模型性能会受到高度不稳定输出的影响。