Liu Yezi, Chen Hanning, Imani Mohsen
Department of Electrical Engineering and Computer Science, University of California, Irvine, Irvine, CA, United States.
Donald Bren School of Information and Computer Sciences, University of California, Irvine, Irvine, CA, United States.
Front Big Data. 2024 Oct 24;7:1489306. doi: 10.3389/fdata.2024.1489306. eCollection 2024.
Link prediction is a crucial task in network analysis, but it has been shown to be prone to biased predictions, particularly when links are unfairly predicted between nodes from different sensitive groups. In this paper, we study the fair link prediction problem, which aims to ensure that the predicted link probability is independent of the sensitive attributes of the connected nodes. Existing methods typically incorporate debiasing techniques within graph embeddings to mitigate this issue. However, training on large real-world graphs is already challenging, and adding fairness constraints can further complicate the process. To overcome this challenge, we propose FairLink, a method that learns a fairness-enhanced graph to bypass the need for debiasing during the link predictor's training. FairLink maintains link prediction accuracy by ensuring that the enhanced graph follows a training trajectory similar to that of the original input graph. Meanwhile, it enhances fairness by minimizing the absolute difference in link probabilities between node pairs within the same sensitive group and those between node pairs from different sensitive groups. Our extensive experiments on multiple large-scale graphs demonstrate that FairLink not only promotes fairness but also often achieves link prediction accuracy comparable to baseline methods. Most importantly, the enhanced graph exhibits strong generalizability across different GNN architectures. FairLink is highly scalable, making it suitable for deployment in real-world large-scale graphs, where maintaining both fairness and accuracy is critical.
链接预测是网络分析中的一项关键任务,但已被证明容易出现有偏差的预测,特别是当在来自不同敏感组的节点之间不公平地预测链接时。在本文中,我们研究公平链接预测问题,其目的是确保预测的链接概率与相连节点的敏感属性无关。现有方法通常在图嵌入中纳入去偏技术来缓解这个问题。然而,在大型真实世界图上进行训练已经具有挑战性,而添加公平性约束会使这个过程更加复杂。为了克服这一挑战,我们提出了FairLink,一种学习公平增强图的方法,以在链接预测器训练期间绕过去偏的需要。FairLink通过确保增强图遵循与原始输入图相似的训练轨迹来维持链接预测准确性。同时,它通过最小化同一敏感组内节点对与不同敏感组内节点对之间链接概率的绝对差异来提高公平性。我们在多个大规模图上进行的广泛实验表明,FairLink不仅促进了公平性,而且通常能达到与基线方法相当的链接预测准确性。最重要的是,增强图在不同的GNN架构中表现出很强的通用性。FairLink具有高度可扩展性,使其适合部署在真实世界的大规模图中,在其中维持公平性和准确性都至关重要。