Wang Yue, Wu Xintao
Software and Information Systems Department, University of North Carolina at Charlotte, Charlotte, NC 28223, USA.
Trans Data Priv. 2013 Aug 1;6(2):127-145.
Enabling accurate analysis of social network data while preserving differential privacy has been challenging since graph features such as cluster coefficient often have high sensitivity, which is different from traditional aggregate functions (e.g., count and sum) on tabular data. In this paper, we study the problem of enforcing edge differential privacy in graph generation. The idea is to enforce differential privacy on graph model parameters learned from the original network and then generate the graphs for releasing using the graph model with the private parameters. In particular, we develop a differential privacy preserving graph generator based on the dK-graph generation model. We first derive from the original graph various parameters (i.e., degree correlations) used in the dK-graph model, then enforce edge differential privacy on the learned parameters, and finally use the dK-graph model with the perturbed parameters to generate graphs. For the 2K-graph model, we enforce the edge differential privacy by calibrating noise based on the smooth sensitivity, rather than the global sensitivity. By doing this, we achieve the strict differential privacy guarantee with smaller magnitude noise. We conduct experiments on four real networks and compare the performance of our private dK-graph models with the stochastic Kronecker graph generation model in terms of utility and privacy tradeoff. Empirical evaluations show the developed private dK-graph generation models significantly outperform the approach based on the stochastic Kronecker generation model.
在保留差分隐私的同时实现对社交网络数据的准确分析一直具有挑战性,因为诸如聚类系数等图特征通常具有高敏感性,这与表格数据上的传统聚合函数(例如计数和求和)不同。在本文中,我们研究了在图生成中实施边差分隐私的问题。其思路是对从原始网络学习到的图模型参数实施差分隐私,然后使用具有私有参数的图模型生成用于发布的图。具体而言,我们基于dK-图生成模型开发了一种差分隐私保护图生成器。我们首先从原始图中导出dK-图模型中使用的各种参数(即度相关性),然后对学习到的参数实施边差分隐私,最后使用具有扰动参数的dK-图模型来生成图。对于2K-图模型,我们基于平滑敏感性而非全局敏感性来校准噪声,从而实施边差分隐私。通过这样做,我们以较小幅度的噪声实现了严格的差分隐私保证。我们在四个真实网络上进行实验,并在效用和隐私权衡方面将我们的私有dK-图模型的性能与随机克罗内克图生成模型进行比较。实证评估表明,所开发的私有dK-图生成模型显著优于基于随机克罗内克生成模型的方法。