Liu Zichun, Huang Liusheng, Xu Hongli, Yang Wei
School of Computer Science and Technology, University of Science and Technology of China, Hefei 230026, China.
Entropy (Basel). 2023 Jan 9;25(1):130. doi: 10.3390/e25010130.
Graph data are widely collected and exploited by organizations, providing convenient services from policy formation and market decisions to medical care and social interactions. Yet, recent exposures of private data abuses have caused huge financial and reputational costs to both organizations and their users, enabling designing efficient privacy protection mechanisms a top priority. Local differential privacy (LDP) is an emerging privacy preservation standard and has been studied in various fields, including graph data aggregation. However, existing research studies of graph aggregation with LDP mainly provide single edge privacy for pure graph, leaving heterogeneous graph data aggregation with stronger privacy as an open challenge. In this paper, we take a step toward simultaneously collecting mixed attributed graph data while retaining intrinsic associations, with stronger local differential privacy protecting more than single edge. Specifically, we first propose a moderate granularity attributewise local differential privacy (ALDP) and formulate the problem of aggregating mixed attributed graph data as collecting two statistics under ALDP. Then we provide mechanisms to privately collect these statistics. For the categorical-attributed graph, we devise a utility-improved PrivAG mechanism, which randomizes and aggregates subsets of attribute and degree vectors. For heterogeneous graph, we present an adaptive binning scheme (ABS) to dynamically segment and simultaneously collect mixed attributed data, and extend the prior mechanism to a generalized PrivHG mechanism based on it. Finally, we practically optimize the utility of the mechanisms by reducing the computation costs and estimation errors. The effectiveness and efficiency of the mechanisms are validated through extensive experiments, and better performance is shown compared with the state-of-the-art mechanisms.
图形数据被组织广泛收集和利用,为从政策制定、市场决策到医疗保健和社交互动等提供便捷服务。然而,最近私人数据滥用事件的曝光给组织及其用户带来了巨大的财务和声誉损失,使得设计高效的隐私保护机制成为当务之急。局部差分隐私(LDP)是一种新兴的隐私保护标准,已在包括图形数据聚合在内的各个领域得到研究。然而,现有的关于使用LDP进行图形聚合的研究主要为纯图形提供单边隐私,将具有更强隐私保护的异构图形数据聚合作为一个开放挑战。在本文中,我们朝着同时收集混合属性图形数据迈出了一步,同时保留内在关联,并使用更强的局部差分隐私保护多个边而不仅仅是单边。具体而言,我们首先提出一种适度粒度的逐属性局部差分隐私(ALDP),并将混合属性图形数据的聚合问题表述为在ALDP下收集两个统计量。然后我们提供私下收集这些统计量的机制。对于分类属性图形,我们设计了一种效用改进的PrivAG机制,该机制对属性和度向量的子集进行随机化和聚合。对于异构图形,我们提出一种自适应分箱方案(ABS)来动态分割并同时收集混合属性数据,并在此基础上将先前的机制扩展为广义的PrivHG机制。最后,我们通过降低计算成本和估计误差来实际优化这些机制的效用。通过广泛的实验验证了这些机制的有效性和效率,并且与现有最先进的机制相比表现出更好的性能。