Jiang Yuzhou, Yilmaz Emre, Ayday Erman
Case Western Reserve University Cleveland, Ohio, USA.
University of Houston-Downtown Houston, Texas, USA.
Proc Priv Enhanc Technol. 2023 Jul;2023(4):5-20. doi: 10.56553/popets-2023-0095.
Location-based services have brought significant convenience to people in their daily lives, and the collected location data are also in high demand. However, directly releasing those data raises privacy and liability (e.g., due to unauthorized distribution of such datasets) concerns since location data contain users' sensitive information, e.g., regular moving patterns and favorite spots. To address this, we propose a novel fingerprinting scheme that simultaneously identifies unauthorized redistribution of location datasets and provides differential privacy guarantees for the shared data. Observing data utility degradation due to differentially-private mechanisms, we introduce a utility-focused post-processing scheme to regain spatiotemporal correlations between points in a location trajectory. We further integrate this post-processing scheme into our fingerprinting scheme as a sampling method. The proposed fingerprinting scheme alleviates the degradation in the utility of the shared dataset due to the noise introduced by differentially-private mechanisms (i.e., adds the fingerprint by preserving the publicly known statistics of the data). Meanwhile, it does not violate differential privacy throughout the entire process due to immunity to post-processing, a fundamental property of differential privacy. Our proposed fingerprinting scheme is robust against known and well-studied attacks against a fingerprinting scheme including random flipping attacks, correlation-based flipping attacks, and collusions among multiple parties, which makes it hard for the attackers to infer the fingerprint codes and avoid accusation. Via experiments on two real-life location datasets and two synthetic ones, we show that our scheme achieves high fingerprinting robustness and outperforms existing approaches. Besides, the proposed fingerprinting scheme increases data utility for differentially-private datasets, which is beneficial for data analyzers.
基于位置的服务给人们的日常生活带来了极大便利,同时对所收集的位置数据的需求也很高。然而,直接发布这些数据会引发隐私和责任(例如,由于此类数据集的未经授权分发)问题,因为位置数据包含用户的敏感信息,例如常规移动模式和常去地点。为了解决这个问题,我们提出了一种新颖的指纹识别方案,该方案既能识别位置数据集的未经授权重新分发,又能为共享数据提供差分隐私保证。鉴于差分隐私机制会导致数据效用下降,我们引入了一种以效用为重点的后处理方案,以恢复位置轨迹中各点之间的时空相关性。我们进一步将此 后处理方案作为一种采样方法集成到我们的指纹识别方案中。所提出的指纹识别方案减轻了由于差分隐私机制引入的噪声导致的共享数据集效用的下降(即,通过保留数据的公开已知统计信息来添加指纹)。同时,由于差分隐私的一个基本属性——对后处理具有免疫力,它在整个过程中都不会违反差分隐私。我们提出的指纹识别方案对于针对指纹识别方案的已知且经过充分研究的攻击(包括随机翻转攻击、基于相关性的翻转攻击以及多方勾结)具有鲁棒性,这使得攻击者难以推断指纹代码并逃避指控。通过对两个真实生活位置数据集和两个合成数据集进行实验,我们表明我们的方案实现了高指纹识别鲁棒性,并且优于现有方法。此外,所提出的指纹识别方案提高了差分隐私数据集的数据效用,这对数据分析人员是有益的。