Zhang Jing, Li Yanzi, Ding Qian, Lin Liwei, Ye Xiucai
School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou 350118, China.
Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fuzhou 350118, China.
Entropy (Basel). 2022 Aug 23;24(9):1172. doi: 10.3390/e24091172.
The publication of trajectory data provides critical information for various location-based services, and it is critical to publish trajectory data safely while ensuring its availability. Differential privacy is a promising privacy protection technology for publishing trajectory data securely. Most of the existing trajectory privacy protection schemes do not take into account the user's preference for location and the influence of semantic location. Besides, differential privacy for trajectory protection still has the problem of balance between the privacy budget and service quality. In this paper, a semantics- and prediction-based differential privacy protection scheme for trajectory data is proposed. Firstly, trajectory data are transformed into a prefix tree structure to ensure that they satisfy differential privacy. Secondly, considering the influence of semantic location on trajectory, semantic sensitivity combined with location check-in frequency is used to calculate the sensitivity of each position in the trajectory. The privacy level of the position is classified by setting thresholds. Moreover, the corresponding privacy budget is allocated according to the location privacy level. Finally, a Markov chain is used to predict the attack probability of each position in the trajectory. On this basis, the allocation of the privacy budget is further adjusted and its utilization rate is improved. Thus, the problem of the balance between the privacy budget and service quality is solved. Experimental results show that the proposed scheme is able to ensure data availability while protecting data privacy.
轨迹数据的发布为各种基于位置的服务提供了关键信息,在确保其可用性的同时安全地发布轨迹数据至关重要。差分隐私是一种用于安全发布轨迹数据的很有前景的隐私保护技术。现有的大多数轨迹隐私保护方案没有考虑用户对位置的偏好以及语义位置的影响。此外,轨迹保护的差分隐私仍然存在隐私预算和服务质量之间的平衡问题。本文提出了一种基于语义和预测的轨迹数据差分隐私保护方案。首先,将轨迹数据转换为前缀树结构以确保其满足差分隐私。其次,考虑语义位置对轨迹的影响,结合位置签到频率使用语义敏感度来计算轨迹中每个位置的敏感度。通过设置阈值对位置的隐私级别进行分类。此外,根据位置隐私级别分配相应的隐私预算。最后,使用马尔可夫链预测轨迹中每个位置的攻击概率。在此基础上,进一步调整隐私预算的分配并提高其利用率。从而解决了隐私预算和服务质量之间的平衡问题。实验结果表明,所提出的方案能够在保护数据隐私的同时确保数据可用性。