Kuang Linghong, Shi Wenlong, Chen Xueqi, Zhang Jing, Liao Huaxiong
School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou, 350118, China.
Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fuzhou, 350118, China.
Sci Rep. 2025 Apr 30;15(1):15227. doi: 10.1038/s41598-025-88553-9.
The utilization of numerous location-based intelligent services yields massive traffic trajectory data. Mining such data unveils internal and external user features, offering significant application value across various domains. Nonetheless, while trajectory data mining enhances user convenience, it also exposes their privacy to potential breaches. To address the problem that existing traffic trajectory privacy protection methods rarely consider the location semantics and the spatial influence of interest points when constructing k-anonymity sets, which makes user trajectories vulnerable to attacks, a Location Semantic Privacy Protection Model based on Spatial Influence (LSPPM-SI) is proposed to resist semantic attacks. Firstly, a location semantic mining algorithm is proposed to classify the stopovers based on positional semantics, thereby simplifying the semantic information contained in user trajectories. Secondly, a diversified semantic dummy location selecting algorithm is proposed to resist semantic attacks. To enhance the availability of traffic trajectory data while safeguarding location semantics, a Hilbert curves is used to reduce the area of anonymous regions, and a diversified semantic anonymous set is constructed. Thirdly, the spatial influence of interest points is defined and used to verify the rationality of dummy trajectories within the anonymous trajectory set, thereby preventing attackers from identifying dummy trajectories. Finally, the problem of synthesizing dummy trajectories is transformed into a matching problem for directed bipartite graphs and the optimal k-anonymity set is obtained using the Kuhn Munkres algorithm. Experimental results demonstrate that the proposed model improves traffic trajectory data availability and semantic protection performance by 14% and 46.5%, respectively, compared to traditional models.
众多基于位置的智能服务的使用产生了海量的交通轨迹数据。挖掘此类数据可揭示用户的内在和外在特征,在各个领域具有重大应用价值。然而,虽然轨迹数据挖掘提高了用户便利性,但也使他们的隐私面临潜在泄露风险。为了解决现有交通轨迹隐私保护方法在构建k匿名集时很少考虑位置语义和兴趣点的空间影响,从而使用户轨迹易受攻击的问题,提出了一种基于空间影响的位置语义隐私保护模型(LSPPM-SI)来抵御语义攻击。首先,提出一种位置语义挖掘算法,基于位置语义对停留点进行分类,从而简化用户轨迹中包含的语义信息。其次,提出一种多样化语义虚拟位置选择算法来抵御语义攻击。为了在保护位置语义的同时提高交通轨迹数据的可用性,使用希尔伯特曲线缩小匿名区域的面积,并构建多样化语义匿名集。第三,定义兴趣点的空间影响并用于验证匿名轨迹集中虚拟轨迹的合理性,从而防止攻击者识别虚拟轨迹。最后,将虚拟轨迹合成问题转化为有向二分图的匹配问题,并使用匈牙利算法获得最优k匿名集。实验结果表明,与传统模型相比,所提模型分别将交通轨迹数据可用性和语义保护性能提高了14%和46.5%。