以高实用性对1:M微数据进行匿名化处理。

Anonymizing 1:M microdata with high utility.

作者信息

Gong Qiyuan, Luo Junzhou, Yang Ming, Ni Weiwei, Li Xiao-Bai

机构信息

Southeast University, Nanjing, China.

University of Massachusetts Lowell, Massachusetts, USA.

出版信息

Knowl Based Syst. 2017 Jan 1;115:15-26. doi: 10.1016/j.knosys.2016.10.012. Epub 2016 Oct 21.

DOI:10.1016/j.knosys.2016.10.012

PMID:28603388

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5464735/

Abstract

Preserving privacy and utility during data publishing and data mining is essential for individuals, data providers and researchers. However, studies in this area typically assume that one individual has only one record in a dataset, which is unrealistic in many applications. Having multiple records for an individual leads to new privacy leakages. We call such a dataset a 1:M dataset. In this paper, we propose a novel privacy model called ()-diversity that addresses disclosure risks in 1:M data publishing. Based on this model, we develop an efficient algorithm named 1:M-Generalization to preserve privacy and data utility, and compare it with alternative approaches. Extensive experiments on real-world data show that our approach outperforms the state-of-the-art technique, in terms of data utility and computational cost.

摘要

在数据发布和数据挖掘过程中保护隐私和实用性，对个人、数据提供者和研究人员来说至关重要。然而，该领域的研究通常假定数据集中的一个人只有一条记录，这在许多应用中是不现实的。一个人拥有多条记录会导致新的隐私泄露。我们将这样的数据集称为1:M数据集。在本文中，我们提出了一种名为( )-多样性的新型隐私模型，用于解决1:M数据发布中的披露风险。基于此模型，我们开发了一种名为1:M-泛化的高效算法来保护隐私和数据实用性，并将其与其他方法进行比较。对真实世界数据的大量实验表明，我们的方法在数据实用性和计算成本方面优于现有技术。

相似文献

Anonymizing 1:M microdata with high utility.

Knowl Based Syst. 2017 Jan 1;115:15-26. doi: 10.1016/j.knosys.2016.10.012. Epub 2016 Oct 21.

Privacy preserving data anonymization of spontaneous ADE reporting system dataset.

BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):58. doi: 10.1186/s12911-016-0293-4.

Utility-preserving anonymization for health data publishing.

BMC Med Inform Decis Mak. 2017 Jul 11;17(1):104. doi: 10.1186/s12911-017-0499-0.

Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values.

BMC Med Inform Decis Mak. 2020 Jul 8;20(1):155. doi: 10.1186/s12911-020-01171-5.

Designing a Novel Approach Using a Greedy and Information-Theoretic Clustering-Based Algorithm for Anonymizing Microdata Sets.

Entropy (Basel). 2023 Dec 1;25(12):1613. doi: 10.3390/e25121613.

Improved angelization technique against background knowledge attack for 1:M microdata.

PeerJ Comput Sci. 2023 Mar 15;9:e1255. doi: 10.7717/peerj-cs.1255. eCollection 2023.

The cost of quality: Implementing generalization and suppression for anonymizing biomedical data with minimal information loss.

J Biomed Inform. 2015 Dec;58:37-48. doi: 10.1016/j.jbi.2015.09.007. Epub 2015 Sep 15.

Anonymizing datasets with demographics and diagnosis codes in the presence of utility constraints.

J Biomed Inform. 2017 Jan;65:76-96. doi: 10.1016/j.jbi.2016.11.001. Epub 2016 Nov 8.

Privacy-Preserving Anonymity for Periodical Releases of Spontaneous Adverse Drug Event Reporting Data: Algorithm Development and Validation.

JMIR Med Inform. 2021 Oct 28;9(10):e28752. doi: 10.2196/28752.

Digression and Value Concatenation to Enable Privacy-Preserving Regression.

MIS Q. 2014 Sep;38(3):679-698. doi: 10.25300/misq/2014/38.3.03.

引用本文的文献

Improved angelization technique against background knowledge attack for 1:M microdata.

PeerJ Comput Sci. 2023 Mar 15;9:e1255. doi: 10.7717/peerj-cs.1255. eCollection 2023.

Recent Developments in Privacy-Preserving Mining of Clinical Data.

ACM IMS Trans Data Sci. 2021 Nov;2(4). doi: 10.1145/3447774.

本文引用的文献

Disassociation for electronic health record privacy.

J Biomed Inform. 2014 Aug;50:46-61. doi: 10.1016/j.jbi.2014.05.009. Epub 2014 May 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

以高实用性对1:M微数据进行匿名化处理。

Anonymizing 1:M microdata with high utility.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献