分布不变差分隐私

Distribution-Invariant Differential Privacy.

作者信息

Bi Xuan, Shen Xiaotong

机构信息

Information and Decision Sciences, Carlson School of Management, University of Minnesota, Minneapolis, MN.

School of Statistics, University of Minnesota, Minneapolis, MN.

出版信息

J Econom. 2023 Aug;235(2):444-453. doi: 10.1016/j.jeconom.2022.05.004. Epub 2022 Jun 18.

DOI:10.1016/j.jeconom.2022.05.004

PMID:37701878

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10495082/

Abstract

Differential privacy is becoming one gold standard for protecting the privacy of publicly shared data. It has been widely used in social science, data science, public health, information technology, and the U.S. decennial census. Nevertheless, to guarantee differential privacy, existing methods may unavoidably alter the conclusion of original data analysis, as privatization often changes the sample distribution. This phenomenon is known as the trade-off between privacy protection and statistical accuracy. In this work, we mitigate this trade-off by developing a distribution-invariant privatization (DIP) method to reconcile both high statistical accuracy and strict differential privacy. As a result, any downstream statistical or machine learning task yields essentially the same conclusion as if one used the original data. Numerically, under the same strictness of privacy protection, DIP achieves superior statistical accuracy in a wide range of simulation studies and real-world benchmarks.

摘要

差分隐私正成为保护公开共享数据隐私的一项黄金标准。它已在社会科学、数据科学、公共卫生、信息技术以及美国十年一度的人口普查中得到广泛应用。然而，为了保证差分隐私，现有方法可能不可避免地会改变原始数据分析的结论，因为数据私有化往往会改变样本分布。这种现象被称为隐私保护与统计准确性之间的权衡。在这项工作中，我们通过开发一种分布不变私有化（DIP）方法来缓解这种权衡，以兼顾高统计准确性和严格的差分隐私。结果是，任何下游的统计或机器学习任务得出的结论与使用原始数据时基本相同。在数值上，在相同的隐私保护严格程度下，DIP在广泛的模拟研究和实际基准测试中都实现了卓越的统计准确性。

相似文献

Distribution-Invariant Differential Privacy.

J Econom. 2023 Aug;235(2):444-453. doi: 10.1016/j.jeconom.2022.05.004. Epub 2022 Jun 18.

Data Flush.

Harv Data Sci Rev. 2022 Spring;4(2). doi: 10.1162/99608f92.681fe3bd. Epub 2022 May 9.

Differential privacy in the 2020 US census: what will it do? Quantifying the accuracy/privacy tradeoff.

Gates Open Res. 2020 Apr 6;3:1722. doi: 10.12688/gatesopenres.13089.2. eCollection 2019.

Differential Privacy Protection Against Membership Inference Attack on Machine Learning for Genomic Data.

Pac Symp Biocomput. 2021;26:26-37.

Differential Privacy for the Vast Majority.

ACM Trans Manag Inf Syst. 2019 Jul;10(2). doi: 10.1145/3329717.

FLPP: A Federated-Learning-Based Scheme for Privacy Protection in Mobile Edge Computing.

Entropy (Basel). 2023 Nov 16;25(11):1551. doi: 10.3390/e25111551.

Approximating Functions with Approximate Privacy for Applications in Signal Estimation and Learning.

Entropy (Basel). 2023 May 22;25(5):825. doi: 10.3390/e25050825.

Toward a Comparison of Classical and New Privacy Mechanism.

Entropy (Basel). 2021 Apr 15;23(4):467. doi: 10.3390/e23040467.

Equitable differential privacy.

Front Big Data. 2024 Aug 16;7:1420344. doi: 10.3389/fdata.2024.1420344. eCollection 2024.

Task-Specific Adaptive Differential Privacy Method for Structured Data.

Sensors (Basel). 2023 Feb 10;23(4):1980. doi: 10.3390/s23041980.

引用本文的文献

Medical laboratory data-based models: opportunities, obstacles, and solutions.

J Transl Med. 2025 Jul 24;23(1):823. doi: 10.1186/s12967-025-06802-x.

Application of the joint clustering algorithm based on Gaussian kernels and differential privacy in lung cancer identification.

Sci Rep. 2025 May 16;15(1):17094. doi: 10.1038/s41598-025-01873-8.

Communication-efficient federated learning of temporal effects on opioid use disorder with data from distributed research networks.

J Am Med Inform Assoc. 2025 Apr 1;32(4):656-664. doi: 10.1093/jamia/ocae313.

Novel Uncertainty Quantification Through Perturbation-Assisted Sample Synthesis.

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):7813-7824. doi: 10.1109/TPAMI.2024.3393364. Epub 2024 Nov 6.

Data Flush.

Harv Data Sci Rev. 2022 Spring;4(2). doi: 10.1162/99608f92.681fe3bd. Epub 2022 May 9.

本文引用的文献

Forecasting influenza activity using machine-learned mobility map.

Nat Commun. 2021 Feb 9;12(1):726. doi: 10.1038/s41467-021-21018-5.

Breaking medical data sharing boundaries by using synthesized radiographs.

Sci Adv. 2020 Dec 2;6(49). doi: 10.1126/sciadv.abb7973. Print 2020 Dec.

Transparency and reproducibility in artificial intelligence.

Nature. 2020 Oct;586(7829):E14-E16. doi: 10.1038/s41586-020-2766-y. Epub 2020 Oct 14.

How differential privacy will affect our understanding of health disparities in the United States.

Proc Natl Acad Sci U S A. 2020 Jun 16;117(24):13405-13412. doi: 10.1073/pnas.2003714117. Epub 2020 May 28.

Towards formalizing the GDPR's notion of singling out.

Proc Natl Acad Sci U S A. 2020 Apr 14;117(15):8344-8352. doi: 10.1073/pnas.1914598117. Epub 2020 Mar 31.

Realizing private and practical pharmacological collaboration.

Science. 2018 Oct 19;362(6412):347-350. doi: 10.1126/science.aat4807.

Private algorithms for the protected in social network search.

Proc Natl Acad Sci U S A. 2016 Jan 26;113(4):913-8. doi: 10.1073/pnas.1510612113. Epub 2016 Jan 11.

Machine learning: Trends, perspectives, and prospects.

Science. 2015 Jul 17;349(6245):255-60. doi: 10.1126/science.aaa8415.

Information access. Balancing privacy versus accuracy in research protocols.

Science. 2015 Jan 30;347(6221):479-80. doi: 10.1126/science.aaa3483.

Differentially Private Empirical Risk Minimization.

J Mach Learn Res. 2011 Mar;12:1069-1109.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

分布不变差分隐私

Distribution-Invariant Differential Privacy.

作者信息

Bi Xuan, Shen Xiaotong

机构信息

Information and Decision Sciences, Carlson School of Management, University of Minnesota, Minneapolis, MN.

School of Statistics, University of Minnesota, Minneapolis, MN.

出版信息

J Econom. 2023 Aug;235(2):444-453. doi: 10.1016/j.jeconom.2022.05.004. Epub 2022 Jun 18.

DOI:10.1016/j.jeconom.2022.05.004

PMID:37701878

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10495082/

Abstract

摘要

分布不变差分隐私

Distribution-Invariant Differential Privacy.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

分布不变差分隐私

Distribution-Invariant Differential Privacy.

作者信息

机构信息

出版信息