• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

去卷积核密度估计和回归用于局部差分隐私数据。

Deconvoluting kernel density estimation and regression for locally differentially private data.

机构信息

Department of Electrical and Electronic Engineering, The University of Melbourne, Parkville, VIC, 3010, Australia.

出版信息

Sci Rep. 2020 Dec 7;10(1):21361. doi: 10.1038/s41598-020-78323-0.

DOI:10.1038/s41598-020-78323-0
PMID:33288799
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7721740/
Abstract

Local differential privacy has become the gold-standard of privacy literature for gathering or releasing sensitive individual data points in a privacy-preserving manner. However, locally differential data can twist the probability density of the data because of the additive noise used to ensure privacy. In fact, the density of privacy-preserving data (no matter how many samples we gather) is always flatter in comparison with the density function of the original data points due to convolution with privacy-preserving noise density function. The effect is especially more pronounced when using slow-decaying privacy-preserving noises, such as the Laplace noise. This can result in under/over-estimation of the heavy-hitters. This is an important challenge facing social scientists due to the use of differential privacy in the 2020 Census in the United States. In this paper, we develop density estimation methods using smoothing kernels. We use the framework of deconvoluting kernel density estimators to remove the effect of privacy-preserving noise. This approach also allows us to adapt the results from non-parametric regression with errors-in-variables to develop regression models based on locally differentially private data. We demonstrate the performance of the developed methods on financial and demographic datasets.

摘要

局部差分隐私已成为隐私文献的黄金标准,用于以保护隐私的方式收集或发布敏感的个人数据点。然而,由于用于确保隐私的加性噪声,局部差分数据可能会扭曲数据的概率密度。实际上,由于与隐私保护噪声密度函数的卷积,隐私保护数据(无论我们收集多少个样本)的密度始终比原始数据点的密度函数更平坦。当使用缓慢衰减的隐私保护噪声(如拉普拉斯噪声)时,这种效果更加明显。这可能导致重尾的低估/高估。由于美国 2020 年人口普查中使用了差分隐私,因此这是社会科学家面临的一个重要挑战。在本文中,我们使用平滑核开发了密度估计方法。我们使用解卷积核密度估计器的框架来消除隐私保护噪声的影响。这种方法还允许我们根据带有误差的变量的非参数回归结果来开发基于局部差分隐私数据的回归模型。我们在金融和人口数据集上展示了所开发方法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/aa835db7b3f8/41598_2020_78323_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/99e2c6aef867/41598_2020_78323_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/fce7cb2ba8eb/41598_2020_78323_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/6b80082942a1/41598_2020_78323_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/f54e29fb06c6/41598_2020_78323_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/5640eea27b1d/41598_2020_78323_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/6bb89aa7f07a/41598_2020_78323_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/a5e937d736ff/41598_2020_78323_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/93752ab070e2/41598_2020_78323_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/aa835db7b3f8/41598_2020_78323_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/99e2c6aef867/41598_2020_78323_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/fce7cb2ba8eb/41598_2020_78323_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/6b80082942a1/41598_2020_78323_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/f54e29fb06c6/41598_2020_78323_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/5640eea27b1d/41598_2020_78323_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/6bb89aa7f07a/41598_2020_78323_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/a5e937d736ff/41598_2020_78323_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/93752ab070e2/41598_2020_78323_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0da8/7721740/aa835db7b3f8/41598_2020_78323_Fig9_HTML.jpg

相似文献

1
Deconvoluting kernel density estimation and regression for locally differentially private data.去卷积核密度估计和回归用于局部差分隐私数据。
Sci Rep. 2020 Dec 7;10(1):21361. doi: 10.1038/s41598-020-78323-0.
2
Preserving differential privacy in deep neural networks with relevance-based adaptive noise imposition.基于相关性的自适应噪声引入保护深度神经网络的差分隐私。
Neural Netw. 2020 May;125:131-141. doi: 10.1016/j.neunet.2020.02.001. Epub 2020 Feb 11.
3
Privacy preserving RBF kernel support vector machine.隐私保护径向基核支持向量机
Biomed Res Int. 2014;2014:827371. doi: 10.1155/2014/827371. Epub 2014 Jun 12.
4
Convergence Rates for Differentially Private Statistical Estimation.差分隐私统计估计的收敛速率
Proc Int Conf Mach Learn. 2012 Jul;2012:1327-1334.
5
Preserving Differential Privacy in Degree-Correlation based Graph Generation.在基于度相关性的图生成中保护差分隐私
Trans Data Priv. 2013 Aug 1;6(2):127-145.
6
Selecting Optimal Subset to release under Differentially Private M-estimators from Hybrid Datasets.从混合数据集中选择最优子集以在差分隐私M估计器下发布。
IEEE Trans Knowl Data Eng. 2018 Mar 1;30(3):573-584. doi: 10.1109/TKDE.2017.2773545. Epub 2017 Nov 14.
7
DPSynthesizer: Differentially Private Data Synthesizer for Privacy Preserving Data Sharing.DPSynthesizer:用于隐私保护数据共享的差分隐私数据合成器。
Proceedings VLDB Endowment. 2014 Aug;7(13):1677-1680. doi: 10.14778/2733004.2733059.
8
Differentially private distributed logistic regression using private and public data.使用私有和公共数据的差分隐私分布式逻辑回归
BMC Med Genomics. 2014;7 Suppl 1(Suppl 1):S14. doi: 10.1186/1755-8794-7-S1-S14. Epub 2014 May 8.
9
Differentially Private Empirical Risk Minimization.差分隐私经验风险最小化
J Mach Learn Res. 2011 Mar;12:1069-1109.
10
Differential Privacy via Haar Wavelet Transform and Gaussian Mechanism for Range Query.基于 Haar 小波变换和高斯机制的范围查询中的差分隐私。
Comput Intell Neurosci. 2022 Sep 12;2022:8139813. doi: 10.1155/2022/8139813. eCollection 2022.

引用本文的文献

1
A Generalized Regression Neural Network Model for Predicting the Curing Characteristics of Carbon Black-Filled Rubber Blends.一种用于预测炭黑填充橡胶共混物固化特性的广义回归神经网络模型。
Polymers (Basel). 2022 Feb 9;14(4):653. doi: 10.3390/polym14040653.

本文引用的文献

1
A simple method to estimate prediction intervals and predictive distributions: Summarizing meta-analyses beyond means and confidence intervals.一种简单的估计预测区间和预测分布的方法:超越均值和置信区间的元分析总结。
Res Synth Methods. 2019 Jun;10(2):255-266. doi: 10.1002/jrsm.1345. Epub 2019 Apr 3.
2
Researchers object to census privacy measure.研究人员反对人口普查隐私措施。
Science. 2019 Jan 11;363(6423):114. doi: 10.1126/science.363.6423.114.
3
Randomized response: a survey technique for eliminating evasive answer bias.随机化回答:一种消除回避性回答偏差的调查技术。
J Am Stat Assoc. 1965 Mar;60(309):63-6.