• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Selecting Optimal Subset to release under Differentially Private M-estimators from Hybrid Datasets.从混合数据集中选择最优子集以在差分隐私M估计器下发布。
IEEE Trans Knowl Data Eng. 2018 Mar 1;30(3):573-584. doi: 10.1109/TKDE.2017.2773545. Epub 2017 Nov 14.
2
Differential privacy under dependent tuples-the case of genomic privacy.相依元组下的差分隐私-基因组隐私案例。
Bioinformatics. 2020 Mar 1;36(6):1696-1703. doi: 10.1093/bioinformatics/btz837.
3
Tuning Privacy-Utility Tradeoff in Genomic Studies Using Selective SNP Hiding.利用选择性单核苷酸多态性隐藏调整基因组研究中的隐私-效用权衡
Proc Asia Pac Bioinform Conf. 2023 Apr;2023.
4
Inference attacks against differentially private query results from genomic datasets including dependent tuples.针对包含依赖元组的基因组数据集的差分隐私查询结果的推理攻击。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i136-i145. doi: 10.1093/bioinformatics/btaa475.
5
Differentially Private Outcome-Weighted Learning for Optimal Dynamic Treatment Regime Estimation.用于最优动态治疗方案估计的差分隐私结果加权学习
Stat. 2024;13(1). doi: 10.1002/sta4.641. Epub 2024 Jan 17.
6
Deconvoluting kernel density estimation and regression for locally differentially private data.去卷积核密度估计和回归用于局部差分隐私数据。
Sci Rep. 2020 Dec 7;10(1):21361. doi: 10.1038/s41598-020-78323-0.
7
Medical imaging deep learning with differential privacy.医学影像深度学习中的差分隐私。
Sci Rep. 2021 Jun 29;11(1):13524. doi: 10.1038/s41598-021-93030-0.
8
Convergence Rates for Differentially Private Statistical Estimation.差分隐私统计估计的收敛速率
Proc Int Conf Mach Learn. 2012 Jul;2012:1327-1334.
9
Differentially Private Histogram Publication For Dynamic Datasets: An Adaptive Sampling Approach.动态数据集的差分隐私直方图发布:一种自适应采样方法。
Proc ACM Int Conf Inf Knowl Manag. 2015 Oct;2015:1001-1010. doi: 10.1145/2806416.2806441.
10
Differentially Private Bayesian Neural Networks on Accuracy, Privacy and Reliability.基于准确性、隐私性和可靠性的差分隐私贝叶斯神经网络
Mach Learn Knowl Discov Databases. 2023;13716:604-619. doi: 10.1007/978-3-031-26412-2_37. Epub 2023 Mar 17.

引用本文的文献

1
Possibilistic distribution distance metric: a robust domain adaptation learning method.可能性分布距离度量:一种稳健的域适应学习方法。
Front Neurosci. 2023 Nov 9;17:1247082. doi: 10.3389/fnins.2023.1247082. eCollection 2023.
2
Privacy protection of medical data in social network.社交网络中医疗数据的隐私保护。
BMC Med Inform Decis Mak. 2021 Oct 18;21(Suppl 1):286. doi: 10.1186/s12911-021-01645-0.
3
Differential privacy in health research: A scoping review.健康研究中的差分隐私:范围综述。
J Am Med Inform Assoc. 2021 Sep 18;28(10):2269-2276. doi: 10.1093/jamia/ocab135.

本文引用的文献

1
Individual privacy versus public good: protecting confidentiality in health research.个人隐私与公共利益:保护健康研究中的保密性
Stat Med. 2015 Oct 15;34(23):3081-103. doi: 10.1002/sim.6543. Epub 2015 Jun 5.
2
Choosing blindly but wisely: differentially private solicitation of DNA datasets for disease marker discovery.盲目而明智地选择:用于疾病标志物发现的DNA数据集的差分隐私征集。
J Am Med Inform Assoc. 2015 Jan;22(1):100-8. doi: 10.1136/amiajnl-2014-003043. Epub 2014 Oct 28.
3
Differentially private distributed logistic regression using private and public data.使用私有和公共数据的差分隐私分布式逻辑回归
BMC Med Genomics. 2014;7 Suppl 1(Suppl 1):S14. doi: 10.1186/1755-8794-7-S1-S14. Epub 2014 May 8.
4
Privacy preserving RBF kernel support vector machine.隐私保护径向基核支持向量机
Biomed Res Int. 2014;2014:827371. doi: 10.1155/2014/827371. Epub 2014 Jun 12.
5
Differential privacy based on importance weighting.基于重要性加权的差分隐私。
Mach Learn. 2013 Oct;93(1):163-183. doi: 10.1007/s10994-013-5396-x.
6
Imputation of confidential data sets with spatial locations using disease mapping models.使用疾病映射模型对具有空间位置的机密数据集进行插补。
Stat Med. 2014 May 20;33(11):1928-45. doi: 10.1002/sim.6078. Epub 2014 Jan 7.
7
Disclosure control using partially synthetic data for large-scale health surveys, with applications to CanCORS.使用部分合成数据进行大规模健康调查的披露控制及其在癌症队列研究中的应用
Stat Med. 2013 Oct 30;32(24):4139-61. doi: 10.1002/sim.5841. Epub 2013 May 13.
8
Privacy-preserving heterogeneous health data sharing.隐私保护的异构健康数据共享。
J Am Med Inform Assoc. 2013 May 1;20(3):462-9. doi: 10.1136/amiajnl-2012-001027. Epub 2012 Dec 13.
9
Methods for observational post-licensure medical product safety surveillance.上市后医疗产品安全性观察性监测方法。
Stat Methods Med Res. 2015 Apr;24(2):177-93. doi: 10.1177/0962280211413452. Epub 2011 Dec 2.
10
Sensible use of observational clinical data.合理利用观察性临床数据。
Stat Methods Med Res. 2013 Feb;22(1):7-13. doi: 10.1177/0962280211403598. Epub 2011 Aug 9.

从混合数据集中选择最优子集以在差分隐私M估计器下发布。

Selecting Optimal Subset to release under Differentially Private M-estimators from Hybrid Datasets.

作者信息

Wang Meng, Ji Zhanglong, Kim Hyeon-Eui, Wang Shuang, Xiong Li, Jiang Xiaoqian

机构信息

Department of Biomedical Informatics, University of California at San Diego, CA, 92093 U.S., and now is with the Department of Genetics, Stanford University, CA, 94305, U.S.

Department of Biomedical Informatics, University of California at San Diego, CA, 92093 U.S.

出版信息

IEEE Trans Knowl Data Eng. 2018 Mar 1;30(3):573-584. doi: 10.1109/TKDE.2017.2773545. Epub 2017 Nov 14.

DOI:10.1109/TKDE.2017.2773545
PMID:30034201
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6051552/
Abstract

Privacy concern in data sharing especially for health data gains particularly increasing attention nowadays. Now some patients agree to open their information for research use, which gives rise to a new question of how to effectively use the public information to better understand the private dataset without breaching privacy. In this paper, we specialize this question as selecting an optimal subset of the public dataset for M-estimators in the framework of differential privacy (DP) in [1]. From a perspective of non-interactive learning, we first construct the weighted private density estimation from the hybrid datasets under DP. Along the same line as [2], we analyze the accuracy of the DP M-estimators based on the hybrid datasets. Our main contributions are (i) we find that the bias-variance tradeoff in the performance of our M-estimators can be characterized in the sample size of the released dataset; (2) based on this finding, we develop an algorithm to select the optimal subset of the public dataset to release under DP. Our simulation studies and application to the real datasets confirm our findings and set a guideline in the real application.

摘要

如今,数据共享中的隐私问题,尤其是健康数据方面,受到了越来越多的关注。现在一些患者同意公开其信息以供研究使用,这引发了一个新问题:如何在不侵犯隐私的情况下,有效利用公开信息更好地理解私有数据集。在本文中,我们将这个问题具体化为在[1]中差分隐私(DP)框架下为M估计器选择公共数据集的最优子集。从非交互式学习的角度出发,我们首先在DP下从混合数据集中构建加权私有密度估计。与[2]思路一致,我们分析了基于混合数据集的DP M估计器的准确性。我们的主要贡献在于:(i)我们发现M估计器性能中的偏差 - 方差权衡可以通过发布数据集的样本大小来表征;(2)基于这一发现,我们开发了一种算法,用于在DP下选择要发布的公共数据集的最优子集。我们的模拟研究以及对真实数据集的应用证实了我们的发现,并为实际应用设定了指导方针。