• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医学微观数据的差分隐私发布:一种保护信息属性值的高效实用方法。

Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values.

机构信息

Department of Computer Science and Engineering, Korea University, 145 Anam-ro, Seongbuk-gu, Seoul, 02841, Republic of Korea.

出版信息

BMC Med Inform Decis Mak. 2020 Jul 8;20(1):155. doi: 10.1186/s12911-020-01171-5.

DOI:10.1186/s12911-020-01171-5
PMID:32641043
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7346516/
Abstract

BACKGROUND

Various methods based on k-anonymity have been proposed for publishing medical data while preserving privacy. However, the k-anonymity property assumes that adversaries possess fixed background knowledge. Although differential privacy overcomes this limitation, it is specialized for aggregated results. Thus, it is difficult to obtain high-quality microdata. To address this issue, we propose a differentially private medical microdata release method featuring high utility.

METHODS

We propose a method of anonymizing medical data under differential privacy. To improve data utility, especially by preserving informative attribute values, the proposed method adopts three data perturbation approaches: (1) generalization, (2) suppression, and (3) insertion. The proposed method produces an anonymized dataset that is nearly optimal with regard to utility, while preserving privacy.

RESULTS

The proposed method achieves lower information loss than existing methods. Based on a real-world case study, we prove that the results of data analyses using the original dataset and those obtained using a dataset anonymized via the proposed method are considerably similar.

CONCLUSIONS

We propose a novel differentially private anonymization method that preserves informative values for the release of medical data. Through experiments, we show that the utility of medical data that has been anonymized via the proposed method is significantly better than that of existing methods.

摘要

背景

为了在发布医疗数据的同时保护隐私,已经提出了各种基于 k-匿名的方法。然而,k-匿名属性假设对手拥有固定的背景知识。尽管差分隐私克服了这一限制,但它是专门针对聚合结果的。因此,很难获得高质量的微观数据。针对这个问题,我们提出了一种具有高实用性的差分隐私医疗微观数据发布方法。

方法

我们提出了一种在差分隐私下对医疗数据进行匿名化的方法。为了提高数据的实用性,特别是保留有信息量的属性值,所提出的方法采用了三种数据扰动方法:(1)泛化,(2)抑制,(3)插入。所提出的方法生成了一个在效用方面几乎是最优的匿名数据集,同时保护了隐私。

结果

所提出的方法比现有方法实现了更低的信息损失。基于一个真实世界的案例研究,我们证明了使用原始数据集和使用通过所提出的方法匿名化的数据集进行数据分析的结果非常相似。

结论

我们提出了一种新的差分隐私匿名化方法,用于发布医疗数据,保留有信息量的值。通过实验,我们表明通过所提出的方法匿名化的医疗数据的实用性明显优于现有方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/d1f6d2369663/12911_2020_1171_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/ee9b810a5b1f/12911_2020_1171_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/db0e16673f57/12911_2020_1171_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/00294ef91da2/12911_2020_1171_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/df9fc9a6eb0a/12911_2020_1171_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/d982503fa188/12911_2020_1171_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/d1f6d2369663/12911_2020_1171_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/ee9b810a5b1f/12911_2020_1171_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/db0e16673f57/12911_2020_1171_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/00294ef91da2/12911_2020_1171_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/df9fc9a6eb0a/12911_2020_1171_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/d982503fa188/12911_2020_1171_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/489a/7346516/d1f6d2369663/12911_2020_1171_Fig8_HTML.jpg

相似文献

1
Differentially private release of medical microdata: an efficient and practical approach for preserving informative attribute values.医学微观数据的差分隐私发布:一种保护信息属性值的高效实用方法。
BMC Med Inform Decis Mak. 2020 Jul 8;20(1):155. doi: 10.1186/s12911-020-01171-5.
2
Utility-preserving anonymization for health data publishing.用于健康数据发布的效用保持匿名化
BMC Med Inform Decis Mak. 2017 Jul 11;17(1):104. doi: 10.1186/s12911-017-0499-0.
3
Privacy preserving data anonymization of spontaneous ADE reporting system dataset.自发不良药物事件报告系统数据集的隐私保护数据匿名化
BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):58. doi: 10.1186/s12911-016-0293-4.
4
The cost of quality: Implementing generalization and suppression for anonymizing biomedical data with minimal information loss.质量成本:在信息损失最小化的情况下,对生物医学数据进行匿名化处理时实施泛化和抑制。
J Biomed Inform. 2015 Dec;58:37-48. doi: 10.1016/j.jbi.2015.09.007. Epub 2015 Sep 15.
5
Privacy-Preserving Anonymity for Periodical Releases of Spontaneous Adverse Drug Event Reporting Data: Algorithm Development and Validation.自发不良药物事件报告数据定期发布的隐私保护匿名性:算法开发与验证
JMIR Med Inform. 2021 Oct 28;9(10):e28752. doi: 10.2196/28752.
6
Privacy-preserving data cube for electronic medical records: An experimental evaluation.用于电子病历的隐私保护数据立方体:实验评估
Int J Med Inform. 2017 Jan;97:33-42. doi: 10.1016/j.ijmedinf.2016.09.008. Epub 2016 Sep 24.
7
Anonymizing 1:M microdata with high utility.以高实用性对1:M微数据进行匿名化处理。
Knowl Based Syst. 2017 Jan 1;115:15-26. doi: 10.1016/j.knosys.2016.10.012. Epub 2016 Oct 21.
8
Designing a Novel Approach Using a Greedy and Information-Theoretic Clustering-Based Algorithm for Anonymizing Microdata Sets.设计一种基于贪心和信息论聚类算法的新颖方法,用于对微数据集进行匿名化处理。
Entropy (Basel). 2023 Dec 1;25(12):1613. doi: 10.3390/e25121613.
9
Diversity-Aware Anonymization for Structured Health Data.面向结构化健康数据的多样性感知匿名化。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2148-2154. doi: 10.1109/EMBC46164.2021.9629918.
10
On Anonymizing Medical Microdata with Large-Scale Missing Values - A Case Study with the FAERS Dataset.关于使用大规模缺失值对医学微观数据进行匿名化处理——以FAERS数据集为例的研究
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:6505-6508. doi: 10.1109/EMBC.2019.8857025.

引用本文的文献

1
Differential privacy medical data publishing method based on attribute correlation.基于属性相关性的差分隐私医疗数据发布方法。
Sci Rep. 2022 Sep 21;12(1):15725. doi: 10.1038/s41598-022-19544-3.

本文引用的文献

1
A statistical analysis of vaccine-adverse event data.疫苗不良事件数据的统计分析。
BMC Med Inform Decis Mak. 2019 May 28;19(1):101. doi: 10.1186/s12911-019-0818-8.
2
A visual interactive analytic tool for filtering and summarizing large health data sets coded with hierarchical terminologies (VIADS).用于过滤和总结使用分层术语编码的大型健康数据集的可视化交互式分析工具 (VIADS)。
BMC Med Inform Decis Mak. 2019 Feb 14;19(1):31. doi: 10.1186/s12911-019-0750-y.
3
Utility-preserving anonymization for health data publishing.用于健康数据发布的效用保持匿名化
BMC Med Inform Decis Mak. 2017 Jul 11;17(1):104. doi: 10.1186/s12911-017-0499-0.
4
Differentially Private Histogram Publication For Dynamic Datasets: An Adaptive Sampling Approach.动态数据集的差分隐私直方图发布:一种自适应采样方法。
Proc ACM Int Conf Inf Knowl Manag. 2015 Oct;2015:1001-1010. doi: 10.1145/2806416.2806441.
5
Privacy-preserving heterogeneous health data sharing.隐私保护的异构健康数据共享。
J Am Med Inform Assoc. 2013 May 1;20(3):462-9. doi: 10.1136/amiajnl-2012-001027. Epub 2012 Dec 13.