• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

保护生物医学数据免受属性泄露。

Protecting Biomedical Data Against Attribute Disclosure.

作者信息

Spengler Helmut, Prasser Fabian

机构信息

Technical University of Munich, School of Medicine, Institute of Medical Informatics, Statistics and Epidemiology, Munich, Germany.

出版信息

Stud Health Technol Inform. 2019 Sep 3;267:207-214. doi: 10.3233/SHTI190829.

DOI:10.3233/SHTI190829
PMID:31483274
Abstract

Modern medical research requires access to patient-level data of significant detail and volume. In this context, privacy concerns and legal requirements demand careful consideration. Data anonymization, which means that data is transformed to reduce privacy risks, is an important building block of data protection concepts. However, common methods of data anonymization often fail to protect data against inference of sensitive attribute values (also called attribute disclosure). Measures against such attacks have been developed, but it has been argued that they are of little practical relevance, as they involve significant data transformations which reduce output data utility to an unacceptable degree. In this article, we present an experimental study of the degree of protection and impact on data utility provided by different approaches for protecting biomedical data from attribute disclosure. We quantified the utility and privacy risks of datasets that have been protected using different anonymization methods and parameterizations. We put the results into relation with trivial baseline approaches, visualized them in the form of risk-utility curves and analyzed basic statistical properties of the sensitive attributes (e.g. the skewness of their distribution). Our results confirm that it is difficult to protect data from attribute disclosure, but they also indicate that it can be possible to achieve reasonable degrees of protection when appropriate methods are chosen based on data characteristics. While it is hard to give general recommendations, the approach presented in this article and the tools that we have used can be helpful for deciding how a given dataset can best be protected in a specific usage scenario.

摘要

现代医学研究需要获取具有大量详细信息的患者层面数据。在这种情况下,隐私问题和法律要求需要仔细考量。数据匿名化,即对数据进行转换以降低隐私风险,是数据保护概念的一个重要组成部分。然而,常见的数据匿名化方法往往无法保护数据免受敏感属性值推断(也称为属性泄露)的影响。针对此类攻击的防范措施已经有所发展,但有人认为这些措施实际意义不大,因为它们涉及重大的数据转换,会将输出数据的效用降低到不可接受的程度。在本文中,我们针对不同方法保护生物医学数据免遭属性泄露所提供的保护程度及其对数据效用的影响进行了一项实验研究。我们对使用不同匿名化方法和参数设置进行保护的数据集的效用和隐私风险进行了量化。我们将结果与简单的基线方法进行比较,以风险 - 效用曲线的形式进行可视化展示,并分析了敏感属性的基本统计特性(例如其分布的偏度)。我们的结果证实,保护数据免遭属性泄露并非易事,但也表明,根据数据特征选择合适的方法时,可以实现合理程度的保护。虽然很难给出一般性建议,但本文所介绍的方法以及我们所使用的工具有助于确定在特定使用场景下如何最好地保护给定数据集。

相似文献

1
Protecting Biomedical Data Against Attribute Disclosure.保护生物医学数据免受属性泄露。
Stud Health Technol Inform. 2019 Sep 3;267:207-214. doi: 10.3233/SHTI190829.
2
Efficient Protection of Health Data from Sensitive Attribute Disclosure.有效保护健康数据免遭敏感属性泄露。
Stud Health Technol Inform. 2020 Jun 16;270:193-197. doi: 10.3233/SHTI200149.
3
The cost of quality: Implementing generalization and suppression for anonymizing biomedical data with minimal information loss.质量成本:在信息损失最小化的情况下,对生物医学数据进行匿名化处理时实施泛化和抑制。
J Biomed Inform. 2015 Dec;58:37-48. doi: 10.1016/j.jbi.2015.09.007. Epub 2015 Sep 15.
4
Privacy preserving data anonymization of spontaneous ADE reporting system dataset.自发不良药物事件报告系统数据集的隐私保护数据匿名化
BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):58. doi: 10.1186/s12911-016-0293-4.
5
Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.生物医学文献中匿名化和去识别化的使用与理解:范围综述
J Med Internet Res. 2019 May 31;21(5):e13484. doi: 10.2196/13484.
6
The Importance of Context: Risk-based De-identification of Biomedical Data.背景的重要性:基于风险的生物医学数据去识别化
Methods Inf Med. 2016 Aug 5;55(4):347-55. doi: 10.3414/ME16-01-0012. Epub 2016 Jun 20.
7
Reconsidering Anonymization-Related Concepts and the Term "Identification" Against the Backdrop of the European Legal Framework.在欧洲法律框架背景下重新审视与匿名化相关的概念及“识别”一词
Biopreserv Biobank. 2016 Oct;14(5):367-374. doi: 10.1089/bio.2015.0100. Epub 2016 Apr 22.
8
Privacy-enhancing ETL-processes for biomedical data.用于生物医学数据的隐私增强型 ETL 流程。
Int J Med Inform. 2019 Jun;126:72-81. doi: 10.1016/j.ijmedinf.2019.03.006. Epub 2019 Mar 23.
9
Better Safe than Sorry - Implementing Reliable Health Data Anonymization.安全总比遗憾好——实施可靠的健康数据匿名化
Stud Health Technol Inform. 2020 Jun 16;270:68-72. doi: 10.3233/SHTI200124.
10
Privacy-Preserving Anonymity for Periodical Releases of Spontaneous Adverse Drug Event Reporting Data: Algorithm Development and Validation.自发不良药物事件报告数据定期发布的隐私保护匿名性:算法开发与验证
JMIR Med Inform. 2021 Oct 28;9(10):e28752. doi: 10.2196/28752.

引用本文的文献

1
Using preregistration as a tool for transparent fNIRS study design.将预注册作为功能近红外光谱研究透明设计的一种工具。
Neurophotonics. 2023 Apr;10(2):023515. doi: 10.1117/1.NPh.10.2.023515. Epub 2023 Mar 8.
2
Privacy-preserving data sharing infrastructures for medical research: systematization and comparison.用于医学研究的隐私保护数据共享基础架构:系统梳理与比较。
BMC Med Inform Decis Mak. 2021 Aug 12;21(1):242. doi: 10.1186/s12911-021-01602-x.