• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种可扩展且实用的高质量健康数据安全共享方法。

A Scalable and Pragmatic Method for the Safe Sharing of High-Quality Health Data.

出版信息

IEEE J Biomed Health Inform. 2018 Mar;22(2):611-622. doi: 10.1109/JBHI.2017.2676880. Epub 2017 Mar 23.

DOI:10.1109/JBHI.2017.2676880
PMID:28358693
Abstract

The sharing of sensitive personal health data is an important aspect of biomedical research. Methods of data de-identification are often used in this process to trade the granularity of data off against privacy risks. However, traditional approaches, such as HIPAA safe harbor or -anonymization, often fail to provide data with sufficient quality. Alternatively, data can be de-identified only to a degree which still allows us to use it as required, e.g., to carry out specific analyses. Controlled environments, which restrict the ways recipients can interact with the data, can then be used to cope with residual risks. The contributions of this article are twofold. First, we present a method for implementing controlled data sharing environments and analyze its privacy properties. Second, we present a de-identification method which is specifically suited for sanitizing health data which is to be shared in such environments. Traditional de-identification methods control the uniqueness of records in a dataset. The basic idea of our approach is to reduce the probability that a record in a dataset has characteristics which are unique within the underlying population. As the characteristics of the population are typically not known, we have implemented a pragmatic solution in which properties of the population are modeled with statistical methods. We have further developed an accompanying process for evaluating and validating the degree of protection provided. The results of an extensive experimental evaluation show that our approach enables the safe sharing of high-quality data and that it is highly scalable.

摘要

敏感个人健康数据的共享是生物医学研究的一个重要方面。在这个过程中,通常使用数据去识别方法来权衡数据的粒度和隐私风险。然而,传统的方法,如 HIPAA 安全港或匿名化,往往不能提供足够质量的数据。或者,数据只能被去识别到一定程度,仍然允许我们按照要求使用它,例如,进行特定的分析。然后,可以使用受控制的环境来限制收件人与数据交互的方式,以应对剩余的风险。本文的贡献有两个方面。首先,我们提出了一种实现受控数据共享环境的方法,并分析了其隐私属性。其次,我们提出了一种去识别方法,专门用于清理要在这种环境中共享的健康数据。传统的去识别方法控制数据集中记录的唯一性。我们方法的基本思想是降低数据集内记录具有在基础人群中唯一特征的概率。由于人群的特征通常是未知的,我们已经实现了一个实用的解决方案,其中使用统计方法对人群的特征进行建模。我们进一步开发了一个伴随的过程来评估和验证所提供的保护程度。广泛的实验评估结果表明,我们的方法能够安全地共享高质量的数据,并且具有高度的可扩展性。

相似文献

1
A Scalable and Pragmatic Method for the Safe Sharing of High-Quality Health Data.一种可扩展且实用的高质量健康数据安全共享方法。
IEEE J Biomed Health Inform. 2018 Mar;22(2):611-622. doi: 10.1109/JBHI.2017.2676880. Epub 2017 Mar 23.
2
The Importance of Context: Risk-based De-identification of Biomedical Data.背景的重要性:基于风险的生物医学数据去识别化
Methods Inf Med. 2016 Aug 5;55(4):347-55. doi: 10.3414/ME16-01-0012. Epub 2016 Jun 20.
3
Never too old for anonymity: a statistical standard for demographic data sharing via the HIPAA Privacy Rule.永远不要因为年龄而放弃匿名:通过 HIPAA 隐私规则共享人口统计数据的统计标准。
J Am Med Inform Assoc. 2011 Jan-Feb;18(1):3-10. doi: 10.1136/jamia.2010.004622.
4
The Costs of Anonymization: Case Study Using Clinical Data.匿名化的成本:使用临床数据的案例研究
J Med Internet Res. 2024 Apr 24;26:e49445. doi: 10.2196/49445.
5
Efficient and effective pruning strategies for health data de-identification.用于健康数据去识别化的高效且有效的修剪策略。
BMC Med Inform Decis Mak. 2016 Apr 30;16:49. doi: 10.1186/s12911-016-0287-2.
6
The project data sphere initiative: accelerating cancer research by sharing data.项目数据领域计划:通过数据共享加速癌症研究
Oncologist. 2015 May;20(5):464-e20. doi: 10.1634/theoncologist.2014-0431. Epub 2015 Apr 15.
7
An Experimental Comparison of Quality Models for Health Data De-Identification.健康数据去标识化质量模型的实验比较
Stud Health Technol Inform. 2017;245:704-708.
8
A Generic Method for Assessing the Quality of De-Identified Health Data.一种评估去标识化健康数据质量的通用方法。
Stud Health Technol Inform. 2016;228:312-6.
9
Security controls in an integrated Biobank to protect privacy in data sharing: rationale and study design.综合生物样本库中保护数据共享隐私的安全控制措施:基本原理与研究设计。
BMC Med Inform Decis Mak. 2017 Jul 6;17(1):100. doi: 10.1186/s12911-017-0494-5.
10
Reducing patient re-identification risk for laboratory results within research datasets.降低研究数据集内实验室结果的患者再识别风险。
J Am Med Inform Assoc. 2013 Jan 1;20(1):95-101. doi: 10.1136/amiajnl-2012-001026. Epub 2012 Jul 21.

引用本文的文献

1
The Costs of Anonymization: Case Study Using Clinical Data.匿名化的成本:使用临床数据的案例研究
J Med Internet Res. 2024 Apr 24;26:e49445. doi: 10.2196/49445.
2
[Artificial intelligence and secure use of health data in the KI-FDZ project: anonymization, synthetization, and secure processing of real-world data].[人工智能与KI-FDZ项目中健康数据的安全使用:现实世界数据的匿名化、合成及安全处理]
Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2024 Feb;67(2):171-179. doi: 10.1007/s00103-023-03823-z. Epub 2024 Jan 4.
3
Balancing Personal Privacy and Public Safety During COVID-19: The Case of South Korea.
新冠疫情期间平衡个人隐私与公共安全:以韩国为例。
IEEE Access. 2020 Sep 22;8:171325-171333. doi: 10.1109/ACCESS.2020.3025971. eCollection 2020.
4
A comprehensive tool for creating and evaluating privacy-preserving biomedical prediction models.用于创建和评估隐私保护的生物医学预测模型的综合工具。
BMC Med Inform Decis Mak. 2020 Feb 11;20(1):29. doi: 10.1186/s12911-020-1041-3.
5
NPMA: A Novel Privacy-Preserving Mutual Authentication in TMIS for Mobile Edge-Cloud Architecture.NPMA:移动边缘云架构中 TMIS 的一种新的隐私保护的相互认证方法。
J Med Syst. 2019 Sep 14;43(10):318. doi: 10.1007/s10916-019-1444-9.
6
Willingness to Participate in Health Information Networks with Diverse Data Use: Evaluating Public Perspectives.参与具有多样化数据用途的健康信息网络的意愿:评估公众观点。
EGEMS (Wash DC). 2019 Jul 25;7(1):33. doi: 10.5334/egems.288.
7
Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.生物医学文献中匿名化和去识别化的使用与理解:范围综述
J Med Internet Res. 2019 May 31;21(5):e13484. doi: 10.2196/13484.
8
Data Integration for Future Medicine (DIFUTURE).未来医学数据集成(DIFUTURE)
Methods Inf Med. 2018 Jul;57(S 01):e57-e65. doi: 10.3414/ME17-02-0022. Epub 2018 Jul 17.