• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

健康数据去标识化质量模型的实验比较

An Experimental Comparison of Quality Models for Health Data De-Identification.

作者信息

Eicher Johanna, Kuhn Klaus A, Prasser Fabian

机构信息

Institute of Medical Statistics and Epidemiology, University Hospital rechts der Isar, Technical University of Munich, Germany.

出版信息

Stud Health Technol Inform. 2017;245:704-708.

PMID:29295189
Abstract

When individual-level health data are shared in biomedical research, the privacy of patients must be protected. This is typically achieved by data de-identification methods, which transform data in such a way that formal privacy requirements are met. In the process, it is important to minimize the loss of information to maintain data quality. Although several models have been proposed for measuring this aspect, it remains unclear which model is best suited for which application. We have therefore performed an extensive experimental comparison. We first implemented several common quality models into the ARX de-identification tool for biomedical data. We then used each model to de-identify a patient discharge dataset covering almost 4 million cases and outputs were analyzed to measure the impact of different quality models on real-world applications. Our results show that different models are best suited for specific applications, but that one model (Non-Uniform Entropy) is particularly well suited for general-purpose use.

摘要

当个体层面的健康数据在生物医学研究中共享时,患者的隐私必须得到保护。这通常通过数据去识别方法来实现,这些方法以满足正式隐私要求的方式对数据进行转换。在此过程中,尽量减少信息损失以保持数据质量非常重要。尽管已经提出了几种模型来衡量这一方面,但仍不清楚哪种模型最适合哪种应用。因此,我们进行了广泛的实验比较。我们首先在用于生物医学数据的ARX去识别工具中实现了几种常见的质量模型。然后,我们使用每个模型对包含近400万个病例的患者出院数据集进行去识别,并对输出进行分析,以衡量不同质量模型对实际应用的影响。我们的结果表明,不同的模型最适合特定的应用,但有一种模型(非均匀熵)特别适合通用用途。

相似文献

1
An Experimental Comparison of Quality Models for Health Data De-Identification.健康数据去标识化质量模型的实验比较
Stud Health Technol Inform. 2017;245:704-708.
2
The Importance of Context: Risk-based De-identification of Biomedical Data.背景的重要性:基于风险的生物医学数据去识别化
Methods Inf Med. 2016 Aug 5;55(4):347-55. doi: 10.3414/ME16-01-0012. Epub 2016 Jun 20.
3
A Generic Method for Assessing the Quality of De-Identified Health Data.一种评估去标识化健康数据质量的通用方法。
Stud Health Technol Inform. 2016;228:312-6.
4
Better Safe than Sorry - Implementing Reliable Health Data Anonymization.安全总比遗憾好——实施可靠的健康数据匿名化
Stud Health Technol Inform. 2020 Jun 16;270:68-72. doi: 10.3233/SHTI200124.
5
Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.生物医学文献中匿名化和去识别化的使用与理解:范围综述
J Med Internet Res. 2019 May 31;21(5):e13484. doi: 10.2196/13484.
6
Privacy preserving data anonymization of spontaneous ADE reporting system dataset.自发不良药物事件报告系统数据集的隐私保护数据匿名化
BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):58. doi: 10.1186/s12911-016-0293-4.
7
A comprehensive tool for creating and evaluating privacy-preserving biomedical prediction models.用于创建和评估隐私保护的生物医学预测模型的综合工具。
BMC Med Inform Decis Mak. 2020 Feb 11;20(1):29. doi: 10.1186/s12911-020-1041-3.
8
Efficient and effective pruning strategies for health data de-identification.用于健康数据去识别化的高效且有效的修剪策略。
BMC Med Inform Decis Mak. 2016 Apr 30;16:49. doi: 10.1186/s12911-016-0287-2.
9
Proposal and Assessment of a De-Identification Strategy to Enhance Anonymity of the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) in a Public Cloud-Computing Environment: Anonymization of Medical Data Using Privacy Models.在公共云计算环境中增强观察性医疗结局伙伴关系通用数据模型(OMOP-CDM)匿名性的去标识策略的提出与评估:使用隐私模型对医疗数据进行匿名化。
J Med Internet Res. 2020 Nov 26;22(11):e19597. doi: 10.2196/19597.
10
The Costs of Anonymization: Case Study Using Clinical Data.匿名化的成本:使用临床数据的案例研究
J Med Internet Res. 2024 Apr 24;26:e49445. doi: 10.2196/49445.

引用本文的文献

1
Exploring the tradeoff between data privacy and utility with a clinical data analysis use case.探讨临床数据分析用例中数据隐私与效用之间的权衡。
BMC Med Inform Decis Mak. 2024 May 30;24(1):147. doi: 10.1186/s12911-024-02545-9.
2
Best practices for data management and sharing in experimental biomedical research.实验生物医学研究中数据管理和共享的最佳实践。
Physiol Rev. 2024 Jul 1;104(3):1387-1408. doi: 10.1152/physrev.00043.2023. Epub 2024 Mar 7.