• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

OMOP-CDM 数据库中重新识别风险的感知:一项横断面调查。

Perceived Risk of Re-Identification in OMOP-CDM Database: A Cross-Sectional Survey.

机构信息

Department of Information Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea.

Department of Biomedical Systems Informatics, Yonsei University College of Medicine, Seoul, Korea.

出版信息

J Korean Med Sci. 2022 Jul 4;37(26):e205. doi: 10.3346/jkms.2022.37.e205.

DOI:10.3346/jkms.2022.37.e205
PMID:35790207
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9259248/
Abstract

BACKGROUND

The advancement of information technology has immensely increased the quality and volume of health data. This has led to an increase in observational study, as well as to the threat of privacy invasion. Recently, a distributed research network based on the common data model (CDM) has emerged, enabling collaborative international medical research without sharing patient-level data. Although the CDM database for each institution is built inside a firewall, the risk of re-identification requires management. Hence, this study aims to elucidate the perceptions CDM users have towards CDM and risk management for re-identification.

METHODS

The survey, targeted to answer specific in-depth questions on CDM, was conducted from October to November 2020. We targeted well-experienced researchers who actively use CDM. Basic statistics (total number and percent) were computed for all covariates.

RESULTS

There were 33 valid respondents. Of these, 43.8% suggested additional anonymization was unnecessary beyond, "minimum cell count" policy, which obscures a cell with a value lower than certain number (usually 5) in shared results to minimize the liability of re-identification due to rare conditions. During extract-transform-load processes, 81.8% of respondents assumed structured data is under control from the risk of re-identification. However, respondents noted that date of birth and death were highly re-identifiable information. The majority of respondents (n = 22, 66.7%) conceded the possibility of identifier-contained unstructured data in the table.

CONCLUSION

Overall, CDM users generally attributed high reliability for privacy protection to the intrinsic nature of CDM. There was little demand for additional de-identification methods. However, unstructured data in the CDM were suspected to have risks. The necessity for a coordinating consortium to define and manage the re-identification risk of CDM was urged.

摘要

背景

信息技术的进步极大地提高了医疗数据的质量和数量。这导致观察性研究的增加,以及隐私侵犯的威胁。最近,出现了一种基于通用数据模型 (CDM) 的分布式研究网络,使国际医疗合作研究能够在不共享患者数据的情况下进行。虽然每个机构的 CDM 数据库都构建在防火墙内,但重新识别的风险仍需要管理。因此,本研究旨在阐明 CDM 用户对 CDM 的看法以及重新识别风险的管理。

方法

这项调查旨在回答有关 CDM 的具体深入问题,于 2020 年 10 月至 11 月进行。我们的目标是经验丰富且积极使用 CDM 的研究人员。对所有协变量进行了基本统计(总数和百分比)计算。

结果

共有 33 份有效回复。其中,43.8%的受访者认为,除了“最小单元格计数”政策之外,不需要进行额外的匿名化处理,该政策通过将共享结果中低于特定数字(通常为 5)的单元格模糊化,以最小化因罕见情况导致重新识别的责任。在提取-转换-加载过程中,81.8%的受访者认为结构化数据受到控制,不会有重新识别的风险。然而,受访者指出出生日期和死亡日期是高度可重新识别的信息。大多数受访者(n=22,66.7%)认为表中可能包含标识符的非结构化数据。

结论

总体而言,CDM 用户通常认为 CDM 的内在性质能够提供高度可靠的隐私保护。他们几乎没有要求使用额外的去识别方法。然而,CDM 中的非结构化数据被怀疑存在风险。需要一个协调的联盟来定义和管理 CDM 的重新识别风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/12ac70c50d65/jkms-37-e205-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/124a686265bd/jkms-37-e205-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/e0d082d9a0e3/jkms-37-e205-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/177971399efe/jkms-37-e205-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/d1a7f71ce496/jkms-37-e205-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/935e39010fe3/jkms-37-e205-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/b280f6d382ce/jkms-37-e205-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/12ac70c50d65/jkms-37-e205-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/124a686265bd/jkms-37-e205-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/e0d082d9a0e3/jkms-37-e205-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/177971399efe/jkms-37-e205-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/d1a7f71ce496/jkms-37-e205-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/935e39010fe3/jkms-37-e205-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/b280f6d382ce/jkms-37-e205-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/9259248/12ac70c50d65/jkms-37-e205-g007.jpg

相似文献

1
Perceived Risk of Re-Identification in OMOP-CDM Database: A Cross-Sectional Survey.OMOP-CDM 数据库中重新识别风险的感知:一项横断面调查。
J Korean Med Sci. 2022 Jul 4;37(26):e205. doi: 10.3346/jkms.2022.37.e205.
2
Proposal and Assessment of a De-Identification Strategy to Enhance Anonymity of the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) in a Public Cloud-Computing Environment: Anonymization of Medical Data Using Privacy Models.在公共云计算环境中增强观察性医疗结局伙伴关系通用数据模型(OMOP-CDM)匿名性的去标识策略的提出与评估:使用隐私模型对医疗数据进行匿名化。
J Med Internet Res. 2020 Nov 26;22(11):e19597. doi: 10.2196/19597.
3
Genomic Common Data Model for Seamless Interoperation of Biomedical Data in Clinical Practice: Retrospective Study.临床实践中生物医学数据无缝互操作的基因组通用数据模型:回顾性研究
J Med Internet Res. 2019 Mar 26;21(3):e13249. doi: 10.2196/13249.
4
Seamless EMR data access: Integrated governance, digital health and the OMOP-CDM.无缝电子病历数据访问:综合治理、数字健康和 OMOP-CDM。
BMJ Health Care Inform. 2024 Feb 21;31(1):e100953. doi: 10.1136/bmjhci-2023-100953.
5
An evaluation of the THIN database in the OMOP Common Data Model for active drug safety surveillance.THIN 数据库在 OMOP 通用数据模型中用于主动药物安全性监测的评估。
Drug Saf. 2013 Feb;36(2):119-34. doi: 10.1007/s40264-012-0009-3.
6
Exploring the potential of OMOP common data model for process mining in healthcare.探索 OMOP 通用数据模型在医疗保健流程挖掘中的潜力。
PLoS One. 2023 Jan 3;18(1):e0279641. doi: 10.1371/journal.pone.0279641. eCollection 2023.
7
Towards ETL Processes to OMOP CDM Using Metadata and Modularization.使用元数据和模块化实现 OMOP CDM 的 ETL 流程。
Stud Health Technol Inform. 2023 May 18;302:751-752. doi: 10.3233/SHTI230256.
8
Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases.将注册数据标准化为OMOP通用数据模型:来自三个肺动脉高压数据库的经验。
BMC Med Res Methodol. 2021 Nov 2;21(1):238. doi: 10.1186/s12874-021-01434-3.
9
Conversion of National Health Insurance Service-National Sample Cohort (NHIS-NSC) Database into Observational Medical Outcomes Partnership-Common Data Model (OMOP-CDM).将国民健康保险服务-全国样本队列(NHIS-NSC)数据库转换为观察性医疗结局合作组织-通用数据模型(OMOP-CDM)。
Stud Health Technol Inform. 2017;245:467-470.
10
Data harmonization and federated learning for multi-cohort dementia research using the OMOP common data model: A Netherlands consortium of dementia cohorts case study.利用 OMOP 通用数据模型进行多队列痴呆症研究的数据协调和联合学习:荷兰痴呆症队列联盟的案例研究。
J Biomed Inform. 2024 Jul;155:104661. doi: 10.1016/j.jbi.2024.104661. Epub 2024 May 26.

引用本文的文献

1
Machine-Learning Parsimonious Prediction Model for Diagnostic Screening of Severe Hematological Adverse Events in Cancer Patients Treated with PD-1/PD-L1 Inhibitors: Retrospective Observational Study by Using the Common Data Model.用于接受PD-1/PD-L1抑制剂治疗的癌症患者严重血液学不良事件诊断筛查的机器学习简约预测模型:使用通用数据模型的回顾性观察研究
Diagnostics (Basel). 2025 Jan 20;15(2):226. doi: 10.3390/diagnostics15020226.

本文引用的文献

1
Ensuring a safe(r) harbor: Excising personally identifiable information from structured electronic health record data.确保更安全的避风港:从结构化电子健康记录数据中删除个人身份信息。
J Clin Transl Sci. 2021 Dec 9;6(1):e10. doi: 10.1017/cts.2021.880. eCollection 2022.
2
Proposal and Assessment of a De-Identification Strategy to Enhance Anonymity of the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) in a Public Cloud-Computing Environment: Anonymization of Medical Data Using Privacy Models.在公共云计算环境中增强观察性医疗结局伙伴关系通用数据模型(OMOP-CDM)匿名性的去标识策略的提出与评估:使用隐私模型对医疗数据进行匿名化。
J Med Internet Res. 2020 Nov 26;22(11):e19597. doi: 10.2196/19597.
3
Association of Ticagrelor vs Clopidogrel With Net Adverse Clinical Events in Patients With Acute Coronary Syndrome Undergoing Percutaneous Coronary Intervention.
替格瑞洛与氯吡格雷对行经皮冠状动脉介入治疗的急性冠状动脉综合征患者净临床不良事件的影响。
JAMA. 2020 Oct 27;324(16):1640-1650. doi: 10.1001/jama.2020.16167.
4
Design and Refinement of a Data Quality Assessment Workflow for a Large Pediatric Research Network.大型儿科研究网络数据质量评估工作流程的设计与优化
EGEMS (Wash DC). 2019 Aug 1;7(1):36. doi: 10.5334/egems.294.
5
Choosing Among Common Data Models for Real-World Data Analyses Fit for Making Decisions About the Effectiveness of Medical Products.选择适用于医疗产品有效性决策的真实世界数据分析常用数据模型。
Clin Pharmacol Ther. 2020 Apr;107(4):827-833. doi: 10.1002/cpt.1577. Epub 2019 Aug 25.
6
Cross-Network Directory Service: Infrastructure to enable collaborations across distributed research networks.跨网络目录服务:实现分布式研究网络间协作的基础设施。
Learn Health Syst. 2019 Feb 14;3(2):e10187. doi: 10.1002/lrh2.10187. eCollection 2019 Apr.
7
Evaluating common data models for use with a longitudinal community registry.评估用于纵向社区登记处的通用数据模型。
J Biomed Inform. 2016 Dec;64:333-341. doi: 10.1016/j.jbi.2016.10.016. Epub 2016 Oct 29.
8
Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.观察性健康数据科学与信息学(OHDSI):观察性研究人员的机遇。
Stud Health Technol Inform. 2015;216:574-8.
9
Individual privacy versus public good: protecting confidentiality in health research.个人隐私与公共利益:保护健康研究中的保密性
Stat Med. 2015 Oct 15;34(23):3081-103. doi: 10.1002/sim.6543. Epub 2015 Jun 5.
10
Validation of a common data model for active safety surveillance research.主动安全监测研究通用数据模型的验证。
J Am Med Inform Assoc. 2012 Jan-Feb;19(1):54-60. doi: 10.1136/amiajnl-2011-000376. Epub 2011 Oct 28.