• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

澳大利亚初级保健数据中的 OMOP 通用数据模型:构建一个高质量、可用于研究的协调数据集。

The OMOP common data model in Australian primary care data: Building a quality research ready harmonised dataset.

机构信息

Health & Biomedical Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, Faculty of Medicine, Dentistry & Health Sciences, The University of Melbourne, Parkville, Victoria, Australia.

出版信息

PLoS One. 2024 Apr 18;19(4):e0301557. doi: 10.1371/journal.pone.0301557. eCollection 2024.

DOI:10.1371/journal.pone.0301557
PMID:38635655
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11025850/
Abstract

BACKGROUND

The use of routinely collected health data for secondary research purposes is increasingly recognised as a methodology that advances medical research, improves patient outcomes, and guides policy. This secondary data, as found in electronic medical records (EMRs), can be optimised through conversion into a uniform data structure to enable analysis alongside other comparable health metric datasets. This can be achieved with the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM), which employs a standardised vocabulary to facilitate systematic analysis across various observational databases. The concept behind the OMOP-CDM is the conversion of data into a common format through the harmonisation of terminologies, vocabularies, and coding schemes within a unique repository. The OMOP model enhances research capacity through the development of shared analytic and prediction techniques; pharmacovigilance for the active surveillance of drug safety; and 'validation' analyses across multiple institutions across Australia, the United States, Europe, and the Asia Pacific. In this research, we aim to investigate the use of the open-source OMOP-CDM in the PATRON primary care data repository.

METHODS

We used standard structured query language (SQL) to construct, extract, transform, and load scripts to convert the data to the OMOP-CDM. The process of mapping distinct free-text terms extracted from various EMRs presented a substantial challenge, as many terms could not be automatically matched to standard vocabularies through direct text comparison. This resulted in a number of terms that required manual assignment. To address this issue, we implemented a strategy where our clinical mappers were instructed to focus only on terms that appeared with sufficient frequency. We established a specific threshold value for each domain, ensuring that more than 95% of all records were linked to an approved vocabulary like SNOMED once appropriate mapping was completed. To assess the data quality of the resultant OMOP dataset we utilised the OHDSI Data Quality Dashboard (DQD) to evaluate the plausibility, conformity, and comprehensiveness of the data in the PATRON repository according to the Kahn framework.

RESULTS

Across three primary care EMR systems we converted data on 2.03 million active patients to version 5.4 of the OMOP common data model. The DQD assessment involved a total of 3,570 individual evaluations. Each evaluation compared the outcome against a predefined threshold. A 'FAIL' occurred when the percentage of non-compliant rows exceeded the specified threshold value. In this assessment of the primary care OMOP database described here, we achieved an overall pass rate of 97%.

CONCLUSION

The OMOP CDM's widespread international use, support, and training provides a well-established pathway for data standardisation in collaborative research. Its compatibility allows the sharing of analysis packages across local and international research groups, which facilitates rapid and reproducible data comparisons. A suite of open-source tools, including the OHDSI Data Quality Dashboard (Version 1.4.1), supports the model. Its simplicity and standards-based approach facilitates adoption and integration into existing data processes.

摘要

背景

将常规收集的健康数据用于二次研究目的,这种方法已逐渐被认可,因为它可以促进医学研究、改善患者预后并指导政策制定。在电子病历 (EMR) 中发现的这种二次数据,可以通过转换为统一的数据结构,与其他可比健康指标数据集一起进行分析,从而得到优化。这可以通过观察医学结局伙伴关系通用数据模型(OMOP-CDM)实现,该模型采用标准化词汇,以便在各种观察数据库中进行系统分析。OMOP-CDM 的概念是通过在唯一存储库中协调术语、词汇和编码方案,将数据转换为通用格式。该模型通过开发共享分析和预测技术、药物警戒以主动监测药物安全性以及在澳大利亚、美国、欧洲和亚太地区的多个机构进行“验证”分析,增强了研究能力。在这项研究中,我们旨在调查在 PATRON 初级保健数据存储库中使用开源 OMOP-CDM 的情况。

方法

我们使用标准的结构化查询语言 (SQL) 构建、提取、转换和加载脚本,将数据转换为 OMOP-CDM。从各种 EMR 中提取的独特自由文本术语的映射过程提出了重大挑战,因为许多术语无法通过直接文本比较自动匹配到标准词汇。这导致许多术语需要手动分配。为了解决这个问题,我们实施了一项策略,指示我们的临床映射器仅关注出现频率足够高的术语。我们为每个域设定了一个特定的阈值,以确保在完成适当的映射后,超过 95%的所有记录都可以链接到 SNOMED 等批准的词汇。为了评估生成的 OMOP 数据集的数据质量,我们使用 OHDSI 数据质量仪表板 (DQD) 根据 Kahn 框架评估 PATRON 存储库中数据的合理性、一致性和全面性。

结果

在三个初级保健 EMR 系统中,我们将 203 万活跃患者的数据转换为 OMOP 通用数据模型的版本 5.4。DQD 评估共涉及 3570 项单项评估。每项评估都将结果与预定义阈值进行比较。当不符合规定的行数百分比超过规定阈值时,就会出现“FAIL”。在对这里描述的初级保健 OMOP 数据库的评估中,我们实现了 97%的总体通过率。

结论

OMOP CDM 在国际上的广泛使用、支持和培训为协作研究中的数据标准化提供了一个成熟的途径。其兼容性允许本地和国际研究小组共享分析包,从而促进快速和可重复的数据比较。一套开源工具,包括 OHDSI 数据质量仪表板(版本 1.4.1),支持该模型。其简单性和基于标准的方法促进了采用和集成到现有数据流程中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b394/11025850/82b9281681b5/pone.0301557.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b394/11025850/bd78170578f8/pone.0301557.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b394/11025850/c6ef5e9e1bea/pone.0301557.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b394/11025850/82b9281681b5/pone.0301557.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b394/11025850/bd78170578f8/pone.0301557.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b394/11025850/c6ef5e9e1bea/pone.0301557.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b394/11025850/82b9281681b5/pone.0301557.g003.jpg

相似文献

1
The OMOP common data model in Australian primary care data: Building a quality research ready harmonised dataset.澳大利亚初级保健数据中的 OMOP 通用数据模型:构建一个高质量、可用于研究的协调数据集。
PLoS One. 2024 Apr 18;19(4):e0301557. doi: 10.1371/journal.pone.0301557. eCollection 2024.
2
Assessing the Use of German Claims Data Vocabularies for Research in the Observational Medical Outcomes Partnership Common Data Model: Development and Evaluation Study.评估德国索赔数据词汇表在观察性医疗结局合作组织通用数据模型研究中的应用:开发与评估研究
JMIR Med Inform. 2023 Nov 7;11:e47959. doi: 10.2196/47959.
3
Transforming French Electronic Health Records into the Observational Medical Outcome Partnership's Common Data Model: A Feasibility Study.将法国电子健康记录转化为观察性医疗结局伙伴关系的通用数据模型:一项可行性研究。
Appl Clin Inform. 2020 Jan;11(1):13-22. doi: 10.1055/s-0039-3402754. Epub 2020 Jan 8.
4
Transforming Primary Care Data Into the Observational Medical Outcomes Partnership Common Data Model: Development and Usability Study.将初级保健数据转化为观察性医疗结果合作组织通用数据模型:开发与可用性研究。
JMIR Med Inform. 2024 Aug 13;12:e49542. doi: 10.2196/49542.
5
Seamless EMR data access: Integrated governance, digital health and the OMOP-CDM.无缝电子病历数据访问:综合治理、数字健康和 OMOP-CDM。
BMJ Health Care Inform. 2024 Feb 21;31(1):e100953. doi: 10.1136/bmjhci-2023-100953.
6
Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset.电子健康记录和问卷数据向 OMOP CDM 的转换:使用 SG_T2DM 数据集的可行性研究。
Appl Clin Inform. 2021 Aug;12(4):757-767. doi: 10.1055/s-0041-1732301. Epub 2021 Aug 11.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Transforming Anesthesia Data Into the Observational Medical Outcomes Partnership Common Data Model: Development and Usability Study.将麻醉数据转化为观察性医疗结局伙伴关系通用数据模型:开发和可用性研究。
J Med Internet Res. 2021 Oct 29;23(10):e29259. doi: 10.2196/29259.
9
Conceptual design of a generic data harmonization process for OMOP common data model.通用数据协调流程的概念设计,适用于 OMOP 通用数据模型。
BMC Med Inform Decis Mak. 2024 Feb 26;24(1):58. doi: 10.1186/s12911-024-02458-7.
10
Eos and OMOCL: Towards a seamless integration of openEHR records into the OMOP Common Data Model.Eos 和 OMOCL:实现 openEHR 记录与 OMOP 通用数据模型的无缝集成。
J Biomed Inform. 2023 Aug;144:104437. doi: 10.1016/j.jbi.2023.104437. Epub 2023 Jul 12.

引用本文的文献

1
Impact of a Meds-to-Beds PCSK9i Initiation Program on LDL-C in Patients Undergoing ASCVD Revascularization.药物到病床的前蛋白转化酶枯草溶菌素9抑制剂(PCSK9i)启动计划对接受动脉粥样硬化性心血管疾病(ASCVD)血运重建患者低密度脂蛋白胆固醇(LDL-C)的影响。
JACC Adv. 2025 Aug 19;4(9):102075. doi: 10.1016/j.jacadv.2025.102075.

本文引用的文献

1
Seamless EMR data access: Integrated governance, digital health and the OMOP-CDM.无缝电子病历数据访问:综合治理、数字健康和 OMOP-CDM。
BMJ Health Care Inform. 2024 Feb 21;31(1):e100953. doi: 10.1136/bmjhci-2023-100953.
2
Applying the OMOP Common Data Model to Facilitate Benefit-Risk Assessments of Medicinal Products Using Real-World Data from Singapore and South Korea.应用OMOP通用数据模型,利用来自新加坡和韩国的真实世界数据促进药品的获益-风险评估。
Healthc Inform Res. 2022 Apr;28(2):112-122. doi: 10.4258/hir.2022.28.2.112. Epub 2022 Apr 30.
3
Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases.
将注册数据标准化为OMOP通用数据模型:来自三个肺动脉高压数据库的经验。
BMC Med Res Methodol. 2021 Nov 2;21(1):238. doi: 10.1186/s12874-021-01434-3.
4
Linked electronic health records for research on a nationwide cohort of more than 54 million people in England: data resource.用于研究英格兰超过 5400 万人的全国队列的关联电子健康记录:数据资源。
BMJ. 2021 Apr 7;373:n826. doi: 10.1136/bmj.n826.
5
Common Problems, Common Data Model Solutions: Evidence Generation for Health Technology Assessment.常见问题,通用数据模型解决方案:用于卫生技术评估的证据生成。
Pharmacoeconomics. 2021 Mar;39(3):275-285. doi: 10.1007/s40273-020-00981-9. Epub 2020 Dec 18.
6
Principles of Clinical Ethics and Their Application to Practice.临床伦理学原则及其在实践中的应用。
Med Princ Pract. 2021;30(1):17-28. doi: 10.1159/000509119. Epub 2020 Jun 4.
7
Ontoserver: a syndicated terminology server.Ontoserver:一个联合术语服务器。
J Biomed Semantics. 2018 Sep 17;9(1):24. doi: 10.1186/s13326-018-0191-z.
8
A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data.电子健康记录数据二次使用的统一数据质量评估术语和框架。
EGEMS (Wash DC). 2016 Sep 11;4(1):1244. doi: 10.13063/2327-9214.1244. eCollection 2016.
9
Data quality assessment framework to assess electronic medical record data for use in research.用于评估电子病历数据以供研究使用的数据质量评估框架。
Int J Med Inform. 2016 Jun;90:40-7. doi: 10.1016/j.ijmedinf.2016.03.006. Epub 2016 Mar 24.
10
Linked Hospital and Primary Care Database Analysis of the Incidence and Impact of Psychiatric Morbidity Following Gastrointestinal Cancer Surgery in England.英国胃肠道癌手术后精神疾病发病率及影响的医院与基层医疗数据库联合分析
Ann Surg. 2016 Jul;264(1):93-9. doi: 10.1097/SLA.0000000000001415.