• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

All Of Us 研究计划的数据模型协调:将 i2b2 数据转换为 OMOP 通用数据模型。

Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model.

机构信息

Research Information Science and Computing, Partners Healthcare, Boston, Massachusetts, United States of America.

Harvard Medical School, Boston, Massachusetts, United States of America.

出版信息

PLoS One. 2019 Feb 19;14(2):e0212463. doi: 10.1371/journal.pone.0212463. eCollection 2019.

DOI:10.1371/journal.pone.0212463
PMID:30779778
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6380544/
Abstract

BACKGROUND

The All Of Us Research Program (AOU) is building a nationwide cohort of one million patients' EHR and genomic data. Data interoperability is paramount to the program's success. AOU is standardizing its EHR data around the Observational Medical Outcomes Partnership (OMOP) data model. OMOP is one of several standard data models presently used in national-scale initiatives. Each model is unique enough to make interoperability difficult. The i2b2 data warehousing and analytics platform is used at over 200 sites worldwide, which uses a flexible ontology-driven approach for data storage. We previously demonstrated this ontology system can drive data reconfiguration, to transform data into new formats without site-specific programming. We previously implemented this on our 12-site Accessible Research Commons for Health (ARCH) network to transform i2b2 into the Patient Centered Outcomes Research Network model.

METHODS AND RESULTS

Here, we leverage our investment in i2b2 high-performance transformations to support the AOU OMOP data pipeline. Because the ARCH ontology has gained widespread national interest (through the Accrual to Clinical Trials network, other PCORnet networks, and the Nebraska Lexicon), we leveraged sites' existing investments into this standard ontology. We developed an i2b2-to-OMOP transformation, driven by the ARCH-OMOP ontology and the OMOP concept mapping dictionary. We demonstrated and validated our approach in the AOU New England HPO (NEHPO). First, we transformed into OMOP a fake patient dataset in i2b2 and verified through AOU tools that the data was structurally compliant with OMOP. We then transformed a subset of data in the Partners Healthcare data warehouse into OMOP. We developed a checklist of assessments to ensure the transformed data had self-integrity (e.g., the distributions have an expected shape and required fields are populated), using OMOP's visual Achilles data quality tool. This i2b2-to-OMOP transformation is being used to send NEHPO production data to AOU. It is open-source and ready for use by other research projects.

摘要

背景

全美研究计划(AOU)正在建立一个包含一百万名患者电子病历和基因组数据的全国性队列。数据互操作性对该计划的成功至关重要。AOU 正在围绕观察医学结局伙伴关系(OMOP)数据模型标准化其电子病历数据。OMOP 是目前在国家规模计划中使用的几个标准数据模型之一。每个模型都有足够的独特性,使得互操作性变得困难。i2b2 数据仓库和分析平台在全球 200 多个站点使用,该平台使用灵活的基于本体的方法进行数据存储。我们之前证明了这个本体系统可以驱动数据重构,无需特定于站点的编程即可将数据转换为新格式。我们之前在我们的 12 个可访问研究健康共同体(ARCH)网络上实现了这一点,将 i2b2 转换为患者为中心的结果研究网络模型。

方法和结果

在这里,我们利用我们在 i2b2 高性能转换方面的投资来支持 AOU OMOP 数据管道。由于 ARCH 本体已经获得了广泛的全国性关注(通过临床试验网络、其他 PCORnet 网络和内布拉斯加州词典),我们利用站点对这个标准本体的现有投资。我们开发了一个由 ARCH-OMOP 本体和 OMOP 概念映射字典驱动的 i2b2 到 OMOP 的转换。我们在 AOU 新英格兰 HPO(NEHPO)中展示和验证了我们的方法。首先,我们将一个虚假患者数据集从 i2b2 转换为 OMOP,并通过 AOU 工具验证数据在结构上符合 OMOP。然后,我们将合作伙伴医疗保健数据仓库中的一部分数据转换为 OMOP。我们开发了一个检查表来确保转换后的数据具有自我完整性(例如,分布具有预期的形状,并且必填字段已填充),使用 OMOP 的视觉 Achilles 数据质量工具。这个 i2b2 到 OMOP 的转换正在用于将 NEHPO 生产数据发送到 AOU。它是开源的,并准备供其他研究项目使用。

相似文献

1
Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model.All Of Us 研究计划的数据模型协调:将 i2b2 数据转换为 OMOP 通用数据模型。
PLoS One. 2019 Feb 19;14(2):e0212463. doi: 10.1371/journal.pone.0212463. eCollection 2019.
2
Data interchange using i2b2.使用i2b2进行数据交换。
J Am Med Inform Assoc. 2016 Sep;23(5):909-15. doi: 10.1093/jamia/ocv188. Epub 2016 Feb 5.
3
Web services for data warehouses: OMOP and PCORnet on i2b2.数据仓库的 Web 服务:i2b2 上的 OMOP 和 PCORnet。
J Am Med Inform Assoc. 2018 Oct 1;25(10):1331-1338. doi: 10.1093/jamia/ocy093.
4
Evaluation of OMOP CDM, i2b2 and ICGC ARGO for supporting data harmonization in a breast cancer use case of a multicentric European AI project.评估OMOP通用数据模型、i2b2和国际癌症基因组联盟ARGO,以支持一个多中心欧洲人工智能项目乳腺癌用例中的数据协调。
J Biomed Inform. 2023 Nov;147:104505. doi: 10.1016/j.jbi.2023.104505. Epub 2023 Sep 27.
5
Developing an ETL tool for converting the PCORnet CDM into the OMOP CDM to facilitate the COVID-19 data integration.开发一个 ETL 工具,用于将 PCORnet CDM 转换为 OMOP CDM,以方便 COVID-19 数据集成。
J Biomed Inform. 2022 Mar;127:104002. doi: 10.1016/j.jbi.2022.104002. Epub 2022 Jan 22.
6
The OMOP common data model in Australian primary care data: Building a quality research ready harmonised dataset.澳大利亚初级保健数据中的 OMOP 通用数据模型:构建一个高质量、可用于研究的协调数据集。
PLoS One. 2024 Apr 18;19(4):e0301557. doi: 10.1371/journal.pone.0301557. eCollection 2024.
7
A Clinical Data Warehouse Based on OMOP and i2b2 for Austrian Health Claims Data.基于OMOP和i2b2的奥地利健康索赔数据临床数据仓库
Stud Health Technol Inform. 2018;248:94-99.
8
An evaluation of the THIN database in the OMOP Common Data Model for active drug safety surveillance.THIN 数据库在 OMOP 通用数据模型中用于主动药物安全性监测的评估。
Drug Saf. 2013 Feb;36(2):119-34. doi: 10.1007/s40264-012-0009-3.
9
Transforming French Electronic Health Records into the Observational Medical Outcome Partnership's Common Data Model: A Feasibility Study.将法国电子健康记录转化为观察性医疗结局伙伴关系的通用数据模型:一项可行性研究。
Appl Clin Inform. 2020 Jan;11(1):13-22. doi: 10.1055/s-0039-3402754. Epub 2020 Jan 8.
10
FHIR-Ontop-OMOP: Building clinical knowledge graphs in FHIR RDF with the OMOP Common data Model.FHIR-Ontop-OMOP:在 FHIR RDF 中使用 OMOP 通用数据模型构建临床知识图谱。
J Biomed Inform. 2022 Oct;134:104201. doi: 10.1016/j.jbi.2022.104201. Epub 2022 Sep 9.

引用本文的文献

1
Capturing Real-World Habitual Sleep Patterns With a Novel User-Centric Algorithm to Preprocess Fitbit Data in the All of Us Research Program: Retrospective Observational Longitudinal Study.在“我们所有人”研究项目中,使用一种新颖的以用户为中心的算法捕捉真实世界的习惯性睡眠模式,以预处理Fitbit数据:回顾性观察性纵向研究。
J Med Internet Res. 2025 Jul 28;27:e71718. doi: 10.2196/71718.
2
European advances in digital rheumatology: explainable insights and personalized digital health tools for psoriatic arthritis.欧洲数字风湿病学的进展:银屑病关节炎的可解释见解与个性化数字健康工具
EClinicalMedicine. 2025 May 17;84:103243. doi: 10.1016/j.eclinm.2025.103243. eCollection 2025 Jun.
3

本文引用的文献

1
Accrual to Clinical Trials (ACT): A Clinical and Translational Science Award Consortium Network.临床试验入组(ACT):临床与转化科学奖联盟网络
JAMIA Open. 2018 Oct;1(2):147-152. doi: 10.1093/jamiaopen/ooy033. Epub 2018 Aug 21.
2
The Precision Medicine Initiative's All of Us Research Program: an agenda for research on its ethical, legal, and social issues.精准医学倡议的“我们所有人”研究计划:关于其伦理、法律和社会问题的研究议程。
Genet Med. 2017 Jul;19(7):743-750. doi: 10.1038/gim.2016.183. Epub 2016 Dec 8.
3
Data interchange using i2b2.
Breaking Digital Health Barriers Through a Large Language Model-Based Tool for Automated Observational Medical Outcomes Partnership Mapping: Development and Validation Study.
通过基于大语言模型的自动观察性医学结果伙伴关系映射工具打破数字健康障碍:开发与验证研究
J Med Internet Res. 2025 May 15;27:e69004. doi: 10.2196/69004.
4
Twenty-Five Years of Evolution and Hurdles in Electronic Health Records and Interoperability in Medical Research: Comprehensive Review.电子健康记录在医学研究中的25年发展历程与障碍及互操作性:全面综述
J Med Internet Res. 2025 Jan 9;27:e59024. doi: 10.2196/59024.
5
Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine-based approach in diverse healthcare systems.在不同医疗保健系统中使用基于机器的方法对非酒精性脂肪性肝病患者进行快速识别和表型分析。
Clin Transl Sci. 2025 Jan;18(1):e70105. doi: 10.1111/cts.70105.
6
Factors associated with unfavorable outcomes in older patients with traumatic brain injury: analysis from the research program.老年创伤性脑损伤患者不良结局的相关因素:来自该研究项目的分析
Front Neurol. 2024 Nov 19;15:1452995. doi: 10.3389/fneur.2024.1452995. eCollection 2024.
7
Integrating genome-wide information and wearable device data to explore the link of anxiety and antidepressants with pulse rate variability.整合全基因组信息和可穿戴设备数据,以探索焦虑及抗抑郁药物与心率变异性之间的联系。
Mol Psychiatry. 2025 Jun;30(6):2309-2315. doi: 10.1038/s41380-024-02836-7. Epub 2024 Nov 18.
8
Multi-modality risk prediction of cardiovascular diseases for breast cancer cohort in the All of Us Research Program.“我们所有人”研究项目中乳腺癌队列心血管疾病的多模态风险预测
J Am Med Inform Assoc. 2024 Dec 1;31(12):2800-2810. doi: 10.1093/jamia/ocae199.
9
FAIR enough: Building an academic data ecosystem to make real-world data available for translational research.有道理:构建学术数据生态系统,以使真实世界数据可用于转化研究。
J Clin Transl Sci. 2024 May 2;8(1):e92. doi: 10.1017/cts.2024.530. eCollection 2024.
10
Centralized and Federated Models for the Analysis of Clinical Data.集中式和联邦式临床数据分析模型。
Annu Rev Biomed Data Sci. 2024 Aug;7(1):179-199. doi: 10.1146/annurev-biodatasci-122220-115746. Epub 2024 Jul 24.
使用i2b2进行数据交换。
J Am Med Inform Assoc. 2016 Sep;23(5):909-15. doi: 10.1093/jamia/ocv188. Epub 2016 Feb 5.
4
Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.观察性健康数据科学与信息学(OHDSI):观察性研究人员的机遇。
Stud Health Technol Inform. 2015;216:574-8.
5
SHRINE: enabling nationally scalable multi-site disease studies.SHRINE:实现全国范围内可扩展的多站点疾病研究。
PLoS One. 2013;8(3):e55811. doi: 10.1371/journal.pone.0055811. Epub 2013 Mar 7.
6
Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2).以整合生物学与床边护理的信息学服务企业及其他领域 (i2b2)。
J Am Med Inform Assoc. 2010 Mar-Apr;17(2):124-30. doi: 10.1136/jamia.2009.000893.