OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States.
Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands.
J Am Med Inform Assoc. 2023 Dec 22;31(1):209-219. doi: 10.1093/jamia/ocad214.
Health data standardized to a common data model (CDM) simplifies and facilitates research. This study examines the factors that make standardizing observational health data to the Observational Medical Outcomes Partnership (OMOP) CDM successful.
Twenty-five data partners (DPs) from 11 countries received funding from the European Health Data Evidence Network (EHDEN) to standardize their data. Three surveys, DataQualityDashboard results, and statistics from the conversion process were analyzed qualitatively and quantitatively. Our measures of success were the total number of days to transform source data into the OMOP CDM and participation in network research.
The health data converted to CDM represented more than 133 million patients. 100%, 88%, and 84% of DPs took Surveys 1, 2, and 3. The median duration of the 6 key extract, transform, and load (ETL) processes ranged from 4 to 115 days. Of the 25 DPs, 21 DPs were considered applicable for analysis of which 52% standardized their data on time, and 48% participated in an international collaborative study.
This study shows that the consistent workflow used by EHDEN proves appropriate to support the successful standardization of observational data across Europe. Over the 25 successful transformations, we confirmed that getting the right people for the ETL is critical and vocabulary mapping requires specific expertise and support of tools. Additionally, we learned that teams that proactively prepared for data governance issues were able to avoid considerable delays improving their ability to finish on time.
This study provides guidance for future DPs to standardize to the OMOP CDM and participate in distributed networks. We demonstrate that the Observational Health Data Sciences and Informatics community must continue to evaluate and provide guidance and support for what ultimately develops the backbone of how community members generate evidence.
将健康数据标准化到通用数据模型 (CDM) 可简化并促进研究。本研究探讨了使观察性健康数据标准化到观察性医学结局伙伴关系 (OMOP) CDM 成功的因素。
来自 11 个国家的 25 个数据合作伙伴 (DP) 从欧洲健康数据证据网络 (EHDEN) 获得资金,以标准化其数据。对三份调查、DataQualityDashboard 结果和转换过程中的统计数据进行了定性和定量分析。我们的成功衡量标准是将源数据转换为 OMOP CDM 所需的总天数和参与网络研究的情况。
转换为 CDM 的健康数据代表了超过 1.33 亿名患者。100%、88%和 84%的 DP 完成了调查 1、2 和 3。6 个关键提取、转换和加载 (ETL) 过程的中位数持续时间从 4 天到 115 天不等。在 25 个 DP 中,有 21 个 DP 被认为可用于分析,其中 52%按时标准化了数据,48%参与了国际合作研究。
本研究表明,EHDEN 使用的一致工作流程适合支持整个欧洲的观察数据的成功标准化。在 25 次成功的转换中,我们确认为 ETL 找到合适的人员至关重要,词汇映射需要特定的专业知识和工具支持。此外,我们了解到,积极为数据治理问题做准备的团队能够避免相当大的延迟,从而提高按时完成的能力。
本研究为未来的 DP 提供了标准化到 OMOP CDM 和参与分布式网络的指导。我们证明,观察性健康数据科学和信息学社区必须继续评估并为最终发展社区成员生成证据的骨干提供指导和支持。