Regenstrief Institute, Indiana University, School of Medicine, Indianapolis, Indiana, USA.
J Am Med Inform Assoc. 2012 Jan-Feb;19(1):54-60. doi: 10.1136/amiajnl-2011-000376. Epub 2011 Oct 28.
Systematic analysis of observational medical databases for active safety surveillance is hindered by the variation in data models and coding systems. Data analysts often find robust clinical data models difficult to understand and ill suited to support their analytic approaches. Further, some models do not facilitate the computations required for systematic analysis across many interventions and outcomes for large datasets. Translating the data from these idiosyncratic data models to a common data model (CDM) could facilitate both the analysts' understanding and the suitability for large-scale systematic analysis. In addition to facilitating analysis, a suitable CDM has to faithfully represent the source observational database. Before beginning to use the Observational Medical Outcomes Partnership (OMOP) CDM and a related dictionary of standardized terminologies for a study of large-scale systematic active safety surveillance, the authors validated the model's suitability for this use by example.
To validate the OMOP CDM, the model was instantiated into a relational database, data from 10 different observational healthcare databases were loaded into separate instances, a comprehensive array of analytic methods that operate on the data model was created, and these methods were executed against the databases to measure performance.
There was acceptable representation of the data from 10 observational databases in the OMOP CDM using the standardized terminologies selected, and a range of analytic methods was developed and executed with sufficient performance to be useful for active safety surveillance.
由于数据模型和编码系统的差异,对观察性医学数据库进行主动安全性监测的系统分析受到阻碍。数据分析人员经常发现强大的临床数据模型难以理解,不适合支持他们的分析方法。此外,某些模型不便于为大型数据集的许多干预措施和结果进行系统分析所需的计算。将这些特殊数据模型中的数据转换为通用数据模型(CDM)可以促进分析师的理解和大规模系统分析的适用性。除了便于分析外,合适的 CDM 还必须忠实地表示源观察数据库。在开始使用观察性医学结局伙伴关系(OMOP)CDM 和相关的标准化术语字典进行大规模系统主动安全性监测研究之前,作者通过示例验证了该模型在此用途上的适用性。
为了验证 OMOP CDM,将模型实例化为关系数据库,将来自 10 个不同观察性医疗保健数据库的数据加载到单独的实例中,创建了一套全面的针对数据模型的分析方法,并针对数据库执行这些方法以衡量性能。
使用选定的标准化术语,在 OMOP CDM 中可以接受地表示来自 10 个观察性数据库的数据,并且已经开发和执行了一系列分析方法,其性能足以用于主动安全性监测。