Huser Vojtech, Cimino James J
Laboratory for Informatics Development, NIH Clinical Center, Bethesda, MD.
AMIA Annu Symp Proc. 2013 Nov 16;2013:648-56. eCollection 2013.
Integrated data repositories (IDRs) are indispensable tools for numerous biomedical research studies. We compare three large IDRs, namely Informatics for Integrating Biology and the Bedside (i2b2), the HMO Research Network's Virtual Data Warehouse (VDW), and the Observational Medical Outcomes Partnership (OMOP) repository, to identify common architectural features that enable efficient storage and organization of large amounts of clinical data. We define three high-level classes of underlying data storage models and analyze each repository using this classification. We examine how a set of sample facts is represented in each repository and conclude with a list of desiderata for IDRs covering the information storage model, the terminology model, data integration, and value-set management.