Division of Biostatistics and Bioinformatics, National Jewish Health, Denver, Colorado 80206-2761, USA.
J Am Med Inform Assoc. 2011 Dec;18 Suppl 1(Suppl 1):i96-102. doi: 10.1136/amiajnl-2011-000339. Epub 2011 Aug 19.
Many clinical research data integration platforms rely on the Entity-Attribute-Value model because of its flexibility, even though it presents problems in query formulation and execution time. The authors sought more balance in these traits.
Borrowing concepts from Entity-Attribute-Value and from enterprise data warehousing, the authors designed an alternative called the Dimensional Bus model and used it to integrate electronic medical record, sponsored study, and biorepository data. Each type of observational collection has its own table, and the structure of these tables varies to suit the source data. The observational tables are linked to the Bus, which holds provenance information and links to various classificatory dimensions that amplify the meaning of the data or facilitate its query and exposure management.
The authors implemented a Bus-based clinical research data repository with a query system that flexibly manages data access and confidentiality, facilitates catalog search, and readily formulates and compiles complex queries.
The design provides a workable way to manage and query mixed schemas in a data warehouse.
许多临床研究数据集成平台依赖于实体-属性-值模型,因为它具有灵活性,尽管在查询制定和执行时间方面存在问题。作者试图在这些特性之间取得更平衡的效果。
作者借鉴实体-属性-值和企业数据仓库的概念,设计了一种名为“维度总线”的替代模型,并将其用于整合电子病历、赞助研究和生物库数据。每种类型的观测数据集都有自己的表,这些表的结构因数据源而异。观测表与总线相连,总线保存着数据的出处信息,并链接到各种分类维度,这些维度可以放大数据的含义,或方便查询和暴露管理。
作者实现了一个基于总线的临床研究数据存储库,以及一个查询系统,该系统可以灵活地管理数据访问和保密性,方便目录搜索,并能轻松制定和编译复杂的查询。
该设计提供了一种可行的方法来管理和查询数据仓库中的混合模式。