Brandt Cynthia A, Morse Richard, Matthews Keri, Sun Kexin, Deshpande Aniruddha M, Gadagkar Rohit, Cohen Dorothy B, Miller Perry L, Nadkarni Prakash M
Center for Medical Informatics, Yale University School of Medicine, P.O. Box 208009, New Haven, CT 06520-8009, USA.
Int J Med Inform. 2002 Nov 12;65(3):225-41. doi: 10.1016/s1386-5056(02)00047-3.
Generic clinical study data management systems can record data on an arbitrary number of parameters in an arbitrary number of clinical studies without requiring modification of the database schema. They achieve this by using an Entity-Attribute-Value (EAV) model for clinical data. While very flexible for creating transaction-oriented systems for data entry and browsing of individual forms, EAV-modeled data is unsuitable for direct analytical processing, which is the focus of data marts. For this purpose, such data must be extracted and restructured appropriately. This paper describes how such a process, which is non-trivial and highly error prone if performed using non-systematic approaches, can be automated by judicious use of the study metadata-the descriptions of measured parameters and their higher-level grouping. The metadata, in addition to driving the process, is exported along with the data, in order to facilitate its human interpretation.
通用临床研究数据管理系统可以在任意数量的临床研究中记录任意数量参数的数据,而无需修改数据库模式。它们通过使用临床数据的实体-属性-值(EAV)模型来实现这一点。虽然EAV模型对于创建面向事务的数据输入和单个表单浏览系统非常灵活,但以EAV模型建模的数据不适合直接进行分析处理,而分析处理是数据集市的重点。为此,必须对这些数据进行适当的提取和重组。本文描述了这样一个过程,如果使用非系统方法执行,这个过程既不平凡又极易出错,如何通过明智地使用研究元数据(即对测量参数及其高级分组的描述)来实现自动化。除了驱动这个过程外,元数据还与数据一起导出,以便于人工解读。