Fette Georg, Kaspar Mathias, Dietrich Georg, Ertl Maximilian, Krebs Jonathan, Stoerk Stefan, Puppe Frank
Würzburg University, Chair of Computer Science 6.
University Hospital of Würzburg, Comprehensive Heart Failure Center.
Stud Health Technol Inform. 2017;243:90-94.
In recent years, clinical data warehouses (CDWs) that store routine patient data have become increasingly popular as a basis for scientific work in the medical domain. Although CDW systems provide interfaces for importing new data, these interfaces must be driven by processing tools that are often not part of the systems themselves. To establish an extraction-transformation-load (ETL) workflow, existing components have to be reused or new ones developed to perform the load step. We present a customizable importer for the two CDW systems PaDaWaN and I2B2 that can import the most common formats (plain text, CSV and XML files). To run, the importer needs only a configuration file containing the user credentials for the target CDW and a list of XML import configuration files, which determine how previously exported data is intended to be imported. The importer is provided as a Java program and has no further software requirements.
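As an illustration of the kind of startup input described above, the following minimal Java sketch reads user credentials and a list of XML import configuration files from a properties-style text. The abstract does not specify the actual configuration format, so the class name `ImporterConfigSketch`, the keys `cdw.user`, `cdw.password` and `import.configs`, and the file names are hypothetical and are not taken from the importer itself.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical sketch: not the importer's real configuration format.
public class ImporterConfigSketch {

    /** Holder for the two inputs the abstract mentions: credentials and import configs. */
    public static final class Settings {
        public final String user;
        public final String password;
        public final List<String> importConfigFiles;

        Settings(String user, String password, List<String> importConfigFiles) {
            this.user = user;
            this.password = password;
            this.importConfigFiles = importConfigFiles;
        }
    }

    /** Parses properties-style text using the assumed keys described in the lead-in. */
    public static Settings parse(String propertiesText) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(propertiesText));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }

        String user = props.getProperty("cdw.user");
        String password = props.getProperty("cdw.password");
        if (user == null || password == null) {
            throw new IllegalArgumentException("missing CDW credentials");
        }

        // Comma-separated list of XML import configuration files (assumed layout).
        List<String> files = new ArrayList<>();
        for (String entry : props.getProperty("import.configs", "").split(",")) {
            String trimmed = entry.trim();
            if (!trimmed.isEmpty()) {
                files.add(trimmed);
            }
        }
        return new Settings(user, password, files);
    }

    public static void main(String[] args) {
        String example = "cdw.user=demo\n"
                + "cdw.password=secret\n"
                + "import.configs=labs.xml, diagnoses.xml\n";
        Settings s = parse(example);
        System.out.println(s.user + " -> " + s.importConfigFiles);
    }
}
```

Keeping the credentials separate from the per-source import configurations mirrors the split described in the abstract: one file identifies the target CDW, while each XML file would describe how one exported data set is to be loaded.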