Kock-Schoppenhauer Ann-Kristin, Kamann Christian, Ulrich Hannes, Duhm-Harbeck Petra, Ingenerf Josef
IT for Clinical Research, Lübeck (ITCR-L), University of Lübeck, Germany.
Stud Health Technol Inform. 2017;235:131-135.
Clinical care and research data are widely dispersed in isolated systems based on heterogeneous data models. Biomedicine predominantly makes use of connected datasets based on the Semantic Web paradigm. Initiatives like Bio2RDF created Resource Description Framework (RDF) versions of Omics resources, enabling sophisticated Linked Data applications. In contrast, electronic healthcare records (EHR) data are generated and processed in diverse clinical subsystems within hospital information systems (HIS). Usually, each of them utilizes a relational database system with a different proprietary schema. Semantic integration and access to the data is hardly possible. This paper describes ways of using Ontology Based Data Access (OBDA) for bridging the semantic gap between existing raw data and user-oriented views supported by ontology-based queries. Based on mappings between entities of data schemas and ontologies data can be made available as materialized or virtualized RDF triples ready for querying and processing. Our experiments based on CentraXX for biobank and study management demonstrate the advantages of abstracting away from low level details and semantic mediation. Furthermore, it becomes clear that using a professional platform for Linked Data applications is recommended due to the inherent complexity, the inconvenience to confront end users with SPARQL, and scalability and performance issues.
临床护理和研究数据广泛分散在基于异构数据模型的孤立系统中。生物医学主要利用基于语义网范式的关联数据集。像Bio2RDF这样的项目创建了组学资源的资源描述框架(RDF)版本,支持复杂的关联数据应用程序。相比之下,电子健康记录(EHR)数据是在医院信息系统(HIS)内的不同临床子系统中生成和处理的。通常,每个子系统都使用具有不同专有模式的关系数据库系统。语义集成和数据访问几乎是不可能的。本文描述了使用基于本体的数据访问(OBDA)来弥合现有原始数据与基于本体查询支持的面向用户视图之间语义鸿沟的方法。基于数据模式实体与本体之间的映射,数据可以作为物化或虚拟化的RDF三元组提供,以便进行查询和处理。我们基于CentraXX进行生物样本库和研究管理的实验证明了从低级细节和语义中介中抽象出来的优势。此外,由于其固有的复杂性、让终端用户面对SPARQL的不便性以及可扩展性和性能问题,很明显建议使用专业平台进行关联数据应用程序开发。