Suppr超能文献

通过基于本体的数据访问在临床研究中的关联数据应用。

Linked Data Applications Through Ontology Based Data Access in Clinical Research.

作者信息

Kock-Schoppenhauer Ann-Kristin, Kamann Christian, Ulrich Hannes, Duhm-Harbeck Petra, Ingenerf Josef

机构信息

IT for Clinical Research, Lübeck (ITCR-L), University of Lübeck, Germany.

出版信息

Stud Health Technol Inform. 2017;235:131-135.

Abstract

Clinical care and research data are widely dispersed in isolated systems based on heterogeneous data models. Biomedicine predominantly makes use of connected datasets based on the Semantic Web paradigm. Initiatives like Bio2RDF created Resource Description Framework (RDF) versions of Omics resources, enabling sophisticated Linked Data applications. In contrast, electronic healthcare records (EHR) data are generated and processed in diverse clinical subsystems within hospital information systems (HIS). Usually, each of them utilizes a relational database system with a different proprietary schema. Semantic integration and access to the data is hardly possible. This paper describes ways of using Ontology Based Data Access (OBDA) for bridging the semantic gap between existing raw data and user-oriented views supported by ontology-based queries. Based on mappings between entities of data schemas and ontologies data can be made available as materialized or virtualized RDF triples ready for querying and processing. Our experiments based on CentraXX for biobank and study management demonstrate the advantages of abstracting away from low level details and semantic mediation. Furthermore, it becomes clear that using a professional platform for Linked Data applications is recommended due to the inherent complexity, the inconvenience to confront end users with SPARQL, and scalability and performance issues.

摘要

临床护理和研究数据广泛分散在基于异构数据模型的孤立系统中。生物医学主要利用基于语义网范式的关联数据集。像Bio2RDF这样的项目创建了组学资源的资源描述框架(RDF)版本,支持复杂的关联数据应用程序。相比之下,电子健康记录(EHR)数据是在医院信息系统(HIS)内的不同临床子系统中生成和处理的。通常,每个子系统都使用具有不同专有模式的关系数据库系统。语义集成和数据访问几乎是不可能的。本文描述了使用基于本体的数据访问(OBDA)来弥合现有原始数据与基于本体查询支持的面向用户视图之间语义鸿沟的方法。基于数据模式实体与本体之间的映射,数据可以作为物化或虚拟化的RDF三元组提供,以便进行查询和处理。我们基于CentraXX进行生物样本库和研究管理的实验证明了从低级细节和语义中介中抽象出来的优势。此外,由于其固有的复杂性、让终端用户面对SPARQL的不便性以及可扩展性和性能问题,很明显建议使用专业平台进行关联数据应用程序开发。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验