Mate Sebastian, Vormstein Patric, Kadioglu Dennis, Majeed Raphael W, Lablans Martin, Prokosch Hans-Ulrich, Storf Holger
Medical Informatics, Univ. of Erlangen-Nürnberg, Erlangen, Germany.
Medical Informatics Group, University Hospital Frankfurt, Frankfurt, Germany; German Cancer Consortium (DKTK), partner site Frankfurt; and German Cancer Research Center (DKFZ), Heidelberg, Germany.
Stud Health Technol Inform. 2017;243:42-46.
Information retrieval is a major challenge in medical informatics. Various research projects have worked on this task in recent years on an institutional level by developing tools to integrate and retrieve information. However, when it comes down to querying such data across institutions, the challenge persists due to the high heterogeneity of data and differences in software systems. The German Biobank Node (GBN) project faced this challenge when trying to interconnect four biobanks to enable distributed queries for biospecimens. All biobanks had already established integrated data repositories, and some of them were already part of research networks. Instead of developing another software platform, GBN decided to form a bridge between these. This paper describes and discusses a core component from the GBN project, the OmniQuery library, which was implemented to enable on-the-fly query translation between heterogeneous research infrastructures.
信息检索是医学信息学中的一项重大挑战。近年来,各个研究项目在机构层面致力于这项任务,通过开发工具来整合和检索信息。然而,当涉及跨机构查询此类数据时,由于数据的高度异质性和软件系统的差异,挑战依然存在。德国生物样本库节点(GBN)项目在试图将四个生物样本库互连以实现生物样本的分布式查询时就面临了这一挑战。所有生物样本库都已经建立了集成数据存储库,其中一些已经是研究网络的一部分。GBN没有开发另一个软件平台,而是决定在这些存储库之间搭建一座桥梁。本文描述并讨论了GBN项目的一个核心组件——OmniQuery库,该库的实现是为了在异构研究基础设施之间实现即时查询转换。