German Biobank Node (GBN), Charité - Universitätsmedizin Berlin, Berlin, Germany; German Cancer Consortium (DKTK), DKFZ, Heidelberg, Germany; Charité University Hospital Berlin, Berlin, Germany.
German Cancer Consortium (DKTK), DKFZ, Heidelberg, Germany; Federated Information Systems, German Cancer Research Centre (DKFZ), Heidelberg, Germany; Complex Medical Informatics, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany; Mannheim Institute for Intelligent Systems in Medicine, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany.
Comput Biol Med. 2024 Sep;180:108941. doi: 10.1016/j.compbiomed.2024.108941. Epub 2024 Aug 5.
This study outlines the development of a highly interoperable federated IT infrastructure for academic biobanks located at the major university hospital sites across Germany. High-quality biosamples linked to clinical data, stored in biobanks are essential for biomedical research. We aimed to facilitate the findability of these biosamples and their associated data. Networks of biobanks provide access to even larger pools of samples and data even from rare diseases and small disease subgroups. The German Biobank Alliance (GBA) established in 2017 under the umbrella of the German Biobank Node (GBN), has taken on the mission of a federated data discovery service to make biosamples and associated data available to researchers across Germany and Europe.
In this context, we identified the requirements of researchers seeking human biosamples from biobanks and the needs of biobanks for data sovereignty over their samples and data in conjunction with the sample donor's consent. Based on this, we developed a highly interoperable federated IT infrastructure using standards such as Fast Healthcare Interoperability Resources (HL7 FHIR) and Clinical Quality Language (CQL).
The infrastructure comprises two major components enabling federated real-time access to biosample metadata, allowing privacy-compliant queries and subsequent project requests. It has been in use since 2019, connecting 16 German academic biobanks, with additional European biobanks joining. In production since 2019 it has run 4941 queries over the span of one year on more than 900,000 biosamples collected from more than 170,000 donors.
This infrastructure enhances the visibility and accessibility of biosamples for research, addressing the growing demand for human biosamples and associated data in research. It also underscores the need for improvements in processes beyond IT infrastructure, aiming to advance biomedical research and similar infrastructure development in other fields.
本研究概述了德国主要大学医院地点的学术生物库的高度互操作的联邦化 IT 基础设施的开发。高质量的生物样本与临床数据相关联,存储在生物库中,对于生物医学研究至关重要。我们旨在促进这些生物样本及其相关数据的可发现性。生物库网络提供了对更大的样本和数据的访问,甚至包括罕见疾病和小疾病亚组的数据。2017 年,在德国生物库节点(GBN)的保护伞下成立的德国生物库联盟(GBA)承担了联邦数据发现服务的任务,以便将生物样本和相关数据提供给德国和欧洲的研究人员。
在这种情况下,我们确定了研究人员从生物库中寻找人类生物样本的要求,以及生物库对其样本和数据的数据主权的需求,同时考虑到样本捐赠者的同意。在此基础上,我们使用 HL7 FHIR 和 CQL 等标准开发了一个高度互操作的联邦化 IT 基础设施。
该基础设施由两个主要组件组成,允许对生物样本元数据进行联邦实时访问,允许符合隐私要求的查询和随后的项目请求。自 2019 年以来,它已经连接了 16 个德国学术生物库,并增加了其他欧洲生物库。自 2019 年投入使用以来,在一年的时间内,它已经运行了 4941 次查询,涉及来自 17 万多名捐赠者的 90 多万个生物样本。
该基础设施提高了生物样本对研究的可见性和可及性,满足了研究中对人类生物样本和相关数据日益增长的需求。它还强调了需要改进超越 IT 基础设施的流程,旨在推进生物医学研究和其他领域类似的基础设施发展。