泛欧生物库数据协调在 ADOPT BBMRI-ERIC 中。

Pan-European Data Harmonization for Biobanks in ADOPT BBMRI-ERIC.

机构信息

Medical Centre for Information and Communication Technology, Universitätsklinikum Erlangen, Erlangen, Germany.

Chair of Medical Informatics, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany.

出版信息

Appl Clin Inform. 2019 Aug;10(4):679-692. doi: 10.1055/s-0039-1695793. Epub 2019 Sep 11.

DOI:10.1055/s-0039-1695793

PMID:31509880

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6739205/

Abstract

BACKGROUND

High-quality clinical data and biological specimens are key for medical research and personalized medicine. The Biobanking and Biomolecular Resources Research Infrastructure-European Research Infrastructure Consortium (BBMRI-ERIC) aims to facilitate access to such biological resources. The accompanying ADOPT BBMRI-ERIC project kick-started BBMRI-ERIC by collecting colorectal cancer data from European biobanks.

OBJECTIVES

To transform these data into a common representation, a uniform approach for data integration and harmonization had to be developed. This article describes the design and the implementation of a toolset for this task.

METHODS

Based on the semantics of a metadata repository, we developed a lexical bag-of-words matcher, capable of semiautomatically mapping local biobank terms to the central ADOPT BBMRI-ERIC terminology. Its algorithm supports fuzzy matching, utilization of synonyms, and sentiment tagging. To process the anonymized instance data based on these mappings, we also developed a data transformation application.

RESULTS

The implementation was used to process the data from 10 European biobanks. The lexical matcher automatically and correctly mapped 78.48% of the 1,492 local biobank terms, and human experts were able to complete the remaining mappings. We used the expert-curated mappings to successfully process 147,608 data records from 3,415 patients.

CONCLUSION

A generic harmonization approach was created and successfully used for cross-institutional data harmonization across 10 European biobanks. The software tools were made available as open source.

摘要

背景

高质量的临床数据和生物样本是医学研究和个性化医疗的关键。生物库和生物分子资源研究基础设施-欧洲研究基础设施联盟（BBMRI-ERIC）旨在促进对这些生物资源的访问。伴随而来的 ADOPT BBMRI-ERIC 项目通过从欧洲生物库中收集结直肠癌数据，启动了 BBMRI-ERIC。

目的

为了将这些数据转化为通用表示，必须开发一种统一的方法来进行数据集成和协调。本文描述了用于此任务的工具集的设计和实现。

方法

基于元数据存储库的语义，我们开发了一个词汇袋字匹配器，能够半自动地将本地生物库术语映射到中央 ADOPT BBMRI-ERIC 术语。其算法支持模糊匹配、同义词的利用和情感标记。为了基于这些映射处理匿名实例数据，我们还开发了一个数据转换应用程序。

结果

该实现用于处理来自 10 个欧洲生物库的数据。词汇匹配器自动且正确地映射了 1492 个本地生物库术语中的 78.48%，而人类专家能够完成其余的映射。我们使用专家策划的映射成功地处理了来自 3415 名患者的 147608 条数据记录。

结论

创建了一种通用的协调方法，并成功地用于跨 10 个欧洲生物库的机构间数据协调。软件工具作为开源提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20ee/6739205/b76a7a46106d/10-1055-s-0039-1695793-i190028ra-1.jpg

相似文献

Pan-European Data Harmonization for Biobanks in ADOPT BBMRI-ERIC.

Appl Clin Inform. 2019 Aug;10(4):679-692. doi: 10.1055/s-0039-1695793. Epub 2019 Sep 11.

Launch of an Infrastructure for Health Research: BBMRI-ERIC.

Biopreserv Biobank. 2018 Jun;16(3):233-241. doi: 10.1089/bio.2018.0027. Epub 2018 May 21.

The reconfiguration of biobanks in Europe under the BBMRI-ERIC framework: towards global sharing nodes?

Life Sci Soc Policy. 2020 Oct 1;16(1):9. doi: 10.1186/s40504-020-00105-3.

Infrastructuring European scientific integration: Heterogeneous meanings of the European biobanking infrastructure BBMRI-ERIC.

Soc Stud Sci. 2023 Aug;53(4):572-598. doi: 10.1177/03063127231162629. Epub 2023 Jun 12.

BBMRI-ERIC Negotiator: Implementing Efficient Access to Biobanks.

Biopreserv Biobank. 2021 Oct;19(5):414-421. doi: 10.1089/bio.2020.0144. Epub 2021 Jun 28.

BBMRI-ERIC: the novel gateway to biobanks. From humans to humans.

Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2016 Mar;59(3):379-84. doi: 10.1007/s00103-015-2301-8.

The landscape of biobanks in Poland-characteristics of Polish biobanking units at the beginning of BBMRI.pl organization.

J Transl Med. 2021 Jun 22;19(1):267. doi: 10.1186/s12967-021-02926-y.

Toward Global Biobank Integration by Implementation of the Minimum Information About BIobank Data Sharing (MIABIS 2.0 Core).

Biopreserv Biobank. 2016 Aug;14(4):298-306. doi: 10.1089/bio.2015.0070. Epub 2016 Mar 15.

Quality Management System in the BBMRI.pl Consortium: Status Before the Formation of the Polish Biobanking Network.

Biopreserv Biobank. 2019 Oct;17(5):401-409. doi: 10.1089/bio.2018.0127. Epub 2019 Apr 22.

[Biobanks European infrastructure].

Cas Lek Cesk. 2016;155(3):44-6.

引用本文的文献

A natural language processing approach to support biomedical data harmonization: Leveraging large language models.

PLoS One. 2025 Jul 24;20(7):e0328262. doi: 10.1371/journal.pone.0328262. eCollection 2025.

Data Management in Biobanking: Strategies, Challenges, and Future Directions.

BioTech (Basel). 2024 Sep 2;13(3):34. doi: 10.3390/biotech13030034.

Unlocking the potential of big data and AI in medicine: insights from biobanking.

Front Med (Lausanne). 2024 Jan 31;11:1336588. doi: 10.3389/fmed.2024.1336588. eCollection 2024.

The Future of Biobanking: What Is Next?

BioTech (Basel). 2020 Nov 23;9(4):23. doi: 10.3390/biotech9040023.

Understanding the Nature of Metadata: Systematic Review.

J Med Internet Res. 2022 Jan 11;24(1):e25440. doi: 10.2196/25440.

Guidelines for Biobanking of Bone Marrow Adipose Tissue and Related Cell Types: Report of the Biobanking Working Group of the International Bone Marrow Adiposity Society.

Front Endocrinol (Lausanne). 2021 Sep 27;12:744527. doi: 10.3389/fendo.2021.744527. eCollection 2021.

Sync for Genes: Making Clinical Genomics Available for Precision Medicine at the Point-of-Care.

Appl Clin Inform. 2020 Mar;11(2):295-302. doi: 10.1055/s-0040-1708051. Epub 2020 Apr 22.

本文引用的文献

MIRACUM: Medical Informatics in Research and Care in University Medicine.

Methods Inf Med. 2018 Jul;57(S 01):e82-e91. doi: 10.3414/ME17-02-0025. Epub 2018 Jul 17.

Individual genomes and personalized medicine: life diversity and complexity.

Per Med. 2010 Jul;7(4):347-350. doi: 10.2217/pme.10.30.

Launch of an Infrastructure for Health Research: BBMRI-ERIC.

Biopreserv Biobank. 2018 Jun;16(3):233-241. doi: 10.1089/bio.2018.0027. Epub 2018 May 21.

[Requirements for a cross-location biobank IT infrastructure : Survey of stakeholder input on the establishment of a biobank network of the German Biobank Alliance (GBA)].

Pathologe. 2018 Jul;39(4):289-296. doi: 10.1007/s00292-018-0435-9.

Can Diabetes Be Controlled by Lifestyle Activities?

Curr Res Diabetes Obes J. 2017 Mar;1(4). Epub 2017 Mar 20.

BiobankUniverse: automatic matchmaking between datasets for biobank data discovery and integration.

Bioinformatics. 2017 Nov 15;33(22):3627-3634. doi: 10.1093/bioinformatics/btx478.

Interactions Between Genetics, Lifestyle, and Environmental Factors for Healthcare.

Adv Exp Med Biol. 2017;1005:167-191. doi: 10.1007/978-981-10-5717-5_8.

From a Content Delivery Portal to a Knowledge Management System for Standardized Cancer Documentation.

Stud Health Technol Inform. 2017;243:180-184.

Proof-of-Concept Integration of Heterogeneous Biobank IT Infrastructures into a Hybrid Biobanking Network.

Stud Health Technol Inform. 2017;243:100-104.

A Decentralized IT Architecture for Locating and Negotiating Access to Biobank Samples.

Stud Health Technol Inform. 2017;243:75-79.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

泛欧生物库数据协调在 ADOPT BBMRI-ERIC 中。

Pan-European Data Harmonization for Biobanks in ADOPT BBMRI-ERIC.

机构信息

出版信息

BACKGROUND

OBJECTIVES

METHODS

RESULTS

CONCLUSION

背景

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献