Population Health and Reproduction, UC Davis, Davis, CA 95616, USA.
University of Maryland Institute for Genome Sciences, University of Maryland School of Medicine, MD 21201, USA.
Gigascience. 2022 Nov 21;11. doi: 10.1093/gigascience/giac105.
The Common Fund Data Ecosystem (CFDE) has created a flexible system of data federation that enables researchers to discover datasets from across the US National Institutes of Health Common Fund without requiring that data owners move, reformat, or rehost those data. This system is centered on a catalog that integrates detailed descriptions of biomedical datasets from individual Common Fund Programs' Data Coordination Centers (DCCs) into a uniform metadata model that can then be indexed and searched from a centralized portal. This Crosscut Metadata Model (C2M2) supports the wide variety of data types and metadata terms used by individual DCCs and can readily describe nearly all forms of biomedical research data. We detail its use to ingest and index data from 11 DCCs.
共同基金数据生态系统(CFDE)创建了一个灵活的数据联合系统,使研究人员能够发现来自美国国立卫生研究院共同基金的数据集,而无需数据所有者移动、重新格式化或重新托管这些数据。该系统以目录为中心,将来自各个共同基金计划数据协调中心(DCC)的生物医学数据集的详细描述集成到一个统一的元数据模型中,然后可以从集中式门户对其进行索引和搜索。这个跨领域元数据模型(C2M2)支持各个 DCC 使用的各种数据类型和元数据术语,并且可以轻松描述几乎所有形式的生物医学研究数据。我们详细介绍了如何使用它来摄取和索引来自 11 个 DCC 的数据。