Computational Biology, Life & Medical Sciences (LIMES) Institute, University of Bonn, Bonn, Germany.
Developmental Biology of the Immune System, Life & Medical Sciences (LIMES) Institute, University of Bonn, Bonn, Germany.
Sci Data. 2024 May 22;11(1):524. doi: 10.1038/s41597-024-03349-2.
Datasets consist of measurement data and metadata. Metadata provides context, essential for understanding and (re-)using data. Various metadata standards exist for different methods, systems and contexts. However, relevant information resides at differing stages across the data-lifecycle. Often, this information is defined and standardized only at publication stage, which can lead to data loss and workload increase. In this study, we developed Metadatasheet, a metadata standard based on interviews with members of two biomedical consortia and systematic screening of data repositories. It aligns with the data-lifecycle allowing synchronous metadata recording within Microsoft Excel, a widespread data recording software. Additionally, we provide an implementation, the Metadata Workbook, that offers user-friendly features like automation, dynamic adaption, metadata integrity checks, and export options for various metadata standards. By design and due to its extensive documentation, the proposed metadata standard simplifies recording and structuring of metadata for biomedical scientists, promoting practicality and convenience in data management. This framework can accelerate scientific progress by enhancing collaboration and knowledge transfer throughout the intermediate steps of data creation.
数据集由测量数据和元数据组成。元数据提供了上下文信息,对于理解和(重新)使用数据至关重要。不同的元数据标准适用于不同的方法、系统和上下文。然而,相关信息在数据生命周期的不同阶段都有存在。通常,这些信息仅在发布阶段定义和标准化,这可能导致数据丢失和工作负载增加。在本研究中,我们开发了 Metadatasheet,这是一种基于对两个生物医学联盟成员的访谈和对数据存储库的系统筛选的元数据标准。它与数据生命周期保持一致,允许在 Microsoft Excel 中同步记录元数据,Microsoft Excel 是一种广泛使用的数据记录软件。此外,我们提供了一个实现,即 Metadata Workbook,它提供了用户友好的功能,如自动化、动态适应、元数据完整性检查,以及各种元数据标准的导出选项。由于其设计和广泛的文档,所提出的元数据标准简化了生物医学科学家记录和构建元数据的过程,提高了数据管理的实用性和便利性。通过在数据创建的中间步骤中加强协作和知识转移,该框架可以加速科学进步。