Lortie C J, Vargas Poulsen Camila, Brun Julien, Kui Li
National Center for Ecological Analysis and Synthesis, UCSB, Santa Barbara, California, USA.
Department of Biology, York University, Toronto, Ontario, Canada.
Ecol Evol. 2022 Aug 25;12(8):e9245. doi: 10.1002/ece3.9245. eCollection 2022 Aug.
Data support knowledge development and theory advances in ecology and evolution. We are increasingly reusing data within our teams and projects and through the global, openly archived datasets of others. Metadata can be challenging to write and interpret, but it is always crucial for reuse. The value of metadata cannot be overstated, even as a relatively independent research object, because it describes the work that has been done in a structured format. We advance a new perspective and classify methods for metadata curation and development with tables. Tables with templates can be effectively used to capture all components of an experiment or project in a single, easy-to-read file familiar to most scientists. If coupled with the R programming language, metadata from tables can then be rapidly and reproducibly converted to publication formats, including Extensible Markup Language (XML) files suitable for data repositories. Tables can also be used to summarize existing metadata and store metadata across many datasets. A case study is provided, and the added benefits of tables for metadata, a priori, are developed to ensure a more streamlined publishing process for many data repositories used in ecology, evolution, and the environmental sciences. In ecology and evolution, researchers are often highly tabular thinkers from experimental data collection in the lab and/or field, and representations of metadata as a table will provide novel research and reuse insights.