Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
GitHub, San Francisco, CA, 94107, USA.
Sci Data. 2022 Nov 14;9(1):700. doi: 10.1038/s41597-022-01606-w.
Research can be more transparent and collaborative by using Findable, Accessible, Interoperable, and Reusable (FAIR) principles to publish Earth and environmental science data. Reporting formats (instructions, templates, and tools for consistently formatting data within a discipline) can help make data more accessible and reusable. However, the immense diversity of data types across Earth science disciplines makes developing and adopting such formats challenging. Here, we describe 11 community reporting formats for a diverse set of Earth science (meta)data including cross-domain metadata (dataset metadata, location metadata, sample metadata), file-formatting guidelines (file-level metadata, CSV files, terrestrial model data archiving), and domain-specific reporting formats for some biological, geochemical, and hydrological data (amplicon abundance tables, leaf-level gas exchange, soil respiration, water and sediment chemistry, sensor-based hydrologic measurements). More broadly, we provide guidelines that communities can use to create new (meta)data formats that integrate with their scientific workflows. Such reporting formats have the potential to accelerate scientific discovery and predictions by making it easier for data contributors to provide (meta)data that are more interoperable and reusable.