University of California, Berkeley, California, United States of America.
PLoS One. 2012;7(1):e29715. doi: 10.1371/journal.pone.0029715. Epub 2012 Jan 6.
Biodiversity data derive from myriad sources stored in various formats on many distinct hardware and software platforms. An essential step towards understanding global patterns of biodiversity is to provide a standardized view of these heterogeneous data sources to improve interoperability. Fundamental to this advance are definitions of common terms. This paper describes the evolution and development of Darwin Core, a data standard for publishing and integrating biodiversity information. We focus on the categories of terms that define the standard, differences between simple and relational Darwin Core, how the standard has been implemented, and the community processes that are essential for maintenance and growth of the standard. We present case-study extensions of the Darwin Core into new research communities, including metagenomics and genetic resources. We close by showing how Darwin Core records are integrated to create new knowledge products documenting species distributions and changes due to environmental perturbations.
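The distinction between simple and relational Darwin Core mentioned in the abstract can be illustrated with a minimal sketch. It assumes a flat CSV whose columns are named with Darwin Core terms (`occurrenceID`, `basisOfRecord`, `scientificName`, `eventDate`, `decimalLatitude`, `decimalLongitude`) and a second table of identifications (`identifiedBy`, `dateIdentified`) linked back to the occurrence. The sample values, the helper names `read_rows` and `attach_extension`, and the choice of `occurrenceID` as the join key are illustrative assumptions, not part of the paper or of any official tooling.

```python
import csv
import io

# Simple Darwin Core: one flat row per occurrence, columns named by Darwin Core terms.
# All values below are made-up sample data for illustration only.
SIMPLE_DWC = """occurrenceID,basisOfRecord,scientificName,eventDate,decimalLatitude,decimalLongitude
occ-1,PreservedSpecimen,Puma concolor,2009-05-14,37.87,-122.26
occ-2,HumanObservation,Puma concolor,2010-06-02,37.90,-122.30
"""

# Relational form (simplified): a separate table with many rows per occurrence,
# here joined on occurrenceID as an assumption made for this sketch.
IDENTIFICATIONS = """occurrenceID,identifiedBy,dateIdentified
occ-1,A. Example,2009-06-01
occ-1,B. Example,2011-03-15
"""


def read_rows(text):
    """Parse CSV text into a list of dicts keyed by the header terms."""
    return list(csv.DictReader(io.StringIO(text)))


def attach_extension(core_rows, ext_rows, key="occurrenceID"):
    """Group extension rows by key and attach them to the matching core row."""
    by_id = {}
    for row in ext_rows:
        by_id.setdefault(row[key], []).append(row)
    return [dict(core, identifications=by_id.get(core[key], [])) for core in core_rows]


records = attach_extension(read_rows(SIMPLE_DWC), read_rows(IDENTIFICATIONS))
```

In the flat form every occurrence is self-contained in one row, which is easy to publish and merge; the relational form allows one-to-many information, such as repeated identifications of one specimen, without duplicating the occurrence row.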