Department of Animal and Plant Sciences, University of Sheffield, Sheffield S10 2TN, UK.
Flanders Marine Institute (VLIZ), 8400 Ostend, Belgium.
Philos Trans R Soc Lond B Biol Sci. 2020 Dec 21;375(1814):20190445. doi: 10.1098/rstb.2019.0445. Epub 2020 Nov 2.
Recent decades have seen an explosion in the amount of data available on all aspects of biodiversity, which has led to data-driven approaches to understand how and why diversity varies in time and space. Global repositories facilitate access to various classes of species-level data including biogeography, genetics and conservation status, which are in turn required to study different dimensions of diversity. Ensuring that these different data sources are interoperable is a challenge as we aim to create synthetic data products to monitor the state of the world's biodiversity. One way to approach this is to link data of different classes, and to inventory the availability of data across multiple sources. Here, we use a comprehensive list of more than 200 000 marine animal species, and quantify the availability of data on geographical occurrences, genetic sequences, conservation assessments and DNA barcodes across all phyla and broad functional groups. This reveals a very uneven picture: 44% of species are represented by no record other than their taxonomy, but some species are rich in data. Although these data-rich species are concentrated into a few taxonomic and functional groups, especially vertebrates, data are spread widely across marine animals, with members of all 32 phyla represented in at least one database. By highlighting gaps in current knowledge, our census of marine diversity data helps to prioritize future data collection activities, as well as emphasizing the importance of ongoing sustained observations and archiving of existing data into global repositories. This article is part of the theme issue 'Integrative research perspectives on marine conservation'.
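The linking-and-inventory step described above can be sketched in a few lines. This is a minimal illustration, not the authors' pipeline: the species labels, source memberships, and the function names `inventory`, `coverage_counts` and `share_with_no_data` are all hypothetical, and real work would match on curated taxonomic identifiers rather than plain strings.

```python
from collections import Counter

# Hypothetical master list of species (placeholder labels, not real taxa).
species_list = ["Species A", "Species B", "Species C", "Species D", "Species E"]

# Hypothetical membership of each species in four data sources, named after
# the data classes listed in the abstract. The contents are invented.
sources = {
    "occurrences": {"Species A", "Species B", "Species C"},
    "sequences": {"Species A", "Species B"},
    "conservation": {"Species A"},
    "barcodes": {"Species A", "Species C"},
}


def inventory(species, sources):
    """Map each species to the sorted list of sources holding a record for it."""
    return {
        sp: sorted(name for name, members in sources.items() if sp in members)
        for sp in species
    }


def coverage_counts(availability):
    """Count how many species appear in 0, 1, 2, ... sources."""
    return dict(Counter(len(found) for found in availability.values()))


def share_with_no_data(availability):
    """Fraction of species that appear in none of the sources."""
    missing = sum(1 for found in availability.values() if not found)
    return missing / len(availability)


availability = inventory(species_list, sources)
print(coverage_counts(availability))
print(share_with_no_data(availability))
```

Keeping the master list separate from the sources is what makes the gaps visible: a species absent from every source still appears in the inventory with an empty list, which is how a figure such as the share of species known only from their taxonomy would be derived.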