Biome Inc, Kyoto, Japan.
Department of Ocean Science, Hong Kong University of Science and Technology, Kowloon, Hong Kong.
Elife. 2024 Jun 20;13:RP93694. doi: 10.7554/eLife.93694.
Comprehensive biodiversity data is crucial for ecosystem protection. The mobile app, launched in Japan, efficiently gathers species observations from the public using species identification algorithms and gamification elements. The app has amassed >6 million observations since 2019. Nonetheless, community-sourced data may exhibit spatial and taxonomic biases. Species distribution models (SDMs) estimate species distribution while accommodating such bias. Here, we investigated the quality of data and its impact on SDM performance. Species identification accuracy exceeds 95% for birds, reptiles, mammals, and amphibians, but seed plants, molluscs, and fishes scored below 90%. Our SDMs for 132 terrestrial plants and animals across Japan revealed that incorporating data into traditional survey data improved accuracy. For endangered species, traditional survey data required >2000 records for accurate models (Boyce index ≥ 0.9), while blending the two data sources reduced this to around 300. The uniform coverage of urban-natural gradients by data, compared to traditional data biased towards natural areas, may explain this improvement. Combining multiple data sources better estimates species distributions, aiding in protected area designation and ecosystem service assessment. Establishing a platform for accumulating community-sourced distribution data will contribute to conserving and monitoring natural ecosystems.
综合生物多样性数据对生态系统保护至关重要。这款在日本推出的移动应用程序,利用物种识别算法和游戏化元素,有效地从公众那里收集物种观察数据。自 2019 年以来,该应用程序已经积累了超过 600 万次观察。然而,社区来源的数据可能存在空间和分类学上的偏差。物种分布模型(SDM)在考虑这种偏差的同时估计物种的分布。在这里,我们研究了数据的质量及其对 SDM 性能的影响。鸟类、爬行动物、哺乳动物和两栖动物的物种识别准确率超过 95%,但种子植物、软体动物和鱼类的准确率低于 90%。我们在日本对 132 种陆地动植物的 SDM 表明,将数据纳入传统调查数据可以提高准确性。对于濒危物种,传统调查数据需要 >2000 条记录才能获得准确的模型(Boyce 指数≥0.9),而混合两种数据源则将其降低到约 300 条。与传统数据偏向自然区域相比,数据在城市-自然梯度上的均匀覆盖可能解释了这种改进。结合多个数据源可以更好地估计物种的分布,有助于保护区的指定和生态系统服务的评估。建立一个积累社区来源的分布数据的平台将有助于保护和监测自然生态系统。