Aubry Keith B, Raley Catherine M, McKelvey Kevin S
United States Department of Agriculture, Forest Service, Pacific Northwest Research Station, Olympia, Washington, United States of America.
United States Department of Agriculture, Forest Service, Rocky Mountain Research Station, Missoula, Montana, United States of America.
PLoS One. 2017 Jun 22;12(6):e0179152. doi: 10.1371/journal.pone.0179152. eCollection 2017.
The availability of spatially referenced environmental data and species occurrence records in online databases enable practitioners to easily generate species distribution models (SDMs) for a broad array of taxa. Such databases often include occurrence records of unknown reliability, yet little information is available on the influence of data quality on SDMs generated for rare, elusive, and cryptic species that are prone to misidentification in the field. We investigated this question for the fisher (Pekania pennanti), a forest carnivore of conservation concern in the Pacific States that is often confused with the more common Pacific marten (Martes caurina). Fisher occurrence records supported by physical evidence (verifiable records) were available from a limited area, whereas occurrence records of unknown quality (unscreened records) were available from throughout the fisher's historical range. We reserved 20% of the verifiable records to use as a test sample for both models and generated SDMs with each dataset using Maxent. The verifiable model performed substantially better than the unscreened model based on multiple metrics including AUCtest values (0.78 and 0.62, respectively), evaluation of training and test gains, and statistical tests of how well each model predicted test localities. In addition, the verifiable model was consistent with our knowledge of the fisher's habitat relations and potential distribution, whereas the unscreened model indicated a much broader area of high-quality habitat (indices > 0.5) that included large expanses of high-elevation habitat that fishers do not occupy. Because Pacific martens remain relatively common in upper elevation habitats in the Cascade Range and Sierra Nevada, the SDM based on unscreened records likely reflects primarily a conflation of marten and fisher habitat. Consequently, accurate identifications are far more important than the spatial extent of occurrence records for generating reliable SDMs for the fisher in this region. We strongly recommend that practitioners avoid using anecdotal occurrence records to build SDMs but, if such data are used, the validity of resulting models should be tested with verifiable occurrence records.
在线数据库中空间参考环境数据和物种出现记录的可用性,使从业者能够轻松地为各种各样的分类群生成物种分布模型(SDM)。这类数据库通常包含可靠性未知的出现记录,然而,关于数据质量对为在野外容易被误识别的珍稀、难以捉摸和隐秘物种生成的SDM的影响,几乎没有相关信息。我们针对渔貂(Pekania pennanti)研究了这个问题,渔貂是太平洋沿岸各州受保护关注的一种森林食肉动物,常与更常见的太平洋貂(Martes caurina)混淆。有实物证据支持的渔貂出现记录(可核实记录)仅来自有限区域,而质量未知的出现记录(未筛选记录)则来自渔貂整个历史分布范围。我们预留了20%的可核实记录用作两个模型的测试样本,并使用Maxent为每个数据集生成SDM。基于多个指标,包括AUCtest值(分别为0.78和0.62)、训练和测试增益评估以及每个模型对测试地点预测效果的统计检验,可核实模型的表现明显优于未筛选模型。此外,可核实模型与我们对渔貂栖息地关系和潜在分布的了解一致,而未筛选模型显示出更大面积的高质量栖息地(指数>0.5),其中包括大片渔貂并未占据的高海拔栖息地。由于太平洋貂在喀斯喀特山脉和内华达山脉的高海拔栖息地仍然相对常见,基于未筛选记录的SDM可能主要反映了貂和渔貂栖息地的混淆。因此,对于在该地区为渔貂生成可靠的SDM而言,准确识别远比出现记录的空间范围重要得多。我们强烈建议从业者避免使用传闻的出现记录来构建SDM,但如果使用了此类数据,则应使用可核实的出现记录来检验所得模型的有效性。