物种共现模型的样本量考量

Sample size considerations for species co-occurrence models.

作者信息

Cowans Amber, Bigatà Albert Bonet, Sutherland Chris

机构信息

Centre for Research into Ecological and Environmental Modelling, School of Mathematics and Statistics, University of St Andrews, St Andrews, Scotland, UK.

School of Biological Sciences, University of Aberdeen, Aberdeen, Scotland, UK.

出版信息

Ecology. 2025 Aug;106(8):e70175. doi: 10.1002/ecy.70175.

DOI:10.1002/ecy.70175

PMID:40776423

Abstract

Multispecies occupancy models are widely applied to infer interactions in the occurrence of different species, but convergence and estimation issues under realistic sample sizes are common. We conducted a simulation study to evaluate the ability of a recently developed model to recover co-occurrence estimates under varying sample size and interaction scenarios while increasing model complexity in two dimensions: the number of interacting species and the number of covariates. Using both standard and penalized likelihood, we demonstrate that the ability to quantify interactions in species occupancy using this model is highly sensitive to sample size, detection probability, and interaction strength. In the simplest scenario, there is high bias in the interaction parameter (used for co-occurrence inference) with less than 100 sites at high detection, and 400-1000 sites at low detection, depending on interaction strength. Strong co-occurrence is detected consistently above 200 sites with high detection probabilities, but weak co-occurrence is never consistently detected even with 2980 sites. We demonstrate that the mean predictive ability of the co-occurrence model is less affected by sample size, with low bias in derived probabilities at 50 sites. Our results highlight that while occupancy patterns are often robust to sample size limitations, reliable inference about co-occurrence demands substantially larger datasets than many studies currently achieve. We caution the interpretation of model output in small datasets or when co-occurrence is expected to be weak, but show methods are suitable to quantify strong co-occurrence in larger datasets and generate predictions of site occupancy states.

摘要

多物种占有率模型被广泛应用于推断不同物种出现时的相互作用，但在实际样本量下的收敛和估计问题很常见。我们进行了一项模拟研究，以评估最近开发的一种模型在不同样本量和相互作用场景下恢复共现估计的能力，同时在两个维度上增加模型复杂性：相互作用物种的数量和协变量的数量。使用标准似然和惩罚似然，我们证明了使用该模型量化物种占有率中相互作用的能力对样本量、检测概率和相互作用强度高度敏感。在最简单的情况下，根据相互作用强度，在高检测率时少于100个位点，低检测率时400 - 1000个位点时，相互作用参数（用于共现推断）存在高偏差。在高检测概率下，始终能在200个以上位点检测到强共现，但即使有2980个位点，也从未始终检测到弱共现。我们证明了共现模型的平均预测能力受样本量影响较小，在50个位点时导出概率的偏差较低。我们的结果强调，虽然占有率模式通常对样本量限制具有鲁棒性，但关于共现的可靠推断需要比许多当前研究所达到的大得多的数据集。我们提醒在小数据集中或预期共现较弱时对模型输出的解释，但表明这些方法适用于量化较大数据集中的强共现并生成位点占有率状态的预测。