Centre for Mathematics and its Applications, The Australian National University, Canberra, Australia.
PLoS One. 2013;8(1):e52015. doi: 10.1371/journal.pone.0052015. Epub 2013 Jan 10.
We show that occupancy models are more difficult to fit than is generally appreciated because the estimating equations often have multiple solutions, including boundary estimates which produce fitted probabilities of zero or one. The estimates are unstable when the data are sparse, making them difficult to interpret, and, even in ideal situations, highly variable. As a consequence, making accurate inference is difficult. When abundance varies over sites (which is the general rule in ecology because we expect spatial variance in abundance) and detection depends on abundance, the standard analysis suffers bias (attenuation in detection, biased estimates of occupancy and potentially finding misleading relationships between occupancy and other covariates), asymmetric sampling distributions, and slow convergence of the sampling distributions to normality. The key result of this paper is that the biases are of similar magnitude to those obtained when we ignore non-detection entirely. The fact that abundance is subject to detection error and hence is not directly observable, means that we cannot tell when bias is present (or, equivalently, how large it is) and we cannot adjust for it. This implies that we cannot tell which fit is better: the fit from the occupancy model or the fit ignoring the possibility of detection error. Therefore trying to adjust occupancy models for non-detection can be as misleading as ignoring non-detection completely. Ignoring non-detection can actually be better than trying to adjust for it.
我们表明,占据模型比通常认为的更难拟合,因为估计方程通常有多个解,包括产生拟合概率为零或一的边界估计。当数据稀疏时,估计是不稳定的,这使得它们难以解释,即使在理想情况下,也高度可变。因此,进行准确的推断是困难的。当丰度在地点之间变化(这在生态学中是普遍规律,因为我们预计丰度会有空间变化)并且检测取决于丰度时,标准分析会受到偏差(检测衰减、占据的偏差估计以及可能发现占据与其他协变量之间存在误导性关系)、不对称的采样分布以及采样分布向正态性的收敛速度慢的影响。本文的关键结果是,这些偏差与我们完全忽略未检测到的情况时获得的偏差具有相似的大小。丰度受到检测误差的影响,因此不能直接观察到,这意味着我们无法判断是否存在偏差(或者,等效地,偏差有多大),也无法对其进行调整。这意味着我们无法判断哪个拟合更好:是来自占据模型的拟合还是忽略检测误差的拟合。因此,试图为未检测到的情况调整占据模型可能会像完全忽略未检测到的情况一样具有误导性。忽略未检测到的情况实际上可能比试图调整更好。