School of Natural Resources, University of Missouri, Columbia, Missouri, 65211, USA.
Department of Fisheries, Wildlife and Conservation Biology, University of Minnesota, St. Paul, Minnesota, 55108, USA.
Ecol Appl. 2021 Apr;31(3):e2258. doi: 10.1002/eap.2258. Epub 2021 Feb 2.
Integrated population models (IPMs) are widely used to combine disparate data sets in joint analysis to better understand population dynamics and provide guidance for conservation activities. An often-cited assumption of IPMs is independence among component data sets within the combined likelihood. Dependency among data sets should lead to underestimation of variance and bias because individuals contribute data to more than one data set. In practice, studied individuals often occur in multiple data sets in IPMs (i.e., overlap), which is one way for the independence assumption to be violated. Such cases have the potential to dissuade practitioners and limit application of IPMs to solve emerging ecological problems. We assessed precision and bias of demographic rates estimated from IPMs using a complete gradient (0-100%) of overlap among data sets, wide ranges in demographic rates (e.g., survival 0.1-0.8) and sample sizes (100-1,200 individuals) and variable data sources. We compared results from our simulations with those from IPMs constructed using empirical data on tree swallows (Tachycineta bicolor) where data sets either had complete overlap or included different individuals. Contrary to previous investigators, we found no substantive bias or uncertainty in any demographic rate from IPMs derived from data sets with complete overlap. While variability in demographic rates was greater at low sample sizes (i.e., low capture, recapture, and survey probabilities), there were negligible differences in the posterior mean or root mean square error of demographic rates among IPMs with strong dependence vs. complete independence among data sets. Our simulations suggest IPMs can be designed using only capture-recapture data or harvest and capture-recovery data where population estimates are obtained from the same data as survival and productivity data. While we encourage researchers to carefully consider the modeling approach best suited for their data sets, our results suggest that dependence among data sets does not generally compromise IPM estimates. Thus, violation of the independence assumption should not dissuade researchers from the application of IPMs in ecological research.
综合种群模型(IPM)被广泛用于联合分析中,以组合不同的数据组,从而更好地了解种群动态,并为保护活动提供指导。IPM 的一个常被引用的假设是组合似然中各个数据组之间的独立性。如果数据组之间存在依赖性,那么由于个体向多个数据组提供数据,方差和偏差的估计就会出现低估,因为个体向多个数据组提供数据。在实践中,研究个体经常在 IPM 中出现在多个数据组中(即重叠),这是违反独立性假设的一种方式。这种情况有可能会阻止从业者,并限制 IPM 的应用,以解决新出现的生态问题。我们评估了在完全重叠(0-100%)的数据集之间,使用广泛的生存(0.1-0.8)和样本量(100-1200 个个体)和不同数据源的综合种群模型(IPM)中估计的种群动态率的精度和偏差。我们将模拟结果与基于树燕(Tachycineta bicolor)的经验数据构建的 IPM 结果进行了比较,这些数据组要么完全重叠,要么包含不同的个体。与之前的研究人员不同,我们发现从完全重叠的数据集得出的 IPM 中,没有任何种群动态率存在实质性的偏差或不确定性。虽然在低样本量(即低捕获、重捕和调查概率)下,种群动态率的变异性更大,但在数据组之间具有强依赖性与完全独立性的 IPM 中,种群动态率的后验均值或均方根误差几乎没有差异。我们的模拟表明,仅使用捕获-再捕获数据或收获和捕获-恢复数据设计 IPM 是可行的,前提是通过与生存和生产力数据相同的数据获得种群估计。虽然我们鼓励研究人员仔细考虑最适合其数据组的建模方法,但我们的结果表明,数据组之间的依赖性通常不会损害 IPM 估计。因此,违反独立性假设不应阻止研究人员在生态研究中应用 IPM。