Blackwell Paul G, Matthiopoulos Jason
School of Mathematics and Statistics, University of Sheffield, Sheffield, UK.
School of Biodiversity, One Health and Veterinary Medicine, University of Glasgow, Glasgow, UK.
Ecology. 2024 Dec;105(12):e4457. doi: 10.1002/ecy.4457. Epub 2024 Oct 30.
Data integration, the joint statistical analysis of data from different observation platforms, is pivotal for data-hungry disciplines such as spatial ecology. Pooled data types obtained from the same underlying process, analyzed jointly, can improve both precision and accuracy in models of species distributions and species-habitat associations. However, the integration of telemetry and spatial survey data has proved elusive because of the fundamentally different analytical approaches required by these two data types. Here, "spatial survey" denotes a survey that records spatial locations and has no temporal structure, for example, line or point transects but not capture-recapture or telemetry. Step selection functions (SSFs-the canonical framework for telemetry) and habitat selection functions (HSFs-the default approach to spatial surveys) differ in not only their specifications but also their results. By imposing the constraint that microscopic mechanisms (animal movement) must correctly scale up to macroscopic emergence (population distributions), a relationship can be written between SSFs and HSFs, leading to a joint likelihood using both datasets. We implement this approach using maximum likelihood, explore its estimation precision by systematic simulation, and seek to derive broad guidelines for effort allocation in the field. We find that complementarities in spatial coverage and resolution between telemetry and survey data often lead to marked inferential improvements in joint analyses over those using either data type alone. The optimal allocation of effort between the two methods (or the choice between them, if only one can be selected) depends on the overall effort expended and the pattern of environmental heterogeneity. Examining inferential performance over a broad range of scenarios for the relative cost between the two methods, we find that integrated analysis usually offers higher precision. Our methodological work shows how to integrate the analysis of telemetry and spatial survey data under a novel joint likelihood function, using traditional computational methods. Our simulation experiments suggest that even when the relative costs of the two methods favor the deployment of one field approach over another, their joint use is broadly preferable. Therefore, joint analysis of the two key methods used in spatial ecology is not only possible but also computationally efficient and statistically more powerful.
数据整合,即对来自不同观测平台的数据进行联合统计分析,对于空间生态学等数据需求大的学科至关重要。从同一基础过程获得的合并数据类型,若进行联合分析,可提高物种分布模型和物种 - 栖息地关联模型的精度和准确性。然而,由于这两种数据类型所需的分析方法根本不同,遥测数据和空间调查数据的整合一直难以实现。这里,“空间调查”指记录空间位置且无时间结构的调查,例如线状或点状样带调查,但不包括标记重捕或遥测调查。步选择函数(SSF——遥测的规范框架)和栖息地选择函数(HSF——空间调查的默认方法)不仅在其规格上不同,结果也不同。通过施加微观机制(动物运动)必须正确放大到宏观现象(种群分布)的约束条件,可以写出SSF和HSF之间的关系,从而得到使用两个数据集的联合似然函数。我们使用最大似然法实现这种方法,通过系统模拟探索其估计精度,并试图得出野外工作中努力分配的广泛指导原则。我们发现,遥测数据和调查数据在空间覆盖和分辨率上的互补性,通常会使联合分析比单独使用任何一种数据类型的分析在推断上有显著改进。两种方法之间的最佳努力分配(或者如果只能选择一种方法时在它们之间的选择)取决于所投入的总体努力以及环境异质性模式。在广泛的场景中检查两种方法相对成本的推断性能时,我们发现综合分析通常提供更高的精度。我们的方法学工作展示了如何在一个新的联合似然函数下,使用传统计算方法整合遥测数据和空间调查数据的分析。我们的模拟实验表明,即使两种方法的相对成本有利于采用一种野外方法而非另一种,它们联合使用通常更可取。因此,对空间生态学中使用的两种关键方法进行联合分析不仅是可能的,而且在计算上是高效的,在统计上更具威力。