Department of Biology, University of Waterloo, Waterloo, Ontario, Canada.
Cognitive and Behavioural Ecology Interdisciplinary Program, Memorial University of Newfoundland, St. John's, Newfoundland and Labrador, Canada.
PLoS One. 2021 Jun 17;16(6):e0252471. doi: 10.1371/journal.pone.0252471. eCollection 2021.
Social network analysis is increasingly applied to understand animal groups. However, it is rarely feasible to observe every interaction among all individuals in natural populations. Studies have assessed how missing information affects estimates of individual network positions, but less attention has been paid to metrics that characterize overall network structure such as modularity, clustering coefficient, and density. In cases such as groups displaying fission-fusion dynamics, where subgroups break apart and rejoin in changing conformations, missing information may affect estimates of global network structure differently than in groups with distinctly separated communities due to the influence single individuals can have on the connectivity of the network. Using a bat maternity group showing fission-fusion dynamics, we quantify the effect of missing data on global network measures including community detection. In our system, estimating the number of communities was less reliable than detecting community structure. Further, reliably assorting individual bats into communities required fewer individuals and fewer observations per individual than to estimate the number of communities. Specifically, our metrics of global network structure (i.e., graph density, clustering coefficient, Rcom) approached the 'real' values with increasing numbers of observations per individual and, as the number of individuals included increased, the variance in these estimates decreased. Similar to previous studies, we recommend that more observations per individual should be prioritized over including more individuals when resources are limited. We recommend caution when making conclusions about animal social networks when a substantial number of individuals or observations are missing, and when possible, suggest subsampling large datasets to observe how estimates are influenced by sampling intensity. Our study serves as an example of the reliability, or lack thereof, of global network measures with missing information, but further work is needed to determine how estimates will vary with different data collection methods, network structures, and sampling periods.
社会网络分析越来越多地被应用于理解动物群体。然而,在自然种群中观察所有个体之间的所有相互作用通常是不可行的。研究已经评估了缺失信息如何影响个体网络位置的估计,但对描述整体网络结构的指标(如模块性、聚类系数和密度)的关注较少。在群体表现出裂变-融合动态的情况下,亚群会分裂并以不断变化的形态重新加入,由于单个个体对网络的连通性可能产生影响,因此缺失信息可能会对全局网络结构的估计产生不同于具有明显分离社区的群体的影响。我们使用表现出裂变-融合动态的蝙蝠母婴群体,量化了缺失数据对包括社区检测在内的全局网络度量的影响。在我们的系统中,估计社区数量的可靠性不如检测社区结构。此外,将个体蝙蝠可靠地分类到社区中所需的个体数量和每个个体的观察次数少于估计社区数量。具体来说,我们的全局网络结构度量(即图密度、聚类系数、Rcom)随着每个个体的观察次数的增加而接近“真实”值,并且随着所包含的个体数量的增加,这些估计的方差减小。与之前的研究类似,我们建议在资源有限的情况下,优先考虑每个个体的更多观察次数,而不是增加更多个体。当大量个体或观察值缺失时,我们建议在对动物社交网络做出结论时要谨慎,并在可能的情况下,建议对大型数据集进行抽样观察,以了解估计值如何受到抽样强度的影响。我们的研究为具有缺失信息的全局网络度量的可靠性或缺乏可靠性提供了一个示例,但需要进一步的工作来确定估计值将如何随不同的数据收集方法、网络结构和采样期而变化。