O'Malley A James, Paul Sudeshna
The Dartmouth Institute for Healh Policy and Clinical Practice, Geisel School of Medicine at Dartmouth, Lebanon, NH 03766, USA.
Nell Hodgson Woodruff School of Nursing, Emory University, Atlanta, GA 30322, USA.
Comput Stat Data Anal. 2015 Feb 1;82:35-46. doi: 10.1016/j.csda.2014.08.001.
Estimation of longitudinal models of relationship status between all pairs of individuals (dyads) in social networks is challenging due to the complex inter-dependencies among observations and lengthy computation times. To reduce the computational burden of model estimation, a method is developed that subsamples the "always-null" dyads in which no relationships develop throughout the period of observation. The informative sampling process is accounted for by weighting the likelihood contributions of the observations by the inverses of the sampling probabilities. This weighted-likelihood estimation method is implemented using Bayesian computation and evaluated in terms of its bias, efficiency, and speed of computation under various settings. Comparisons are also made to a full information likelihood-based procedure that is only feasible to compute when limited follow-up observations are available. Calculations are performed on two real social networks of very different sizes. The easily computed weighted-likelihood procedure closely approximates the corresponding estimates for the full network, even when using low sub-sampling fractions. The fast computation times make the weighted-likelihood approach practical and able to be applied to networks of any size.
估计社交网络中所有个体对(二元组)之间关系状态的纵向模型具有挑战性,这是由于观测值之间存在复杂的相互依赖关系以及计算时间较长。为了减轻模型估计的计算负担,开发了一种方法,该方法对“始终为空”的二元组进行子采样,即在整个观测期内没有关系发展的二元组。通过用采样概率的倒数对观测值的似然贡献进行加权来考虑信息采样过程。这种加权似然估计方法通过贝叶斯计算实现,并在各种设置下根据其偏差、效率和计算速度进行评估。还与基于全信息似然的程序进行了比较,该程序只有在有限的随访观测可用时才可行计算。在两个规模差异很大的真实社交网络上进行了计算。即使使用低子采样比例,易于计算的加权似然程序也能非常接近全网络的相应估计值。快速的计算时间使得加权似然方法切实可行,并且能够应用于任何规模的网络。