Habecker Patrick, Dombrowski Kirk, Khan Bilal
Department of Sociology, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America.
PLoS One. 2015 Dec 2;10(12):e0143406. doi: 10.1371/journal.pone.0143406. eCollection 2015.
Researchers interested in studying populations that are difficult to reach through traditional survey methods can now draw on a range of methods to access these populations. Yet many of these methods are more expensive and difficult to implement than studies using conventional sampling frames and trusted sampling methods. The network scale-up method (NSUM) provides a middle ground for researchers who wish to estimate the size of a hidden population, but lack the resources to conduct a more specialized hidden population study. Through this method it is possible to generate population estimates for a wide variety of groups that are perhaps unwilling to self-identify as such (for example, users of illegal drugs or other stigmatized populations) via traditional survey tools such as telephone or mail surveys--by asking a representative sample to estimate the number of people they know who are members of such a "hidden" subpopulation. The original estimator is formulated to minimize the weight a single scaling variable can exert upon the estimates. We argue that this introduces hidden and difficult to predict biases, and instead propose a series of methodological advances on the traditional scale-up estimation procedure, including a new estimator. Additionally, we formalize the incorporation of sample weights into the network scale-up estimation process, and propose a recursive process of back estimation "trimming" to identify and remove poorly performing predictors from the estimation process. To demonstrate these suggestions we use data from a network scale-up mail survey conducted in Nebraska during 2014. We find that using the new estimator and recursive trimming process provides more accurate estimates, especially when used in conjunction with sampling weights.
那些有兴趣研究难以通过传统调查方法触及的人群的研究人员,如今可以借助一系列方法来接触这些人群。然而,与使用传统抽样框架和可靠抽样方法的研究相比,这些方法中的许多成本更高且实施难度更大。网络扩大法(NSUM)为那些希望估计隐藏人群规模但缺乏资源开展更专业的隐藏人群研究的研究人员提供了一条中间途径。通过这种方法,有可能通过电话或邮件调查等传统调查工具,让有代表性的样本估计他们认识的属于此类“隐藏”亚人群体的人数,从而为各种可能不愿自我认定为该群体的人群(例如非法药物使用者或其他受污名化的人群)生成人口估计数。原始估计量的制定是为了尽量减少单个缩放变量对估计值的影响。我们认为这会引入难以预测的隐藏偏差,因此提出了一系列在传统扩大估计程序基础上的方法改进,包括一种新的估计量。此外,我们将样本权重纳入网络扩大估计过程的做法形式化,并提出一种反向估计“修剪”的递归过程,以识别并从估计过程中去除表现不佳的预测变量。为了证明这些建议,我们使用了2014年在 Nebraska 进行的一次网络扩大邮件调查的数据。我们发现,使用新的估计量和递归修剪过程能提供更准确的估计值,尤其是与抽样权重结合使用时。