Suppr超能文献

应答者驱动抽样:当前方法评估

Respondent-Driven Sampling: An Assessment of Current Methodology.

作者信息

Gile Krista J, Handcock Mark S

机构信息

Nuffield College, University of Oxford.

出版信息

Sociol Methodol. 2010 Aug;40(1):285-327. doi: 10.1111/j.1467-9531.2010.01223.x.

Abstract

Respondent-Driven Sampling (RDS) employs a variant of a link-tracing network sampling strategy to collect data from hard-to-reach populations. By tracing the links in the underlying social network, the process exploits the social structure to expand the sample and reduce its dependence on the initial (convenience) sample.The current estimators of population averages make strong assumptions in order to treat the data as a probability sample. We evaluate three critical sensitivities of the estimators: to bias induced by the initial sample, to uncontrollable features of respondent behavior, and to the without-replacement structure of sampling.Our analysis indicates: (1) that the convenience sample of seeds can induce bias, and the number of sample waves typically used in RDS is likely insufficient for the type of nodal mixing required to obtain the reputed asymptotic unbiasedness; (2) that preferential referral behavior by respondents leads to bias; (3) that when a substantial fraction of the target population is sampled the current estimators can have substantial bias.This paper sounds a cautionary note for the users of RDS. While current RDS methodology is powerful and clever, the favorable statistical properties claimed for the current estimates are shown to be heavily dependent on often unrealistic assumptions. We recommend ways to improve the methodology.

摘要

应答驱动抽样(RDS)采用了一种链接追踪网络抽样策略的变体,从难以接触到的人群中收集数据。通过追踪潜在社会网络中的链接,该过程利用社会结构来扩大样本规模并减少对初始(便利)样本的依赖。当前总体均值的估计方法做出了很强的假设,以便将数据视为概率样本。我们评估了估计方法的三个关键敏感性:对初始样本引起的偏差、对应答者行为不可控特征以及对无放回抽样结构的敏感性。我们的分析表明:(1)种子的便利样本可能会导致偏差,并且RDS中通常使用的样本轮次数量可能不足以实现获得所谓渐近无偏性所需的节点混合类型;(2)应答者的优先推荐行为会导致偏差;(3)当目标人群的很大一部分被抽样时,当前的估计方法可能会有很大偏差。本文为RDS的使用者敲响了警钟。虽然当前的RDS方法强大且巧妙,但当前估计所宣称的良好统计特性被证明严重依赖于往往不切实际的假设。我们推荐了改进该方法的途径。

相似文献

1
Respondent-Driven Sampling: An Assessment of Current Methodology.
Sociol Methodol. 2010 Aug;40(1):285-327. doi: 10.1111/j.1467-9531.2010.01223.x.
2
The efficacy of respondent-driven sampling for the health assessment of minority populations.
Cancer Epidemiol. 2017 Oct;50(Pt B):214-220. doi: 10.1016/j.canep.2017.07.006.
3
Evaluation of respondent-driven sampling.
Epidemiology. 2012 Jan;23(1):138-47. doi: 10.1097/EDE.0b013e31823ac17c.
6
THE GRAPHICAL STRUCTURE OF RESPONDENT-DRIVEN SAMPLING.
Sociol Methodol. 2016;46(1):187-211. doi: 10.1177/0081175016641713. Epub 2016 Aug 1.
7
The development of respondent-driven sampling (RDS) inference: A systematic review of the population mean and variance estimates.
Drug Alcohol Depend. 2020 Jan 1;206:107702. doi: 10.1016/j.drugalcdep.2019.107702. Epub 2019 Nov 1.
8
Modeling and analyzing respondent-driven sampling as a counting process.
Biometrics. 2017 Dec;73(4):1189-1198. doi: 10.1111/biom.12678. Epub 2017 Mar 3.
9
Identification of Homophily and Preferential Recruitment in Respondent-Driven Sampling.
Am J Epidemiol. 2018 Jan 1;187(1):153-160. doi: 10.1093/aje/kwx208.
10
Network Structure and Biased Variance Estimation in Respondent Driven Sampling.
PLoS One. 2015 Dec 17;10(12):e0145296. doi: 10.1371/journal.pone.0145296. eCollection 2015.

引用本文的文献

1
Predictors of perceived HIV stigma among people who inject drugs in the United States.
Harm Reduct J. 2025 Aug 30;22(1):148. doi: 10.1186/s12954-025-01285-x.
4
Inferring bivariate associations with continuous data from studies using respondent-driven sampling.
J R Stat Soc Ser C Appl Stat. 2024 Nov 26;74(2):429-446. doi: 10.1093/jrsssc/qlae061. eCollection 2025 Mar.
7
Prevalence and predictors of condom use among people who inject drugs in Georgia.
Harm Reduct J. 2025 Feb 20;22(1):21. doi: 10.1186/s12954-025-01171-6.
8
Influence of insurance type on healthcare utilization among rural people who use drugs.
Drug Alcohol Depend. 2025 Apr 1;269:112542. doi: 10.1016/j.drugalcdep.2024.112542. Epub 2025 Jan 9.
9
Optimization of multiple sampling for solving network boundary specification problem.
Sci Rep. 2025 Feb 4;15(1):4221. doi: 10.1038/s41598-025-87760-8.
10
Gender Characteristics and Population Size Estimation of Transgender People: A Field-Based Study from Iran.
Transgend Health. 2024 Aug 16;9(4):348-356. doi: 10.1089/trgh.2021.0073. eCollection 2024 Aug.

本文引用的文献

1
MODELING SOCIAL NETWORKS FROM SAMPLED DATA.
Ann Appl Stat. 2010;4(1):5-25. doi: 10.1214/08-AOAS221.
3
SNOWBALL VERSUS RESPONDENT-DRIVEN SAMPLING.
Sociol Methodol. 2011 Aug 1;41(1):355-366. doi: 10.1111/j.1467-9531.2011.01244.x.
4
Respondent-driven sampling as Markov chain Monte Carlo.
Stat Med. 2009 Jul 30;28(17):2202-29. doi: 10.1002/sim.3613.
10
Variance estimation, design effects, and sample size calculations for respondent-driven sampling.
J Urban Health. 2006 Nov;83(6 Suppl):i98-112. doi: 10.1007/s11524-006-9106-x.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验