• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

广义最小二乘法可以克服应答驱动抽样中的关键阈值。

Generalized least squares can overcome the critical threshold in respondent-driven sampling.

机构信息

Department of Mathematics, University of Wisconsin, Madison, WI 53706.

Department of Statistics, University of Wisconsin, Madison, WI 53706

出版信息

Proc Natl Acad Sci U S A. 2018 Oct 9;115(41):10299-10304. doi: 10.1073/pnas.1706699115. Epub 2018 Sep 25.

DOI:10.1073/pnas.1706699115
PMID:30254152
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6187121/
Abstract

To sample marginalized and/or hard-to-reach populations, respondent-driven sampling (RDS) and similar techniques reach their participants via peer referral. Under a Markov model for RDS, previous research has shown that if the typical participant refers too many contacts, then the variance of common estimators does not decay like [Formula: see text], where n is the sample size. This implies that confidence intervals will be far wider than under a typical sampling design. Here we show that generalized least squares (GLS) can effectively reduce the variance of RDS estimates. In particular, a theoretical analysis indicates that the variance of the GLS estimator is [Formula: see text] We then derive two classes of feasible GLS estimators. The first class is based upon a Degree Corrected Stochastic Blockmodel for the underlying social network. The second class is based upon a rank-two model. It might be of independent interest that in both model classes, the theoretical results show that it is possible to estimate the spectral properties of the population network from a random walk sample of the nodes. These theoretical results point the way to entirely different classes of estimators that account for the network structure beyond node degree. Diagnostic plots help to identify situations where feasible GLS estimators are more appropriate. The computational experiments show the potential benefits and also indicate that there is room to further develop these estimators in practical settings.

摘要

为了对边缘化和/或难以接触的人群进行抽样,响应驱动抽样(RDS)和类似技术通过同伴推荐来联系参与者。在 RDS 的马尔可夫模型下,先前的研究表明,如果典型参与者推荐了太多的联系人,那么常见估计量的方差不会像 [公式:见正文] 那样衰减,其中 n 是样本量。这意味着置信区间将远远宽于典型抽样设计。在这里,我们表明广义最小二乘法(GLS)可以有效地降低 RDS 估计量的方差。特别是,理论分析表明 GLS 估计量的方差为 [公式:见正文]。然后我们推导出两类可行的 GLS 估计量。第一类基于底层社交网络的校正度数随机块模型。第二类基于二阶模型。在这两个模型类别中,理论结果表明,有可能从节点的随机游走样本中估计出总体网络的谱性质,这可能具有独立的意义。这些理论结果为那些考虑到节点度数之外的网络结构的完全不同的估计类别指明了方向。诊断图有助于识别可行 GLS 估计量更适用的情况。计算实验表明了潜在的好处,并表明在实际环境中还有进一步开发这些估计量的空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bf4/6187121/b8d143040713/pnas.1706699115fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bf4/6187121/9fc94d5407c4/pnas.1706699115fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bf4/6187121/d82f575c18b4/pnas.1706699115fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bf4/6187121/b8d143040713/pnas.1706699115fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bf4/6187121/9fc94d5407c4/pnas.1706699115fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bf4/6187121/d82f575c18b4/pnas.1706699115fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bf4/6187121/b8d143040713/pnas.1706699115fig03.jpg

相似文献

1
Generalized least squares can overcome the critical threshold in respondent-driven sampling.广义最小二乘法可以克服应答驱动抽样中的关键阈值。
Proc Natl Acad Sci U S A. 2018 Oct 9;115(41):10299-10304. doi: 10.1073/pnas.1706699115. Epub 2018 Sep 25.
2
Revisiting random walk based sampling in networks: evasion of burn-in period and frequent regenerations.重新审视网络中基于随机游走的采样:避免初始阶段和频繁再生。
Comput Soc Netw. 2018;5(1):4. doi: 10.1186/s40649-018-0051-0. Epub 2018 Mar 19.
3
Network Structure and Biased Variance Estimation in Respondent Driven Sampling.应答驱动抽样中的网络结构与偏差方差估计
PLoS One. 2015 Dec 17;10(12):e0145296. doi: 10.1371/journal.pone.0145296. eCollection 2015.
4
The development of respondent-driven sampling (RDS) inference: A systematic review of the population mean and variance estimates.回应驱动抽样(RDS)推断的发展:人口均值和方差估计的系统评价。
Drug Alcohol Depend. 2020 Jan 1;206:107702. doi: 10.1016/j.drugalcdep.2019.107702. Epub 2019 Nov 1.
5
Evaluation of Respondent-Driven Sampling Prevalence Estimators Using Real-World Reported Network Degree.使用实际报告的网络度数评估应答驱动抽样患病率估计器。
Sociol Methodol. 2023 Aug;53(2):269-287. doi: 10.1177/00811750231163832. Epub 2023 Apr 21.
6
Respondent-Driven Sampling: An Assessment of Current Methodology.应答者驱动抽样:当前方法评估
Sociol Methodol. 2010 Aug;40(1):285-327. doi: 10.1111/j.1467-9531.2010.01223.x.
7
Modeling and analyzing respondent-driven sampling as a counting process.将应答者驱动抽样建模并分析为一个计数过程。
Biometrics. 2017 Dec;73(4):1189-1198. doi: 10.1111/biom.12678. Epub 2017 Mar 3.
8
Does size really matter? A sensitivity analysis of number of seeds in a respondent-driven sampling study of gay, bisexual and other men who have sex with men in Vancouver, Canada.规模真的重要吗?对加拿大温哥华男同性恋、双性恋及其他与男性发生性行为的男性进行的应答驱动抽样研究中种子数的敏感性分析。
BMC Med Res Methodol. 2016 Nov 16;16(1):157. doi: 10.1186/s12874-016-0258-4.
9
NEW SURVEY QUESTIONS AND ESTIMATORS FOR NETWORK CLUSTERING WITH RESPONDENT-DRIVEN SAMPLING DATA.用于基于受访者驱动抽样数据的网络聚类的新调查问题和估计器
Sociol Methodol. 2017 Aug;47(1):274-306. doi: 10.1177/0081175017716489. Epub 2017 Jul 6.
10
A SIMULATION-BASED FRAMEWORK FOR ASSESSING THE FEASIBILITY OF RESPONDENT-DRIVEN SAMPLING FOR ESTIMATING CHARACTERISTICS IN POPULATIONS OF LESBIAN, GAY AND BISEXUAL OLDER ADULTS.一种基于模拟的框架,用于评估应答驱动抽样在估计女同性恋、男同性恋和双性恋老年人群体特征方面的可行性。
Ann Appl Stat. 2018 Dec;12(4):2252-2278. doi: 10.1214/18-AOAS1151. Epub 2018 Nov 13.

本文引用的文献

1
Estimating uncertainty in respondent-driven sampling using a tree bootstrap method.使用树状自举法估计应答者驱动抽样中的不确定性。
Proc Natl Acad Sci U S A. 2016 Dec 20;113(51):14668-14673. doi: 10.1073/pnas.1617258113. Epub 2016 Dec 7.
2
Strengthening the Reporting of Observational Studies in Epidemiology for respondent-driven sampling studies: "STROBE-RDS" statement.《流行病学观察性研究报告加强规范:应答驱动抽样研究“STROBE-RDS”声明》
J Clin Epidemiol. 2015 Dec;68(12):1463-71. doi: 10.1016/j.jclinepi.2015.04.002. Epub 2015 May 1.
3
Stochastic blockmodels and community structure in networks.
网络中的随机块模型与社区结构
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Jan;83(1 Pt 2):016107. doi: 10.1103/PhysRevE.83.016107. Epub 2011 Jan 21.
4
Assessing respondent-driven sampling.评估受访者驱动抽样法。
Proc Natl Acad Sci U S A. 2010 Apr 13;107(15):6743-7. doi: 10.1073/pnas.1000261107. Epub 2010 Mar 29.
5
Respondent-driven sampling as Markov chain Monte Carlo.作为马尔可夫链蒙特卡罗方法的应答者驱动抽样
Stat Med. 2009 Jul 30;28(17):2202-29. doi: 10.1002/sim.3613.
6
Social networks and infectious disease: the Colorado Springs Study.社交网络与传染病:科罗拉多斯普林斯研究
Soc Sci Med. 1994 Jan;38(1):79-88. doi: 10.1016/0277-9536(94)90302-6.