处理两阶段规模比例概率抽样设计中不准确的规模测量：在非洲家庭调查中的应用

DEALING WITH INACCURATE MEASURES OF SIZE IN TWO-STAGE PROBABILITY PROPORTIONAL TO SIZE SAMPLE DESIGNS: APPLICATIONS IN AFRICAN HOUSEHOLD SURVEYS.

作者信息

Kalton Graham, Flores Cervantes Ismael, Arieira Carlos, Kwanisai Mike, Radin Elizabeth, Saito Suzue, DE Anindya K, McCracken Stephen, Stupp Paul

机构信息

Westat, 1600 Research Blvd, Rockville, MD 20850, USA.

ICAP, 722 West 168th Street, New York, NY 10032, USA.

出版信息

J Surv Stat Methodol. 2021 Nov;9(5):1035-1049. doi: 10.1093/jssam/smaa020.

DOI:10.1093/jssam/smaa020

PMID:39081797

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11288091/

Abstract

The units at the early stages of multi-stage area samples are generally sampled with probabilities proportional to their estimated sizes (PPES). With such a design, an overall equal probability (EP) sample design would yield a constant number of final stage units from each final stage cluster if the measures of size used in the PPES selection at each sampling stage were directly proportional to the number of final stage units. However, there are often sizable relative differences between the measures of size used in the PPES selections and the number of final stage units. Two common approaches for dealing with these differences are: (1) to retain a self-weighting sample design, allowing the sample sizes to vary across the sampled primary sampling units (PSUs) and (2) to retain the fixed sample size in each PSU and to compensate for the unequal selection probabilities by weighting adjustments in the analyses. This article examines these alternative designs in the context of two-stage sampling in which PSUs are sampled with PPES at the first stage, and an equal probability sample of final stage units is selected from each sampled PSU at the second stage. Two-stage sample designs of this type are used for household surveys in many countries. The discussion is illustrated with data from the Population-based HIV Impact Assessment surveys that were conducted using this design in several African countries.

摘要

多阶段区域样本早期阶段的单元通常按与其估计规模成比例的概率（PPES）进行抽样。采用这种设计，如果在每个抽样阶段PPES选择中使用的规模测量值与最后阶段单元的数量成正比，那么总体等概率（EP）样本设计将从每个最后阶段聚类中产生恒定数量的最后阶段单元。然而，在PPES选择中使用的规模测量值与最后阶段单元的数量之间往往存在相当大的相对差异。处理这些差异的两种常见方法是：（1）保持自加权样本设计，允许样本规模在抽样的初级抽样单元（PSU）之间变化；（2）在每个PSU中保持固定的样本规模，并在分析中通过加权调整来补偿不等的选择概率。本文在两阶段抽样的背景下研究这些替代设计，其中在第一阶段按PPES对PSU进行抽样，在第二阶段从每个抽样的PSU中选择等概率的最后阶段单元样本。这种类型的两阶段样本设计在许多国家用于家庭调查。讨论以在几个非洲国家使用这种设计进行的基于人群的艾滋病毒影响评估调查的数据为例。

相似文献

DEALING WITH INACCURATE MEASURES OF SIZE IN TWO-STAGE PROBABILITY PROPORTIONAL TO SIZE SAMPLE DESIGNS: APPLICATIONS IN AFRICAN HOUSEHOLD SURVEYS.处理两阶段规模比例概率抽样设计中不准确的规模测量：在非洲家庭调查中的应用

J Surv Stat Methodol. 2021 Nov;9(5):1035-1049. doi: 10.1093/jssam/smaa020.

GridSample: an R package to generate household survey primary sampling units (PSUs) from gridded population data.GridSample：一个用于从网格化人口数据生成住户调查初级抽样单位（PSU）的R软件包。

Int J Health Geogr. 2017 Jul 19;16(1):25. doi: 10.1186/s12942-017-0098-4.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Optimal two-stage sampling for mean estimation in multilevel populations when cluster size is informative.多水平总体中当群大小为信息性时的均值估计的最优两阶段抽样。

Stat Methods Med Res. 2021 Feb;30(2):357-375. doi: 10.1177/0962280220952833. Epub 2020 Sep 17.

Effectiveness and cost-effectiveness of four different strategies for SARS-CoV-2 surveillance in the general population (CoV-Surv Study): a structured summary of a study protocol for a cluster-randomised, two-factorial controlled trial.在普通人群中进行 SARS-CoV-2 监测的四种不同策略的有效性和成本效益（CoV-Surv 研究）：一项关于集群随机、双因素对照试验的研究方案的结构化总结。

Trials. 2021 Jan 8;22(1):39. doi: 10.1186/s13063-020-04982-z.

Achieving equal probability of selection under various random sampling strategies.在各种随机抽样策略下实现相等的选择概率。

Paediatr Perinat Epidemiol. 1995 Apr;9(2):219-24. doi: 10.1111/j.1365-3016.1995.tb00135.x.

Design and Weighting Methods for a Nationally Representative Sample of HIV-infected Adults Receiving Medical Care in the United States-Medical Monitoring Project.美国医疗监测项目中接受医疗护理的全国具有代表性的HIV感染成人样本的设计与加权方法

Open AIDS J. 2016 Aug 19;10:164-81. doi: 10.2174/1874613601610010164. eCollection 2016.

Bayesian inference under cluster sampling with probability proportional to size.基于大小与概率成比例的聚类抽样的贝叶斯推断。

Stat Med. 2018 Nov 20;37(26):3849-3868. doi: 10.1002/sim.7892. Epub 2018 Jul 4.

A Comparison of Population-Averaged and Cluster-Specific Approaches in the Context of Unequal Probabilities of Selection.在选择概率不相等的情况下，总体平均方法与特定聚类方法的比较。

Multivariate Behav Res. 2017 May-Jun;52(3):325-349. doi: 10.1080/00273171.2017.1292115. Epub 2017 Mar 10.

Relative efficiencies of two-stage sampling schemes for mean estimation in multilevel populations when cluster size is informative.当群规模是有信息时，多水平总体中均值估计的两阶段抽样方案的相对效率。

Stat Med. 2019 May 10;38(10):1817-1834. doi: 10.1002/sim.8070. Epub 2018 Dec 21.

本文引用的文献

An Extension of Kish's Formula for Design Effects to Two- and Three-Stage Designs with Stratification.基什公式的设计效应向具有分层的两阶段和三阶段设计的扩展。

J Surv Stat Methodol. 2017 Jun;5(2):111-130. doi: 10.1093/jssam/smw036. Epub 2017 Mar 13.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验