Kalton Graham, Flores Cervantes Ismael, Arieira Carlos, Kwanisai Mike, Radin Elizabeth, Saito Suzue, De Anindya K, McCracken Stephen, Stupp Paul
Westat, 1600 Research Blvd, Rockville, MD 20850, USA.
ICAP, 722 West 168th Street, New York, NY 10032, USA.
J Surv Stat Methodol. 2021 Nov;9(5):1035-1049. doi: 10.1093/jssam/smaa020.
The units at the early stages of multi-stage area samples are generally sampled with probabilities proportional to their estimated sizes (PPES). With such a design, an overall equal probability (EP) sample design would yield a constant number of final stage units from each final stage cluster if the measures of size used in the PPES selection at each sampling stage were directly proportional to the number of final stage units. However, there are often sizable relative differences between the measures of size used in the PPES selections and the number of final stage units. Two common approaches for dealing with these differences are: (1) to retain a self-weighting sample design, allowing the sample sizes to vary across the sampled primary sampling units (PSUs) and (2) to retain the fixed sample size in each PSU and to compensate for the unequal selection probabilities by weighting adjustments in the analyses. This article examines these alternative designs in the context of two-stage sampling in which PSUs are sampled with PPES at the first stage, and an equal probability sample of final stage units is selected from each sampled PSU at the second stage. Two-stage sample designs of this type are used for household surveys in many countries. The discussion is illustrated with data from the Population-based HIV Impact Assessment surveys that were conducted using this design in several African countries.
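The contrast between the two approaches can be sketched numerically. The following is a minimal illustration, not the authors' code: the function names and all inputs are hypothetical, and it assumes a single-stratum, two-stage design with no certainty PSUs (every first-stage probability is at most 1) and a fixed take no larger than any PSU's actual count. Here `mos` holds the estimated measures of size used in the PPES selection and `actual` holds the number of final stage units found in each PSU.

```python
# Illustrative sketch only; names and inputs are hypothetical, not from the article.

def first_stage_probs(mos, n_psus):
    """PPES selection probability of each PSU: n_psus * M_i / sum(M)."""
    total = sum(mos)
    probs = [n_psus * m / total for m in mos]
    if any(p > 1 for p in probs):
        raise ValueError("a PSU has selection probability above 1")
    return probs


def self_weighting_sample_sizes(mos, actual, n_psus, overall_rate):
    """Approach 1: hold the overall selection probability f fixed.

    The within-PSU sampling rate is f / pi_i, so the expected sample
    size f * N_i / pi_i varies with the ratio N_i / M_i.
    """
    probs = first_stage_probs(mos, n_psus)
    return [overall_rate * n / p for p, n in zip(probs, actual)]


def fixed_take_weights(mos, actual, n_psus, take):
    """Approach 2: select the same number of units (take) in every PSU.

    The overall selection probability is pi_i * take / N_i, so the
    design weight N_i / (pi_i * take) varies with the ratio N_i / M_i.
    """
    probs = first_stage_probs(mos, n_psus)
    return [n / (p * take) for p, n in zip(probs, actual)]
```

For example, with measures of size 100, 200, 300, 400, actual counts 120, 180, 300, 400, and two sampled PSUs, an overall rate of 0.04 gives expected sample sizes of about 24, 18, 20, and 20 under the first approach, while a fixed take of 20 gives design weights of about 30, 22.5, 25, and 25 under the second. Where the measure of size matches the actual count, both approaches coincide (20 units, weight 25).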