Joint Program in Survey Methodology, University of Maryland, College Park, Maryland, USA.
Independent Consultant, LGJ Consultants, Inc., Valencia, Spain.
Biom J. 2023 Jun;65(5):e2200136. doi: 10.1002/bimj.202200136. Epub 2023 Mar 6.
Estimating the size of hidden populations is essential to understand the magnitude of social and healthcare needs, risk behaviors, and disease burden. However, due to the hidden nature of these populations, they are difficult to survey, and there are no gold standard size estimation methods. Many different methods and variations exist, and diagnostic tools are needed to help researchers assess method-specific assumptions as well as compare between methods. Further, because many necessary mathematical assumptions are unrealistic for real survey implementation, assessment of how robust methods are to deviations from the stated assumptions is essential. We describe diagnostics and assess the performance of a new population size estimation method, capture-recapture with successive sampling population size estimation (CR-SS-PSE), which we apply to data from 3 years of studies from three cities and three hidden populations in Armenia. CR-SS-PSE relies on data from two sequential respondent-driven sampling surveys and extends the successive sampling population size estimation (SS-PSE) framework by using the number of individuals in the overlap between the two surveys and a model for the successive sampling process to estimate population size. We demonstrate that CR-SS-PSE is more robust to violations of successive sampling assumptions than SS-PSE. Further, we compare the CR-SS-PSE estimates to population size estimations using other common methods, including unique object and service multipliers, wisdom of the crowd, and two-source capture-recapture to illustrate volatility across estimation methods.
估算隐蔽人群的规模对于了解社会和医疗保健需求、风险行为和疾病负担的程度至关重要。然而,由于这些人群的隐蔽性质,很难对其进行调查,而且也没有黄金标准的规模估算方法。存在许多不同的方法和变体,需要诊断工具来帮助研究人员评估特定方法的假设,并比较不同方法之间的差异。此外,由于许多必要的数学假设在实际调查实施中不切实际,因此评估方法对所陈述假设的偏差的稳健性至关重要。我们描述了诊断方法,并评估了一种新的人口规模估算方法——捕获-再捕获与连续抽样人口规模估算(CR-SS-PSE)的性能,我们将该方法应用于来自亚美尼亚三个城市和三个隐蔽人群的三年研究数据。CR-SS-PSE 依赖于两个连续的受访者驱动抽样调查的数据,并通过使用两个调查之间重叠的个体数量和用于连续抽样过程的模型来扩展连续抽样人口规模估算(SS-PSE)框架,以估算人口规模。我们证明,CR-SS-PSE 比 SS-PSE 更能抵抗连续抽样假设的违反。此外,我们将 CR-SS-PSE 估计值与其他常见方法(包括独特对象和服务乘数、众包和双源捕获-再捕获)的人口规模估计值进行比较,以说明估计方法的波动性。