WorldPop, Geography and Environmental Science, University of Southampton, Southampton SO17 1BJ, United Kingdom;
WorldPop, Geography and Environmental Science, University of Southampton, Southampton SO17 1BJ, United Kingdom.
Proc Natl Acad Sci U S A. 2020 Sep 29;117(39):24173-24179. doi: 10.1073/pnas.1913050117. Epub 2020 Sep 14.
Population estimates are critical for government services, development projects, and public health campaigns. Such data are typically obtained through a national population and housing census. However, population estimates can quickly become inaccurate in localized areas, particularly where migration or displacement has occurred. Some conflict-affected and resource-poor countries have not conducted a census in over 10 y. We developed a hierarchical Bayesian model to estimate population numbers in small areas based on enumeration data from sample areas and nationwide information about administrative boundaries, building locations, settlement types, and other factors related to population density. We demonstrated this model by estimating population sizes in every 10- m grid cell in Nigeria with national coverage. These gridded population estimates and areal population totals derived from them are accompanied by estimates of uncertainty based on Bayesian posterior probabilities. The model had an overall error rate of 67 people per hectare (mean of absolute residuals) or 43% (using scaled residuals) for predictions in out-of-sample survey areas (approximately 3 ha each), with increased precision expected for aggregated population totals in larger areas. This statistical approach represents a significant step toward estimating populations at high resolution with national coverage in the absence of a complete and recent census, while also providing reliable estimates of uncertainty to support informed decision making.
人口估计对于政府服务、发展项目和公共卫生活动至关重要。此类数据通常通过全国人口和住房普查获得。然而,在局部地区,特别是在发生移民或流离失所的情况下,人口估计可能会迅速变得不准确。一些受冲突影响和资源匮乏的国家已经有 10 多年没有进行人口普查了。我们开发了一种分层贝叶斯模型,根据样本地区的普查数据以及关于行政边界、建筑物位置、定居类型和其他与人口密度相关的因素的全国信息,来估算小区域的人口数量。我们通过在全国范围内对尼日利亚的每个 10 米网格单元进行人口估算,展示了该模型。这些网格化的人口估计值及其衍生的面积人口总数都附有基于贝叶斯后验概率的不确定性估计值。对于样本外调查区域(每个区域约 3 公顷)的预测,该模型的总体误差率为每公顷 67 人(绝对残差的平均值)或 43%(使用比例残差),对于较大区域的聚合人口总数,预计会有更高的精度。在没有完整和最新的普查的情况下,这种统计方法代表了在全国范围内以高分辨率估算人口的重要一步,同时还提供了可靠的不确定性估计值,以支持明智的决策。