Suppr超能文献

人口普查办公室能否发布多个小区域地理单元的统计数据?对统计披露中差异问题的分析。

Can census offices publish statistics for more than one small area geography? An analysis of the differencing problem in statistical disclosure.

作者信息

Duke-williams O, Rees P

出版信息

Int J Geogr Inf Sci. 1998 Sep;12(6):579-605. doi: 10.1080/136588198241680.

Abstract

"The paper describes a problem faced by National Statistical Offices when publishing the results of decennial censuses for small geographical areas. If they publish statistical tables for two or more sets of areas, users can compare the tables and produce new statistics for the areas formed by differencing, which may have populations below confidentiality thresholds. To investigate the problem, the authors construct a software system and carry out a series of experiments using a large synthetic population base for Yorkshire and Humberside [in England]. The results indicate that publishing statistics for zones close in size to the primary areas is not safe unless the zones have been carefully designed. However, publishing statistics for sufficiently large areas such as 5km grid squares or postal sectors alongside enumeration districts is safe."

摘要

本文描述了国家统计局在发布小地理区域十年一次人口普查结果时面临的一个问题。如果他们发布两组或更多组区域的统计表,用户可以比较这些表格,并为通过差分形成的区域生成新的统计数据,而这些区域的人口可能低于保密阈值。为了研究这个问题,作者构建了一个软件系统,并使用[英格兰]约克郡和亨伯赛德郡的大量合成人口数据库进行了一系列实验。结果表明,除非区域经过精心设计,否则发布面积与主要区域相近的区域的统计数据是不安全的。然而,与枚举区一起发布足够大区域(如5公里网格方块或邮政部门)的统计数据是安全的。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验