How differential privacy will affect our understanding of health disparities in the United States.

Author affiliations

Department of Human Development and Family Studies, The Pennsylvania State University, University Park, PA 16802;

Department of Public Health, University of Texas at San Antonio, San Antonio, TX 78249.

Publication information

Proc Natl Acad Sci U S A. 2020 Jun 16;117(24):13405-13412. doi: 10.1073/pnas.2003714117. Epub 2020 May 28.

Abstract

The application of a currently proposed differential privacy algorithm to the 2020 United States Census data and additional data products may affect the usefulness of these data, the accuracy of estimates and rates derived from them, and critical knowledge about social phenomena such as health disparities. We test the ramifications of applying differential privacy to released data by studying estimates of US mortality rates for the overall population and three major racial/ethnic groups. We ask how changes in the denominators of these vital rates due to the implementation of differential privacy can lead to biased estimates. We situate where these changes are most likely to matter by disaggregating biases by population size, degree of urbanization, and adjacency to a metropolitan area. Our results suggest that differential privacy will more strongly affect mortality rate estimates for non-Hispanic blacks and Hispanics than estimates for non-Hispanic whites. We also find significant changes in estimated mortality rates for less populous areas, with more pronounced changes when stratified by race/ethnicity. We find larger changes in estimated mortality rates for areas with lower levels of urbanization or adjacency to metropolitan areas, with these changes being greater for non-Hispanic blacks and Hispanics. These findings highlight the consequences of implementing differential privacy, as proposed, for research examining population composition, particularly mortality disparities across racial/ethnic groups and along the urban/rural continuum. Overall, they demonstrate the challenges in using the data products derived from the proposed disclosure avoidance methods, while highlighting critical instances where scientific understandings may be negatively impacted.
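The denominator effect described in the abstract can be illustrated with a minimal sketch. It is not the authors' method and not the Census Bureau's proposed algorithm: it assumes a plain Laplace mechanism applied to a single population count, holds the death count fixed (only the denominator is perturbed), and uses hypothetical values for the population sizes, death counts, and privacy parameter `epsilon`. Under those assumptions, the same amount of noise distorts the estimated rate far more in a small population than in a large one.

```python
import random


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def mean_relative_rate_error(deaths, population, epsilon, trials=2000, seed=0):
    """Average relative error of deaths / noisy_population over many draws.

    Assumption: a count query with sensitivity 1, so the Laplace scale
    is 1 / epsilon. The numerator (deaths) is left unperturbed.
    """
    rng = random.Random(seed)
    true_rate = deaths / population
    scale = 1.0 / epsilon
    total = 0.0
    for _ in range(trials):
        # Perturb only the denominator; floor at 1 to avoid division by zero.
        noisy_population = max(population + laplace_noise(scale, rng), 1.0)
        noisy_rate = deaths / noisy_population
        total += abs(noisy_rate - true_rate) / true_rate
    return total / trials


# Hypothetical areas with the same true rate (4%) but different sizes.
for label, deaths, population in [("small", 2, 50), ("large", 2000, 50000)]:
    err = mean_relative_rate_error(deaths, population, epsilon=0.5)
    print(f"{label} area: mean relative error in rate = {err:.5f}")
```

Because the noise scale does not depend on population size, the relative error shrinks roughly in proportion to the size of the denominator, which is consistent with the abstract's finding that less populous areas and smaller racial/ethnic groups see larger changes.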

Cited by

The shortcomings of synthetic census microdata.
Proc Natl Acad Sci U S A. 2025 Mar 18;122(11):e2424655122. doi: 10.1073/pnas.2424655122. Epub 2025 Mar 6.

Equitable differential privacy.
Front Big Data. 2024 Aug 16;7:1420344. doi: 10.3389/fdata.2024.1420344. eCollection 2024.

Distribution-Invariant Differential Privacy.
J Econom. 2023 Aug;235(2):444-453. doi: 10.1016/j.jeconom.2022.05.004. Epub 2022 Jun 18.
