De-identifying Socioeconomic Data at the Census Tract Level for Medical Research Through Constraint-based Clustering.

Affiliations

Vanderbilt University, Nashville, TN.

Vanderbilt University Medical Center, Nashville, TN.

Publication information

AMIA Annu Symp Proc. 2022 Feb 21;2021:793-802. eCollection 2021.

PMID: 35309009
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8861681/
Abstract

Numerous studies have shown that a person's health status is closely related to their socioeconomic status. It is evident that incorporating socioeconomic data associated with a patient's geographic area of residence into clinical datasets will promote medical research. However, most socioeconomic variables are unique in combination and are affiliated with small geographical regions (e.g., census tracts) that are often associated with less than 20,000 people. Thus, sharing such tract-level data can violate the Safe Harbor implementation of de-identification under the Health Insurance Portability and Accountability Act of 1996 (HIPAA). In this paper, we introduce a constraint-based k-means clustering approach to generate census tract-level socioeconomic data that is de-identification compliant. Our experimental analysis with data from the American Community Survey illustrates that the approach generates a protected dataset with high similarity to the unaltered values, and achieves a substantially better data utility than the HIPAA Safe Harbor recommendation of 3-digit ZIP code.
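The abstract names the technique but gives no algorithmic detail, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each census tract is a numeric feature vector with a known population, runs plain k-means, then merges any cluster whose total population falls below 20,000 into its nearest neighbor, and finally replaces each tract's values with its cluster centroid. All function names (`kmeans`, `merge_small_clusters`, `deidentify`) and the merge heuristic are hypothetical choices made for illustration.

```python
import random

MIN_POP = 20_000  # population threshold cited in the abstract for Safe Harbor


def dist2(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(points):
    """Component-wise mean of a non-empty list of feature vectors."""
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))


def kmeans(features, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per tract."""
    rng = random.Random(seed)
    centers = rng.sample(features, k)
    labels = [0] * len(features)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: dist2(f, centers[c]))
                  for f in features]
        for c in range(k):
            members = [f for f, lab in zip(features, labels) if lab == c]
            if members:  # keep the old center if a cluster went empty
                centers[c] = centroid(members)
    return labels


def merge_small_clusters(features, pops, labels, min_pop=MIN_POP):
    """Assumed repair step: fold under-populated clusters into the nearest one."""
    by_label = {}
    for i, lab in enumerate(labels):
        by_label.setdefault(lab, []).append(i)
    groups = list(by_label.values())

    def population(group):
        return sum(pops[i] for i in group)

    def center(group):
        return centroid([features[i] for i in group])

    # Each pass removes one group, so the loop always terminates. If the
    # total population is below min_pop, one undersized group remains.
    while len(groups) > 1:
        small = [g for g in groups if population(g) < min_pop]
        if not small:
            break
        victim = min(small, key=population)
        groups = [g for g in groups if g is not victim]
        nearest = min(groups, key=lambda g: dist2(center(victim), center(g)))
        nearest.extend(victim)
    return groups


def deidentify(features, pops, k):
    """Replace each tract's values with the centroid of its merged cluster."""
    groups = merge_small_clusters(features, pops, kmeans(features, k))
    protected = [None] * len(features)
    for g in groups:
        c = centroid([features[i] for i in g])
        for i in g:
            protected[i] = c
    return protected, groups
```

A call such as `deidentify(features, pops, k=6)` returns the protected values together with the tract groupings, so the population of every released group can be checked against the threshold. A constrained variant that enforces the minimum during assignment, rather than repairing afterwards, would likely preserve more utility; the post-hoc merge is used here only because it is short.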
