• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估匿名数据集重新识别风险的实用且现成的方法。

Practical and ready-to-use methodology to assess the re-identification risk in anonymized datasets.

作者信息

Sondeck Louis Philippe, Laurent Maryline

机构信息

COACHMESEC Consulting (Clever Identity), 151 rue des Meuniers, Bagneux, 92220, France.

Samovar, Télécom SudParis, Institut Polytechnique de Paris, 19 Place Marguerite Perey, Palaiseau, 91120, France.

出版信息

Sci Rep. 2025 Jul 2;15(1):23223. doi: 10.1038/s41598-025-04907-3.

DOI:10.1038/s41598-025-04907-3
PMID:40603887
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12222771/
Abstract

To prove that a dataset is sufficiently anonymized, many privacy policies suggest that a re-identification risk assessment be performed, but do not provide a precise methodology for doing so, leaving the industry alone with the problem. This paper proposes a practical and ready-to-use methodology for re-identification risk assessment, the originality of which is manifold: (1) it is the first to follow well-known risk analysis methods (e.g. EBIOS) that have been used in the cybersecurity field for years, which consider not only the ability to perform an attack, but also the severity such an attack can have on an individual; (2) it is the first to qualify attributes and values of attributes with e.g. degree of exposure, as known real-world attacks mainly target certain types of attributes and not others; (3) it is the first to provide clear, comprehensible criteria and interpretable, explainable assessment results. In addition, the fine granularity of the methodology makes it possible to score the risk as accurately as possible, and thus maintain good data quality at an acceptable risk, which is very promising for the AI industrial sector. Finally, the implementation of the methodology is illustrated using the publicly available Adult dataset, which was assessed as having a critical risk of re-identification, with 14 concrete cases of individualization.

摘要

为证明一个数据集已充分匿名化,许多隐私政策建议进行重新识别风险评估,但未提供具体的操作方法,这使得该行业只能独自面对这个问题。本文提出了一种实用且易于使用的重新识别风险评估方法,其创新性体现在多个方面:(1)它首次采用了多年来在网络安全领域使用的知名风险分析方法(如EBIOS),该方法不仅考虑攻击的能力,还考虑此类攻击对个人可能造成的严重性;(2)它首次对属性及其值进行了定性,例如暴露程度,因为已知现实世界中的攻击主要针对某些类型的属性而非其他属性;(3)它首次提供了清晰、易懂的标准以及可解释、可说明的评估结果。此外,该方法的精细粒度使得能够尽可能准确地对风险进行评分,从而在可接受的风险水平下保持良好的数据质量,这对人工智能产业部门非常有前景。最后,使用公开可用的成人数据集说明了该方法的实施情况,该数据集被评估为具有重新识别的关键风险,存在14个具体的个体化案例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d3d/12222771/6282ff368578/41598_2025_4907_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d3d/12222771/d4f58eb2a433/41598_2025_4907_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d3d/12222771/6282ff368578/41598_2025_4907_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d3d/12222771/d4f58eb2a433/41598_2025_4907_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d3d/12222771/6282ff368578/41598_2025_4907_Fig2_HTML.jpg

相似文献

1
Practical and ready-to-use methodology to assess the re-identification risk in anonymized datasets.评估匿名数据集重新识别风险的实用且现成的方法。
Sci Rep. 2025 Jul 2;15(1):23223. doi: 10.1038/s41598-025-04907-3.
2
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
3
Interventions targeted at women to encourage the uptake of cervical screening.针对女性的干预措施,以鼓励她们接受宫颈癌筛查。
Cochrane Database Syst Rev. 2021 Sep 6;9(9):CD002834. doi: 10.1002/14651858.CD002834.pub3.
4
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
9
Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。
Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.
10
The clinical effectiveness and cost-effectiveness of enzyme replacement therapy for Gaucher's disease: a systematic review.戈谢病酶替代疗法的临床疗效和成本效益:一项系统评价。
Health Technol Assess. 2006 Jul;10(24):iii-iv, ix-136. doi: 10.3310/hta10240.

本文引用的文献

1
Health Data Re-Identification: Assessing Adversaries and Potential Harms.健康数据再识别:评估对手和潜在危害。
Stud Health Technol Inform. 2024 Aug 22;316:1199-1203. doi: 10.3233/SHTI240626.
2
Enabling realistic health data re-identification risk assessment through adversarial modeling.通过对抗建模实现现实健康数据重新识别风险评估。
J Am Med Inform Assoc. 2021 Mar 18;28(4):744-752. doi: 10.1093/jamia/ocaa327.
3
Estimating the re-identification risk of clinical data sets.估算临床数据集的再识别风险。
BMC Med Inform Decis Mak. 2012 Jul 9;12:66. doi: 10.1186/1472-6947-12-66.
4
A systematic review of re-identification attacks on health data.对健康数据再识别攻击的系统综述。
PLoS One. 2011;6(12):e28071. doi: 10.1371/journal.pone.0028071. Epub 2011 Dec 2.
5
Evaluating re-identification risks with respect to the HIPAA privacy rule.评估 HIPAA 隐私规则下的重新识别风险。
J Am Med Inform Assoc. 2010 Mar-Apr;17(2):169-77. doi: 10.1136/jamia.2009.000026.
6
Protecting privacy using k-anonymity.使用 k-匿名保护隐私。
J Am Med Inform Assoc. 2008 Sep-Oct;15(5):627-37. doi: 10.1197/jamia.M2716. Epub 2008 Jun 25.