• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

中国患者数据隐私风险评估:案例研究

Evaluation of Privacy Risks of Patients' Data in China: Case Study.

作者信息

Gong Mengchun, Wang Shuang, Wang Lezi, Liu Chao, Wang Jianyang, Guo Qiang, Zheng Hao, Xie Kang, Wang Chenghong, Hui Zhouguang

机构信息

Digital China Health Technologies Corporation Limited, Beijing, China.

Shanghai Putuo People's Hospital, Tongji University, Shanghai, China.

出版信息

JMIR Med Inform. 2020 Feb 5;8(2):e13046. doi: 10.2196/13046.

DOI:10.2196/13046
PMID:32022691
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7055805/
Abstract

BACKGROUND

Patient privacy is a ubiquitous problem around the world. Many existing studies have demonstrated the potential privacy risks associated with sharing of biomedical data. Owing to the increasing need for data sharing and analysis, health care data privacy is drawing more attention. However, to better protect biomedical data privacy, it is essential to assess the privacy risk in the first place.

OBJECTIVE

In China, there is no clear regulation for health systems to deidentify data. It is also not known whether a mechanism such as the Health Insurance Portability and Accountability Act (HIPAA) safe harbor policy will achieve sufficient protection. This study aimed to conduct a pilot study using patient data from Chinese hospitals to understand and quantify the privacy risks of Chinese patients.

METHODS

We used g-distinct analysis to evaluate the reidentification risks with regard to the HIPAA safe harbor approach when applied to Chinese patients' data. More specifically, we estimated the risks based on the HIPAA safe harbor and limited dataset policies by assuming an attacker has background knowledge of the patient from the public domain.

RESULTS

The experiments were conducted on 0.83 million patients (with data field of date of birth, gender, and surrogate ZIP codes generated based on home address) across 33 provincial-level administrative divisions in China. Under the Limited Dataset policy, 19.58% (163,262/833,235) of the population could be uniquely identifiable under the g-distinct metric (ie, 1-distinct). In contrast, the Safe Harbor policy is able to significantly reduce privacy risk, where only 0.072% (601/833,235) of individuals are uniquely identifiable, and the majority of the population is 3000 indistinguishable (ie the population is expected to share common attributes with 3000 or less people).

CONCLUSIONS

Through the experiments based on real-world patient data, this work illustrates that the results of g-distinct analysis about Chinese patient privacy risk are similar to those from a previous US study, in which data from different organizations/regions might be vulnerable to different reidentification risks under different policies. This work provides reference to Chinese health care entities for estimating patients' privacy risk during data sharing, which laid the foundation of privacy risk study about Chinese patients' data in the future.

摘要

背景

患者隐私是全球普遍存在的问题。许多现有研究已证明与生物医学数据共享相关的潜在隐私风险。由于数据共享和分析的需求不断增加,医疗保健数据隐私正受到更多关注。然而,为了更好地保护生物医学数据隐私,首先评估隐私风险至关重要。

目的

在中国,卫生系统对数据去标识化没有明确规定。也不清楚诸如《健康保险流通与责任法案》(HIPAA)安全港政策之类的机制是否能提供充分保护。本研究旨在利用中国医院患者数据进行一项试点研究,以了解并量化中国患者的隐私风险。

方法

我们使用g-独特性分析来评估HIPAA安全港方法应用于中国患者数据时的重新识别风险。更具体地说,我们通过假设攻击者具有来自公共领域的患者背景知识,基于HIPAA安全港和有限数据集政策来估计风险。

结果

在中国33个省级行政区对83万患者(数据字段包括出生日期、性别以及根据家庭住址生成的替代邮政编码)进行了实验。在有限数据集政策下,根据g-独特性度量(即1-独特性),19.58%(163,262/833,235)的人群可被唯一识别。相比之下,安全港政策能够显著降低隐私风险,其中只有0.072%(601/833,235)的个体可被唯一识别,并且大多数人群是3000不可区分的(即预计该人群与3000人或更少的人具有共同属性)。

结论

通过基于真实世界患者数据的实验,本研究表明关于中国患者隐私风险的g-独特性分析结果与先前美国的一项研究相似,在该研究中,不同组织/地区的数据在不同政策下可能面临不同的重新识别风险。本研究为中国医疗保健实体在数据共享期间估计患者隐私风险提供了参考,为未来中国患者数据的隐私风险研究奠定了基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27c5/7055805/e466330c0926/medinform_v8i2e13046_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27c5/7055805/09a2990a5a67/medinform_v8i2e13046_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27c5/7055805/aac94607622b/medinform_v8i2e13046_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27c5/7055805/e466330c0926/medinform_v8i2e13046_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27c5/7055805/09a2990a5a67/medinform_v8i2e13046_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27c5/7055805/aac94607622b/medinform_v8i2e13046_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27c5/7055805/e466330c0926/medinform_v8i2e13046_fig3.jpg

相似文献

1
Evaluation of Privacy Risks of Patients' Data in China: Case Study.中国患者数据隐私风险评估:案例研究
JMIR Med Inform. 2020 Feb 5;8(2):e13046. doi: 10.2196/13046.
2
Evaluating re-identification risks with respect to the HIPAA privacy rule.评估 HIPAA 隐私规则下的重新识别风险。
J Am Med Inform Assoc. 2010 Mar-Apr;17(2):169-77. doi: 10.1136/jamia.2009.000026.
3
Never too old for anonymity: a statistical standard for demographic data sharing via the HIPAA Privacy Rule.永远不要因为年龄而放弃匿名:通过 HIPAA 隐私规则共享人口统计数据的统计标准。
J Am Med Inform Assoc. 2011 Jan-Feb;18(1):3-10. doi: 10.1136/jamia.2010.004622.
4
Twenty Years of the Health Insurance Portability and Accountability Act Safe Harbor Provision: Unsolved Challenges and Ways Forward.《医疗保险可携性与责任法案》安全港条款二十年:未解挑战与前行之路
JMIR Med Inform. 2022 Aug 3;10(8):e37756. doi: 10.2196/37756.
5
Review of HIPAA, Part 1: History, Protected Health Information, and Privacy and Security Rules.《健康保险流通与责任法案》(HIPAA)综述,第1部分:历史、受保护的健康信息以及隐私和安全规则。
J Nucl Med Technol. 2019 Dec;47(4):269-272. doi: 10.2967/jnmt.119.227819. Epub 2019 Jun 10.
6
Dynamically adjusting case reporting policy to maximize privacy and public health utility in the face of a pandemic.在面对大流行时,动态调整病例报告政策以最大化隐私和公共卫生效益。
J Am Med Inform Assoc. 2022 Apr 13;29(5):853-863. doi: 10.1093/jamia/ocac011.
7
Evaluation of the Privacy Risks of Personal Health Identifiers and Quasi-Identifiers in a Distributed Research Network: Development and Validation Study.分布式研究网络中个人健康标识符和准标识符的隐私风险评估:开发与验证研究
JMIR Med Inform. 2021 May 31;9(5):e24940. doi: 10.2196/24940.
8
The Role of HIPAA Omnibus Rules in Reducing the Frequency of Medical Data Breaches: Insights From an Empirical Study.HIPAA 综合规则在降低医疗数据泄露频率中的作用:来自实证研究的见解。
Milbank Q. 2018 Mar;96(1):144-166. doi: 10.1111/1468-0009.12314.
9
Participation in patient support forums may put rare disease patient data at risk of re-identification.参与患者支持论坛可能会使罕见病患者的数据面临重新识别的风险。
Orphanet J Rare Dis. 2020 Aug 31;15(1):226. doi: 10.1186/s13023-020-01497-3.
10
Biobanking Research and Privacy Laws in the United States.美国的生物样本库研究与隐私法
J Law Med Ethics. 2016 Mar;44(1):106-27. doi: 10.1177/1073110516644203.

引用本文的文献

1
Exploring the emerging trends and hot topics of 5G technology application in wireless medicine: A bibliometric and visualization analysis.探索5G技术在无线医学领域应用的新兴趋势与热点话题:文献计量与可视化分析
Medicine (Baltimore). 2025 Jul 18;104(29):e43310. doi: 10.1097/MD.0000000000043310.
2
The association between clinicians' legal literacy and the service quality of primary healthcare - evidence from the Greater Bay Area study, China.临床医生的法律素养与基层医疗服务质量的关联——来自中国大湾区研究的证据
BMC Med Educ. 2025 May 9;25(1):681. doi: 10.1186/s12909-025-07243-2.
3
Advancing digital health in China: Aligning challenges, opportunities, and solutions with the Global Initiative on Digital Health (GIDH).

本文引用的文献

1
Privacy-preserving biomedical data dissemination via a hybrid approach.通过混合方法实现的隐私保护生物医学数据传播
AMIA Annu Symp Proc. 2018 Dec 5;2018:1176-1185. eCollection 2018.
2
iDASH secure genome analysis competition 2017.2017年iDASH安全基因组分析竞赛
BMC Med Genomics. 2018 Oct 11;11(Suppl 4):85. doi: 10.1186/s12920-018-0396-0.
3
Privacy of patient data in quality-of-care registries in cardiology and cardiothoracic surgery: the impact of the new general data protection regulation EU-law.心脏病学和心胸外科学术质量注册中的患者数据隐私:新的一般数据保护条例(欧盟法律)的影响。
推动中国数字健康发展:使挑战、机遇与解决方案与全球数字健康倡议(GIDH)保持一致。
Health Care Sci. 2024 Oct 17;3(5):365-369. doi: 10.1002/hcs2.118. eCollection 2024 Oct.
4
Generating fit-for-purpose real-world evidence in Asia: How far are we from closing the gaps?在亚洲生成适用的真实世界证据:我们距离弥合差距还有多远?
Perspect Clin Res. 2023 Jul-Sep;14(3):108-113. doi: 10.4103/picr.picr_193_22. Epub 2023 Jun 5.
5
Challenges and recommendations for high quality research using electronic health records.使用电子健康记录进行高质量研究的挑战与建议。
Front Digit Health. 2022 Aug 19;4:940330. doi: 10.3389/fdgth.2022.940330. eCollection 2022.
6
The Current Landscape and Emerging Applications for Real-World Data in Diagnostics and Clinical Decision Support and its Impact on Regulatory Decision Making.真实世界数据在诊断和临床决策支持中的当前状况与新兴应用及其对监管决策的影响
Clin Pharmacol Ther. 2022 Dec;112(6):1172-1182. doi: 10.1002/cpt.2565. Epub 2022 Apr 29.
7
Evaluation of the Privacy Risks of Personal Health Identifiers and Quasi-Identifiers in a Distributed Research Network: Development and Validation Study.分布式研究网络中个人健康标识符和准标识符的隐私风险评估:开发与验证研究
JMIR Med Inform. 2021 May 31;9(5):e24940. doi: 10.2196/24940.
8
Cloud-Based System for Effective Surveillance and Control of COVID-19: Useful Experiences From Hubei, China.基于云的新型冠状病毒肺炎有效监测与控制系统:来自中国湖北的有益经验
J Med Internet Res. 2020 Apr 22;22(4):e18948. doi: 10.2196/18948.
Eur Heart J Qual Care Clin Outcomes. 2018 Oct 1;4(4):239-245. doi: 10.1093/ehjqcco/qcy034.
4
SecureLR: Secure Logistic Regression Model via a Hybrid Cryptographic Protocol.SecureLR:通过混合加密协议实现安全逻辑回归模型。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jan-Feb;16(1):113-123. doi: 10.1109/TCBB.2018.2833463. Epub 2018 May 7.
5
SCOTCH: Secure Counting Of encrypTed genomiC data using a Hybrid approach.SCOTCH:使用混合方法对加密基因组数据进行安全计数
AMIA Annu Symp Proc. 2018 Apr 16;2017:1744-1753. eCollection 2017.
6
PRESAGE: PRivacy-preserving gEnetic testing via SoftwAre Guard Extension.PRESAGE:通过软件防护扩展实现隐私保护的基因检测
BMC Med Genomics. 2017 Jul 26;10(Suppl 2):48. doi: 10.1186/s12920-017-0281-2.
7
PRINCESS: Privacy-protecting Rare disease International Network Collaboration via Encryption through Software guard extensionS.公主:通过软件保护扩展进行加密的隐私保护罕见病国际网络协作。
Bioinformatics. 2017 Mar 15;33(6):871-878. doi: 10.1093/bioinformatics/btw758.
8
Use of personal information in medical research in Japan.日本医学研究中个人信息的使用。
Lancet. 2016 Oct 22;388(10055):1981-1982. doi: 10.1016/S0140-6736(16)31867-0.
9
Protecting patient privacy when sharing patient-level data from clinical trials.在共享临床试验中患者层面的数据时保护患者隐私。
BMC Med Res Methodol. 2016 Jul 8;16 Suppl 1(Suppl 1):77. doi: 10.1186/s12874-016-0169-4.
10
The risk of re-identification versus the need to identify individuals in rare disease research.罕见病研究中再识别风险与识别个体需求的权衡
Eur J Hum Genet. 2016 Nov;24(11):1553-1558. doi: 10.1038/ejhg.2016.52. Epub 2016 May 25.