• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
The Impact of Medical Big Data Anonymization on Early Acute Kidney Injury Risk Prediction.医学大数据匿名化对早期急性肾损伤风险预测的影响
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:617-625. eCollection 2020.
2
The Costs of Anonymization: Case Study Using Clinical Data.匿名化的成本:使用临床数据的案例研究
J Med Internet Res. 2024 Apr 24;26:e49445. doi: 10.2196/49445.
3
Privacy-preserving data cube for electronic medical records: An experimental evaluation.用于电子病历的隐私保护数据立方体:实验评估
Int J Med Inform. 2017 Jan;97:33-42. doi: 10.1016/j.ijmedinf.2016.09.008. Epub 2016 Sep 24.
4
Utility-preserving anonymization for health data publishing.用于健康数据发布的效用保持匿名化
BMC Med Inform Decis Mak. 2017 Jul 11;17(1):104. doi: 10.1186/s12911-017-0499-0.
5
Data Anonymization for Pervasive Health Care: Systematic Literature Mapping Study.普及医疗保健中的数据匿名化:系统文献映射研究
JMIR Med Inform. 2021 Oct 15;9(10):e29871. doi: 10.2196/29871.
6
Privacy of Study Participants in Open-access Health and Demographic Surveillance System Data: Requirements Analysis for Data Anonymization.开放获取健康和人口监测系统数据中研究参与者的隐私:数据匿名化的需求分析。
JMIR Public Health Surveill. 2022 Sep 2;8(9):e34472. doi: 10.2196/34472.
7
Privacy-enhancing ETL-processes for biomedical data.用于生物医学数据的隐私增强型 ETL 流程。
Int J Med Inform. 2019 Jun;126:72-81. doi: 10.1016/j.ijmedinf.2019.03.006. Epub 2019 Mar 23.
8
An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques.一种基于混沌与扰动技术的高效大数据匿名化算法。
Entropy (Basel). 2018 May 17;20(5):373. doi: 10.3390/e20050373.
9
Privacy preserving data anonymization of spontaneous ADE reporting system dataset.自发不良药物事件报告系统数据集的隐私保护数据匿名化
BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):58. doi: 10.1186/s12911-016-0293-4.
10
Utility-Preserving Anonymization in a Real-World Scenario: Evidence from the German Chronic Kidney Disease (GCKD) Study.实用匿名化在真实场景中的应用:来自德国慢性肾脏病(GCKD)研究的证据。
Stud Health Technol Inform. 2023 May 18;302:28-32. doi: 10.3233/SHTI230058.

引用本文的文献

1
The Costs of Anonymization: Case Study Using Clinical Data.匿名化的成本:使用临床数据的案例研究
J Med Internet Res. 2024 Apr 24;26:e49445. doi: 10.2196/49445.
2
Privacy of Study Participants in Open-access Health and Demographic Surveillance System Data: Requirements Analysis for Data Anonymization.开放获取健康和人口监测系统数据中研究参与者的隐私:数据匿名化的需求分析。
JMIR Public Health Surveill. 2022 Sep 2;8(9):e34472. doi: 10.2196/34472.
3
Planning for monitoring the introduction and effectiveness of new vaccines using real-word data and geospatial visualization: An example using rotavirus vaccines with potential application to SARS-CoV-2.利用真实世界数据和地理空间可视化技术监测新型疫苗引入情况及效果的规划:以轮状病毒疫苗为例及其在SARS-CoV-2中的潜在应用
Vaccine X. 2021 Apr;7:100084. doi: 10.1016/j.jvacx.2021.100084. Epub 2021 Jan 9.

本文引用的文献

1
Privacy-Preserving Generative Deep Neural Networks Support Clinical Data Sharing.隐私保护生成式深度神经网络支持临床数据共享。
Circ Cardiovasc Qual Outcomes. 2019 Jul;12(7):e005122. doi: 10.1161/CIRCOUTCOMES.118.005122. Epub 2019 Jul 9.
2
Multi-perspective predictive modeling for acute kidney injury in general hospital populations using electronic medical records.利用电子病历对综合医院人群急性肾损伤进行多视角预测建模。
JAMIA Open. 2019 Apr;2(1):115-122. doi: 10.1093/jamiaopen/ooy043. Epub 2018 Nov 15.
3
An Open Source Tool for Game Theoretic Health Data De-Identification.一种用于博弈论健康数据去识别的开源工具。
AMIA Annu Symp Proc. 2018 Apr 16;2017:1430-1439. eCollection 2017.
4
Predicting Inpatient Acute Kidney Injury over Different Time Horizons: How Early and Accurate?预测不同时间范围内的住院患者急性肾损伤:多早且多准确?
AMIA Annu Symp Proc. 2018 Apr 16;2017:565-574. eCollection 2017.
5
The Development of a Machine Learning Inpatient Acute Kidney Injury Prediction Model.机器学习在住院患者急性肾损伤预测模型中的应用
Crit Care Med. 2018 Jul;46(7):1070-1077. doi: 10.1097/CCM.0000000000003123.
6
Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes.使用来自多民族糖尿病患者群体的视网膜图像开发并验证用于糖尿病视网膜病变及相关眼病的深度学习系统
JAMA. 2017 Dec 12;318(22):2211-2223. doi: 10.1001/jama.2017.18152.
7
Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer.用于检测乳腺癌女性患者淋巴结转移的深度学习算法的诊断评估
JAMA. 2017 Dec 12;318(22):2199-2210. doi: 10.1001/jama.2017.14585.
8
Dermatologist-level classification of skin cancer with deep neural networks.基于深度神经网络的皮肤癌皮肤科医生级分类。
Nature. 2017 Feb 2;542(7639):115-118. doi: 10.1038/nature21056. Epub 2017 Jan 25.
9
Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach.在促进隐私保护的同时扩大对大规模基因组数据的访问:一种博弈论方法。
Am J Hum Genet. 2017 Feb 2;100(2):316-322. doi: 10.1016/j.ajhg.2016.12.002. Epub 2017 Jan 5.
10
Real-World Evidence - What Is It and What Can It Tell Us?真实世界证据——它是什么以及能告诉我们什么?
N Engl J Med. 2016 Dec 8;375(23):2293-2297. doi: 10.1056/NEJMsb1609216.

医学大数据匿名化对早期急性肾损伤风险预测的影响

The Impact of Medical Big Data Anonymization on Early Acute Kidney Injury Risk Prediction.

作者信息

Song Xing, Waitman Lemuel R, Hu Yong, Luo Bo, Li Fengjun, Liu Mei

机构信息

University of Kansas Medical Center, Department of Internal Medicine, Division of Medical Informatics, Kansas City, KS, USA.

Jinan University, Big Data Decision Institute, Guangzhou, PRC.

出版信息

AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:617-625. eCollection 2020.

PMID:32477684
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7233037/
Abstract

Artificial intelligence enabled medical big data analysis has the potential to revolutionize medical practice from diagnosis and prediction of complex diseases to making recommendations and resource allocation decisions in an evidence-based manner. However, big data comes with big disclosure risks. To preserve privacy, excessive data anonymization is often necessary, leading to significant loss of data utility. In this paper, we develop a systematic data scrubbing procedure for large datasets when key variables are uncertain for re-identification risk assessment and assess the trade-off between anonymization of electronic health record data for sharing in support of open science and performance of machine learning models for early acute kidney injury risk prediction using the data. Results demonstrate that our proposed data scrubbing procedure can maintain good feature diversity and moderate data utility but raises concerns regarding its impact on knowledge discovery capability.

摘要

人工智能驱动的医学大数据分析有潜力彻底改变医疗实践,从复杂疾病的诊断和预测到以循证方式做出推荐和资源分配决策。然而,大数据伴随着巨大的披露风险。为了保护隐私,往往需要进行过度的数据匿名化处理,这会导致数据效用的显著损失。在本文中,当关键变量对于重新识别风险评估不确定时,我们为大型数据集开发了一种系统的数据清理程序,并评估了用于支持开放科学而共享的电子健康记录数据匿名化与使用这些数据进行早期急性肾损伤风险预测的机器学习模型性能之间的权衡。结果表明,我们提出的数据清理程序可以保持良好的特征多样性和适度的数据效用,但引发了对其对知识发现能力影响的担忧。