• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于混沌与扰动技术的高效大数据匿名化算法。

An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques.

作者信息

Eyupoglu Can, Aydin Muhammed Ali, Zaim Abdul Halim, Sertbas Ahmet

机构信息

Department of Computer Engineering, Istanbul Commerce University, Istanbul 34840, Turkey.

Department of Computer Engineering, Istanbul University, Istanbul 34320, Turkey.

出版信息

Entropy (Basel). 2018 May 17;20(5):373. doi: 10.3390/e20050373.

DOI:10.3390/e20050373
PMID:33265463
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7512893/
Abstract

The topic of big data has attracted increasing interest in recent years. The emergence of big data leads to new difficulties in terms of protection models used for data privacy, which is of necessity for sharing and processing data. Protecting individuals' sensitive information while maintaining the usability of the data set published is the most important challenge in privacy preserving. In this regard, data anonymization methods are utilized in order to protect data against identity disclosure and linking attacks. In this study, a novel data anonymization algorithm based on chaos and perturbation has been proposed for privacy and utility preserving in big data. The performance of the proposed algorithm is evaluated in terms of Kullback-Leibler divergence, probabilistic anonymity, classification accuracy, F-measure and execution time. The experimental results have shown that the proposed algorithm is efficient and performs better in terms of Kullback-Leibler divergence, classification accuracy and F-measure compared to most of the existing algorithms using the same data set. Resulting from applying chaos to perturb data, such successful algorithm is promising to be used in privacy preserving data mining and data publishing.

摘要

近年来,大数据主题已引起越来越多的关注。大数据的出现给用于数据隐私保护的模型带来了新的难题,而数据隐私保护对于数据共享和处理来说是必不可少的。在隐私保护中,最重要的挑战是在保持所发布数据集可用性的同时保护个人敏感信息。在这方面,数据匿名化方法被用于保护数据免遭身份泄露和链接攻击。在本研究中,提出了一种基于混沌和扰动的新型数据匿名化算法,用于大数据中的隐私保护和实用性维护。从库尔贝克-莱布勒散度、概率匿名性、分类准确率、F值和执行时间等方面对所提算法的性能进行了评估。实验结果表明,与使用相同数据集的大多数现有算法相比,所提算法是高效的,并且在库尔贝克-莱布勒散度、分类准确率和F值方面表现更好。由于应用混沌来扰动数据,这种成功的算法有望用于隐私保护数据挖掘和数据发布。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/6efdf9887c2d/entropy-20-00373-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/0169a87ddd35/entropy-20-00373-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/3d05d0742e59/entropy-20-00373-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/a4f94f10fc4c/entropy-20-00373-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/dd34f622c1e3/entropy-20-00373-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/73eb6aa6cbbd/entropy-20-00373-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/82340ce9062c/entropy-20-00373-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/6efdf9887c2d/entropy-20-00373-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/0169a87ddd35/entropy-20-00373-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/3d05d0742e59/entropy-20-00373-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/a4f94f10fc4c/entropy-20-00373-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/dd34f622c1e3/entropy-20-00373-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/73eb6aa6cbbd/entropy-20-00373-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/82340ce9062c/entropy-20-00373-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b417/7512893/6efdf9887c2d/entropy-20-00373-g007.jpg

相似文献

1
An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques.一种基于混沌与扰动技术的高效大数据匿名化算法。
Entropy (Basel). 2018 May 17;20(5):373. doi: 10.3390/e20050373.
2
Privacy-Preserving Anonymity for Periodical Releases of Spontaneous Adverse Drug Event Reporting Data: Algorithm Development and Validation.自发不良药物事件报告数据定期发布的隐私保护匿名性:算法开发与验证
JMIR Med Inform. 2021 Oct 28;9(10):e28752. doi: 10.2196/28752.
3
Utility-preserving anonymization for health data publishing.用于健康数据发布的效用保持匿名化
BMC Med Inform Decis Mak. 2017 Jul 11;17(1):104. doi: 10.1186/s12911-017-0499-0.
4
Privacy preserving data anonymization of spontaneous ADE reporting system dataset.自发不良药物事件报告系统数据集的隐私保护数据匿名化
BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):58. doi: 10.1186/s12911-016-0293-4.
5
Designing a Novel Approach Using a Greedy and Information-Theoretic Clustering-Based Algorithm for Anonymizing Microdata Sets.设计一种基于贪心和信息论聚类算法的新颖方法,用于对微数据集进行匿名化处理。
Entropy (Basel). 2023 Dec 1;25(12):1613. doi: 10.3390/e25121613.
6
Digression and Value Concatenation to Enable Privacy-Preserving Regression.用于实现隐私保护回归的离题与值串联
MIS Q. 2014 Sep;38(3):679-698. doi: 10.25300/misq/2014/38.3.03.
7
Privacy-preserving data cube for electronic medical records: An experimental evaluation.用于电子病历的隐私保护数据立方体:实验评估
Int J Med Inform. 2017 Jan;97:33-42. doi: 10.1016/j.ijmedinf.2016.09.008. Epub 2016 Sep 24.
8
Efficient Protection of Health Data from Sensitive Attribute Disclosure.有效保护健康数据免遭敏感属性泄露。
Stud Health Technol Inform. 2020 Jun 16;270:193-197. doi: 10.3233/SHTI200149.
9
Utility-Preserving Anonymization in a Real-World Scenario: Evidence from the German Chronic Kidney Disease (GCKD) Study.实用匿名化在真实场景中的应用:来自德国慢性肾脏病(GCKD)研究的证据。
Stud Health Technol Inform. 2023 May 18;302:28-32. doi: 10.3233/SHTI230058.
10
An anonymization-based privacy-preserving data collection protocol for digital health data.基于匿名化的数字健康数据隐私保护数据收集协议。
Front Public Health. 2023 Mar 3;11:1125011. doi: 10.3389/fpubh.2023.1125011. eCollection 2023.

引用本文的文献

1
Novel CAD Diagnosis Method Based on Search, PCA, and AdaBoostM1 Techniques.基于搜索、主成分分析和自适应增强M1技术的新型冠心病诊断方法
J Clin Med. 2024 May 13;13(10):2868. doi: 10.3390/jcm13102868.
2
Designing a Novel Approach Using a Greedy and Information-Theoretic Clustering-Based Algorithm for Anonymizing Microdata Sets.设计一种基于贪心和信息论聚类算法的新颖方法,用于对微数据集进行匿名化处理。
Entropy (Basel). 2023 Dec 1;25(12):1613. doi: 10.3390/e25121613.
3
Recent Developments in Privacy-Preserving Mining of Clinical Data.临床数据隐私保护挖掘的最新进展

本文引用的文献

1
Big data: survey, technologies, opportunities, and challenges.大数据:调查、技术、机遇与挑战。
ScientificWorldJournal. 2014;2014:712826. doi: 10.1155/2014/712826. Epub 2014 Jul 17.
2
Guaranteeing anonymity when sharing medical data, the Datafly System.Datafly系统在共享医疗数据时保证匿名性。
Proc AMIA Annu Fall Symp. 1997:51-5.
ACM IMS Trans Data Sci. 2021 Nov;2(4). doi: 10.1145/3447774.
4
A Note on the Reproducibility of Chaos Simulation.关于混沌模拟可重复性的一则注释
Entropy (Basel). 2020 Aug 29;22(9):953. doi: 10.3390/e22090953.