STIF: Intuitionistic fuzzy Gaussian membership function with statistical transformation weight of evidence and information value for private information preservation.

Author information

Kumar G Sathish, Premalatha K

Affiliations

Department of Computer Science and Engineering, Sri Krishna College of Engineering and Technology, Coimbatore, Tamil Nadu, India.

Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Erode, Tamil Nadu, India.

Publication information

Distrib Parallel Databases. 2023 Apr 21:1-34. doi: 10.1007/s10619-023-07423-3.

Abstract

Sharing data with multiple organizations is essential for analysis in many situations. The shared data contain individuals' private and sensitive information, which can result in privacy breaches. To overcome these privacy challenges, privacy-preserving data mining (PPDM) has emerged as a solution. This work addresses the PPDM problem by proposing the statistical transformation with intuitionistic fuzzy (STIF) algorithm for data perturbation. The STIF algorithm combines two statistical methods, weight of evidence and information value, with an intuitionistic fuzzy Gaussian membership function. It is applied to three benchmark datasets: adult income, bank marketing and lung cancer. Decision tree, random forest, extreme gradient boosting and support vector machine classifiers are used for accuracy and performance analysis. The results show that the STIF algorithm achieves 99% accuracy on the adult income dataset and 100% accuracy on both the bank marketing and lung cancer datasets. Further, the results highlight that the STIF algorithm outperforms state-of-the-art algorithms in data perturbation capacity and privacy-preserving capacity, without any information loss on either numerical or categorical data.
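
The abstract names three building blocks (weight of evidence, information value and an intuitionistic fuzzy Gaussian membership function) but gives no formulas. The sketch below illustrates the commonly used textbook definitions of each; it is not the authors' STIF procedure. The function names, the `eps` smoothing term, the sign convention for weight of evidence (log of non-event share over event share) and the Sugeno-type complement with parameter `lam` are all assumptions made for illustration.

```python
import math
from collections import Counter


def woe_iv(categories, labels, eps=0.5):
    """Weight of evidence per category and total information value.

    labels are binary: 1 = event, 0 = non-event.
    eps is an assumed smoothing count that avoids log(0) for empty cells.
    """
    events = Counter(c for c, y in zip(categories, labels) if y == 1)
    non_events = Counter(c for c, y in zip(categories, labels) if y == 0)
    cats = set(categories)
    total_e = sum(events.values()) + eps * len(cats)
    total_n = sum(non_events.values()) + eps * len(cats)

    woe, iv = {}, 0.0
    for c in cats:
        p_event = (events[c] + eps) / total_e
        p_non_event = (non_events[c] + eps) / total_n
        # WoE: log ratio of the category's share of non-events to events
        woe[c] = math.log(p_non_event / p_event)
        # IV: share difference weighted by WoE, summed over categories
        iv += (p_non_event - p_event) * woe[c]
    return woe, iv


def gaussian_membership(x, center, sigma):
    """Gaussian membership degree of x, in (0, 1]."""
    return math.exp(-((x - center) ** 2) / (2 * sigma ** 2))


def intuitionistic_triplet(x, center, sigma, lam=1.0):
    """Membership, non-membership and hesitation degrees for x.

    Non-membership uses a Sugeno-type complement (an assumption here),
    which keeps mu + nu <= 1 for lam >= 0.
    """
    mu = gaussian_membership(x, center, sigma)
    nu = (1 - mu) / (1 + lam * mu)
    pi = 1 - mu - nu
    return mu, nu, pi
```

In a perturbation pipeline of this general kind, one plausible arrangement is to replace each categorical value with its weight-of-evidence score and then map numerical values to membership degrees, so that the released values no longer reveal the raw attributes. How STIF actually chains these steps is described in the full paper, not in the abstract.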

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70bf/10121075/41c63a274576/10619_2023_7423_Fig1_HTML.jpg

Cited by

1. Enhancing privacy protection of physical examination data through synthetic algorithms based on differential privacy. BMC Med Inform Decis Mak. 2025 Sep 1;25(1):324. doi: 10.1186/s12911-025-03109-1.
2. Anomaly-based threat detection in smart health using machine learning. BMC Med Inform Decis Mak. 2024 Nov 19;24(1):347. doi: 10.1186/s12911-024-02760-4.
3. The global research of artificial intelligence in lung cancer: a 20-year bibliometric analysis. Front Oncol. 2024 Feb 2;14:1346010. doi: 10.3389/fonc.2024.1346010. eCollection 2024.
