• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于高斯核函数的差分隐私模糊 C-均值聚类算法。

Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function.

机构信息

School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, China.

出版信息

PLoS One. 2021 Mar 23;16(3):e0248737. doi: 10.1371/journal.pone.0248737. eCollection 2021.

DOI:10.1371/journal.pone.0248737
PMID:33755689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7987176/
Abstract

Fuzzy C-means clustering algorithm is one of the typical clustering algorithms in data mining applications. However, due to the sensitive information in the dataset, there is a risk of user privacy being leaked during the clustering process. The fuzzy C-means clustering of differential privacy protection can protect the user's individual privacy while mining data rules, however, the decline in availability caused by data disturbances is a common problem of these algorithms. Aiming at the problem that the algorithm accuracy is reduced by randomly initializing the membership matrix of fuzzy C-means, in this paper, the maximum distance method is firstly used to determine the initial center point. Then, the gaussian value of the cluster center point is used to calculate the privacy budget allocation ratio. Additionally, Laplace noise is added to complete differential privacy protection. The experimental results demonstrate that the clustering accuracy and effectiveness of the proposed algorithm are higher than baselines under the same privacy protection intensity.

摘要

模糊 C 均值聚类算法是数据挖掘应用中典型的聚类算法之一。然而,由于数据集包含敏感信息,在聚类过程中存在用户隐私泄露的风险。差分隐私保护的模糊 C 均值聚类可以在挖掘数据规则的同时保护用户的个人隐私,然而,数据干扰导致的可用性下降是这些算法的共同问题。针对模糊 C 均值算法中隶属度矩阵随机初始化导致算法精度降低的问题,本文首先采用最大距离法确定初始中心,然后利用聚类中心点的高斯值计算隐私预算分配比例,最后添加拉普拉斯噪声完成差分隐私保护。实验结果表明,在相同隐私保护强度下,所提算法的聚类精度和有效性均高于基线算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/981e15f5a8bc/pone.0248737.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/28f1abcc9c53/pone.0248737.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/05a8afa975ab/pone.0248737.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/d9c58ae1e736/pone.0248737.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/450d95151574/pone.0248737.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/c81116c14582/pone.0248737.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/8b9972d0332a/pone.0248737.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/5bc9a9fb0b99/pone.0248737.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/24940c813c65/pone.0248737.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/fe500d9537d2/pone.0248737.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/bbead4b6094f/pone.0248737.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/981e15f5a8bc/pone.0248737.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/28f1abcc9c53/pone.0248737.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/05a8afa975ab/pone.0248737.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/d9c58ae1e736/pone.0248737.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/450d95151574/pone.0248737.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/c81116c14582/pone.0248737.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/8b9972d0332a/pone.0248737.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/5bc9a9fb0b99/pone.0248737.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/24940c813c65/pone.0248737.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/fe500d9537d2/pone.0248737.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/bbead4b6094f/pone.0248737.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc1f/7987176/981e15f5a8bc/pone.0248737.g011.jpg

相似文献

1
Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function.基于高斯核函数的差分隐私模糊 C-均值聚类算法。
PLoS One. 2021 Mar 23;16(3):e0248737. doi: 10.1371/journal.pone.0248737. eCollection 2021.
2
A differential privacy protecting K-means clustering algorithm based on contour coefficients.基于轮廓系数的差分隐私保护 K-均值聚类算法。
PLoS One. 2018 Nov 21;13(11):e0206832. doi: 10.1371/journal.pone.0206832. eCollection 2018.
3
An improved fuzzy C-means clustering algorithm for assisted therapy of chronic bronchitis.一种用于慢性支气管炎辅助治疗的改进型模糊C均值聚类算法。
Technol Health Care. 2015;23(6):699-713. doi: 10.3233/THC-151023.
4
Differential privacy protection method based on published trajectory cross-correlation constraint.基于发布轨迹互相关约束的差分隐私保护方法。
PLoS One. 2020 Aug 12;15(8):e0237158. doi: 10.1371/journal.pone.0237158. eCollection 2020.
5
Protecting the Privacy of Cancer Patients Using Fuzzy Association Rule Hiding.使用模糊关联规则隐藏技术保护癌症患者的隐私
Asian Pac J Cancer Prev. 2019 May 25;20(5):1437-1443. doi: 10.31557/APJCP.2019.20.5.1437.
6
The density-based clustering method for privacy-preserving data mining.用于隐私保护数据挖掘的基于密度的聚类方法。
Math Biosci Eng. 2019 Feb 27;16(3):1718-1728. doi: 10.3934/mbe.2019082.
7
Spectral Clustering Algorithm Based on Improved Gaussian Kernel Function and Beetle Antennae Search with Damping Factor.基于改进高斯核函数和带阻尼因子的甲虫触角搜索的谱聚类算法
Comput Intell Neurosci. 2020 May 29;2020:1648573. doi: 10.1155/2020/1648573. eCollection 2020.
8
Research on differential privacy protection method based on user tendency.基于用户倾向的差分隐私保护方法研究。
PLoS One. 2023 Oct 26;18(10):e0288823. doi: 10.1371/journal.pone.0288823. eCollection 2023.
9
Robust kernelized local information fuzzy C-means clustering for brain magnetic resonance image segmentation.用于脑磁共振图像分割的鲁棒核化局部信息模糊C均值聚类
J Xray Sci Technol. 2016 Mar 17;24(3):489-507. doi: 10.3233/XST-160563.
10
An Adaptive Feature Selection Algorithm for Fuzzy Clustering Image Segmentation Based on Embedded Neighbourhood Information Constraints.一种基于嵌入邻域信息约束的模糊聚类图像分割自适应特征选择算法
Sensors (Basel). 2020 Jul 3;20(13):3722. doi: 10.3390/s20133722.

引用本文的文献

1
Proteomic Analysis of 442 Clinical Plasma Samples From Individuals With Symptom Records Revealed Subtypes of Convalescent Patients Who Had COVID-19.对442份有症状记录个体的临床血浆样本进行蛋白质组学分析,揭示了感染过新冠病毒的康复患者的亚型。
J Med Virol. 2025 Apr;97(4):e70203. doi: 10.1002/jmv.70203.
2
Pathogenicity and transcriptomic profiling reveals immunology molecular hallmarks after CA10 virus infection.CA10 病毒感染后的致病机制和转录组特征分析揭示了免疫学的分子特征。
Animal Model Exp Med. 2024 Oct;7(5):717-731. doi: 10.1002/ame2.12415. Epub 2024 May 15.

本文引用的文献

1
A differential privacy protecting K-means clustering algorithm based on contour coefficients.基于轮廓系数的差分隐私保护 K-均值聚类算法。
PLoS One. 2018 Nov 21;13(11):e0206832. doi: 10.1371/journal.pone.0206832. eCollection 2018.
2
Fuzzy C-means clustering with local information and kernel metric for image segmentation.基于局部信息和核度量的模糊 C-均值聚类图像分割。
IEEE Trans Image Process. 2013 Feb;22(2):573-84. doi: 10.1109/TIP.2012.2219547. Epub 2012 Sep 18.