基于轮廓系数的差分隐私保护 K-均值聚类算法。

A differential privacy protecting K-means clustering algorithm based on contour coefficients.

机构信息

School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, ShaanXi, China.

School of Science, Xi'an University of Technology, Xi'an, ShaanXi, China.

出版信息

PLoS One. 2018 Nov 21;13(11):e0206832. doi: 10.1371/journal.pone.0206832. eCollection 2018.

DOI:10.1371/journal.pone.0206832

PMID:30462662

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6248925/

Abstract

This paper, based on differential privacy protecting K-means clustering algorithm, realizes privacy protection by adding data-disturbing Laplace noise to cluster center point. In order to solve the problem of Laplace noise randomness which causes the center point to deviate, especially when poor availability of clustering results appears because of small privacy budget parameters, an improved differential privacy protecting K-means clustering algorithm was raised in this paper. The improved algorithm uses the contour coefficients to quantitatively evaluate the clustering effect of each iteration and add different noise to different clusters. In order to be adapted to the huge number of data, this paper provides an algorithm design in MapReduce Framework. Experimental finding shows that the new algorithm improves the availability of the algorithm clustering results under the condition of ensuring individual privacy without significantly increasing its operating time.

摘要

本文基于差分隐私保护 K-means 聚类算法，通过向聚类中心点添加数据干扰拉普拉斯噪声来实现隐私保护。为了解决拉普拉斯噪声随机性导致中心点偏离的问题，特别是当由于隐私预算参数较小而导致聚类结果可用性较差时，本文提出了一种改进的差分隐私保护 K-means 聚类算法。改进的算法使用轮廓系数来定量评估每次迭代的聚类效果，并对不同的聚类添加不同的噪声。为了适应大量的数据，本文在 MapReduce 框架中提供了一种算法设计。实验结果表明，新算法在不显著增加运行时间的情况下，提高了算法聚类结果的可用性，同时确保了个体隐私。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8efa/6248925/b46cf5008eb7/pone.0206832.g001.jpg

相似文献

A differential privacy protecting K-means clustering algorithm based on contour coefficients.

PLoS One. 2018 Nov 21;13(11):e0206832. doi: 10.1371/journal.pone.0206832. eCollection 2018.

Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function.

PLoS One. 2021 Mar 23;16(3):e0248737. doi: 10.1371/journal.pone.0248737. eCollection 2021.

An Efficient Differential Privacy-Based Method for Location Privacy Protection in Location-Based Services.

Sensors (Basel). 2023 May 31;23(11):5219. doi: 10.3390/s23115219.

The density-based clustering method for privacy-preserving data mining.

Math Biosci Eng. 2019 Feb 27;16(3):1718-1728. doi: 10.3934/mbe.2019082.

Solving text clustering problem using a memetic differential evolution algorithm.

PLoS One. 2020 Jun 11;15(6):e0232816. doi: 10.1371/journal.pone.0232816. eCollection 2020.

An improved fuzzy C-means clustering algorithm for assisted therapy of chronic bronchitis.

Technol Health Care. 2015;23(6):699-713. doi: 10.3233/THC-151023.

An incremental clustering method based on the boundary profile.

PLoS One. 2018 Apr 20;13(4):e0196108. doi: 10.1371/journal.pone.0196108. eCollection 2018.

A hybrid monkey search algorithm for clustering analysis.

ScientificWorldJournal. 2014 Mar 4;2014:938239. doi: 10.1155/2014/938239. eCollection 2014.

Research on Data Analysis of Traditional Chinese Medicine with Improved Differential Evolution Clustering Algorithm.

J Healthc Eng. 2021 Sep 4;2021:4468741. doi: 10.1155/2021/4468741. eCollection 2021.

A clustering-based differential privacy protection algorithm for weighted social networks.

Math Biosci Eng. 2024 Feb 18;21(3):3755-3773. doi: 10.3934/mbe.2024166.

引用本文的文献

Identification and validation of hub m7G-related genes and infiltrating immune cells in osteoarthritis based on integrated computational and bioinformatics analysis.

BMC Musculoskelet Disord. 2025 Apr 4;26(1):333. doi: 10.1186/s12891-025-08539-6.

Integrating single-cell sequencing and machine learning to uncover the role of mitophagy in subtyping and prognosis of esophageal cancer.

Apoptosis. 2025 Apr;30(3-4):1021-1041. doi: 10.1007/s10495-024-02061-1. Epub 2025 Feb 13.

Gluteus medius muscle activation patterns during gait with Cerebral Palsy (CP): A hierarchical clustering analysis.

PLoS One. 2025 Jan 9;20(1):e0309582. doi: 10.1371/journal.pone.0309582. eCollection 2025.

Integrated analysis of single-cell RNA-seq, bulk RNA-seq, Mendelian randomization, and eQTL reveals T cell-related nomogram model and subtype classification in rheumatoid arthritis.

Front Immunol. 2024 Jun 19;15:1399856. doi: 10.3389/fimmu.2024.1399856. eCollection 2024.

Significance of neutrophil extracellular traps-related gene in the diagnosis and classification of atherosclerosis.

Apoptosis. 2024 Jun;29(5-6):605-619. doi: 10.1007/s10495-023-01923-4. Epub 2024 Feb 17.

Identification of cuproptosis-related subtypes, characterization of immune microenvironment infiltration, and development of a prognosis model for osteoarthritis.

Front Immunol. 2023 Sep 21;14:1178794. doi: 10.3389/fimmu.2023.1178794. eCollection 2023.

2D nanomaterial sensing array using machine learning for differential profiling of pathogenic microbial taxonomic identification.

Mikrochim Acta. 2022 Jul 6;189(8):273. doi: 10.1007/s00604-022-05368-5.

Effect of Minimally Invasive Puncture Drainage and Conservative Treatment on Prognosis of Patients with Cerebral Hemorrhage.

J Healthc Eng. 2021 Dec 23;2021:2401256. doi: 10.1155/2021/2401256. eCollection 2021.

Differential privacy fuzzy C-means clustering algorithm based on gaussian kernel function.

PLoS One. 2021 Mar 23;16(3):e0248737. doi: 10.1371/journal.pone.0248737. eCollection 2021.

Unsupervised color image segmentation: A case of RGB histogram based K-means clustering initialization.

PLoS One. 2020 Oct 22;15(10):e0240015. doi: 10.1371/journal.pone.0240015. eCollection 2020.

本文引用的文献

Collaborative fuzzy clustering from multiple weighted views.

IEEE Trans Cybern. 2015 Apr;45(4):688-701. doi: 10.1109/TCYB.2014.2334595. Epub 2014 Jul 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于轮廓系数的差分隐私保护 K-均值聚类算法。

A differential privacy protecting K-means clustering algorithm based on contour coefficients.

机构信息

School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, ShaanXi, China.

School of Science, Xi'an University of Technology, Xi'an, ShaanXi, China.

出版信息

PLoS One. 2018 Nov 21;13(11):e0206832. doi: 10.1371/journal.pone.0206832. eCollection 2018.

DOI:10.1371/journal.pone.0206832

PMID:30462662

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6248925/

Abstract

摘要

基于轮廓系数的差分隐私保护 K-均值聚类算法。

A differential privacy protecting K-means clustering algorithm based on contour coefficients.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于轮廓系数的差分隐私保护 K-均值聚类算法。

A differential privacy protecting K-means clustering algorithm based on contour coefficients.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献