A Novel Clustering Method Based on Adjacent Grids Searching.

Author information

Li Zhimeng, Zhong Wen, Liao Weiwen, Zhao Jian, Yu Ming, He Gaiyun

Affiliations

School of Control and Mechanical Engineering, Tianjin Chengjian University, Tianjin 300384, China.

School of Computer and Information Engineering, Tianjin Chengjian University, Tianjin 300384, China.

Publication information

Entropy (Basel). 2023 Sep 15;25(9):1342. doi: 10.3390/e25091342.

DOI:10.3390/e25091342
PMID:37761640
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10528124/
Abstract

Clustering analyzes the intrinsic structure of a dataset based on the similarity of its data points. Its widespread use, from image segmentation to object recognition and information retrieval, demands a highly robust clustering process. This paper proposes a novel clustering method based on adjacent grid searching (CAGS). CAGS consists of two steps: adaptive grid-space construction, followed by clustering through adjacent grid searching. In the first step, a multidimensional grid space is constructed to provide a quantized structure of the input dataset. Noise and cluster halos are automatically distinguished according to grid density. Moreover, the adaptive grid-generation process addresses a common problem of grid clustering, in which the number of cells increases sharply with the dimension. In the second step, a two-stage traversal process performs cluster recognition. Cluster cores of arbitrary shape can be found by concealing the halo points, so CAGS can readily identify the number of clusters. CAGS therefore has the potential to be widely used for clustering datasets with different characteristics. We test the clustering performance of CAGS on six types of datasets: datasets with noise, large-scale datasets, high-dimensional datasets, datasets with arbitrary shapes, datasets with large differences in density between classes, and datasets with high overlap between classes. Experimental results show that CAGS performed best on 10 of 11 tests and outperforms state-of-the-art clustering methods on all of the above dataset types.
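The two steps described in the abstract can be illustrated with a much simplified grid-clustering sketch. This is not the authors' CAGS implementation: it assumes a fixed uniform grid instead of the adaptive construction, a single hypothetical density threshold (`min_density`) instead of the noise/halo distinction, and one breadth-first pass over adjacent cells instead of the two-stage traversal. The function name `grid_cluster` and its parameters are illustrative only. Storing only occupied cells in a dictionary is one common way to keep the cell count from growing with the full grid volume.

```python
from collections import defaultdict, deque
from itertools import product


def grid_cluster(points, cells_per_dim=10, min_density=2):
    """Label points by connected components of dense grid cells.

    Points that fall in sparse cells get the label -1.
    Simplified illustration only; not the CAGS algorithm itself.
    """
    dims = len(points[0])
    lows = [min(p[d] for p in points) for d in range(dims)]
    highs = [max(p[d] for p in points) for d in range(dims)]
    # Cell width per dimension; fall back to 1.0 if a dimension has zero extent.
    widths = [((highs[d] - lows[d]) / cells_per_dim) or 1.0 for d in range(dims)]

    def cell_of(p):
        # Clamp so that points on the upper boundary land in the last cell.
        return tuple(
            min(int((p[d] - lows[d]) / widths[d]), cells_per_dim - 1)
            for d in range(dims)
        )

    # Step 1: quantize the data, keeping only occupied cells.
    cells = defaultdict(list)
    for i, p in enumerate(points):
        cells[cell_of(p)].append(i)
    dense = {c for c, members in cells.items() if len(members) >= min_density}

    # Step 2: breadth-first search over adjacent dense cells.
    # The 3**dims - 1 neighbor offsets make this practical for low dimensions only.
    offsets = [o for o in product((-1, 0, 1), repeat=dims) if any(o)]
    labels = [-1] * len(points)
    seen = set()
    n_clusters = 0
    for start in sorted(dense):
        if start in seen:
            continue
        seen.add(start)
        queue = deque([start])
        while queue:
            cell = queue.popleft()
            for i in cells[cell]:
                labels[i] = n_clusters
            for o in offsets:
                nb = tuple(c + step for c, step in zip(cell, o))
                if nb in dense and nb not in seen:
                    seen.add(nb)
                    queue.append(nb)
        n_clusters += 1
    return labels, n_clusters
```

In this sketch the number of clusters falls out of the traversal rather than being supplied up front, which mirrors the abstract's point that the cluster count is identified by the method.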

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c77/10528124/e0f1c8dfacd0/entropy-25-01342-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c77/10528124/59ea9db3a660/entropy-25-01342-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c77/10528124/ffa7c90a91e4/entropy-25-01342-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c77/10528124/e47866f2f9af/entropy-25-01342-g003.jpg