Fast Self-Supervised Clustering With Anchor Graph.

Authors

Wang Jingyu, Ma Zhenyu, Nie Feiping, Li Xuelong

Publication information

IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4199-4212. doi: 10.1109/TNNLS.2021.3056080. Epub 2022 Aug 31.

DOI:10.1109/TNNLS.2021.3056080

Abstract

Because it avoids the need for labeled samples, which are usually scarce in the real world, unsupervised learning has been regarded as a fast and powerful strategy for clustering tasks. However, clustering directly on the original data sets incurs a high computational cost, which limits its application to large-scale and high-dimensional problems. Recently, anchor-based theories have been proposed to partly mitigate this problem and yield a naturally sparse affinity matrix, but achieving excellent performance together with high efficiency remains a challenge. To address this issue, we first present a fast semisupervised framework (FSSF) that combines a balanced K-means-based hierarchical K-means (BKHK) method with bipartite graph theory. We then propose a fast self-supervised clustering method built on this semisupervised framework, in which all labels are inferred from a constructed bipartite graph with exactly k connected components. The proposed method markedly accelerates general semisupervised learning through the use of anchors and consists of four main parts: 1) obtaining the anchor set as an intermediate result through the BKHK algorithm; 2) constructing the bipartite graph; 3) solving the self-supervised problem to construct a typical probability model with FSSF; and 4) selecting the points most representative of the BKHK anchors as an intermediate result and conducting label propagation. Experimental results on toy examples and benchmark data sets demonstrate that the proposed method outperforms other approaches.
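Steps 1) and 2) of the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives no formulas, so the balanced split rule, the function names (`balanced_two_means`, `bkhk_anchors`, `anchor_graph`), and the distance-based edge weights are all assumptions chosen for illustration. Steps 3) and 4) (the FSSF model and label propagation) are not covered.

```python
import numpy as np


def balanced_two_means(X, n_iter=10, seed=0):
    """Split the rows of X into two equal-sized halves.

    Simplified stand-in for a balanced 2-means step (an assumption, not the
    paper's exact procedure). Requires at least 2 rows.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    centers = X[rng.choice(n, size=2, replace=False)]
    half = n // 2
    for _ in range(n_iter):
        d0 = ((X - centers[0]) ** 2).sum(axis=1)
        d1 = ((X - centers[1]) ** 2).sum(axis=1)
        # Points that prefer center 0 most strongly come first.
        order = np.argsort(d0 - d1)
        left, right = order[:half], order[half:]
        centers = np.stack([X[left].mean(axis=0), X[right].mean(axis=0)])
    return left, right


def bkhk_anchors(X, depth, seed=0):
    """Bisect the data `depth` times and return 2**depth group means as anchors."""
    groups = [np.arange(X.shape[0])]
    for level in range(depth):
        next_groups = []
        for g in groups:
            left, right = balanced_two_means(X[g], seed=seed + level)
            next_groups.extend([g[left], g[right]])
        groups = next_groups
    return np.stack([X[g].mean(axis=0) for g in groups])


def anchor_graph(X, anchors, k=3):
    """Build a sparse n x m sample-to-anchor matrix Z with k nonzeros per row.

    The weights use the gap to the (k+1)-th nearest anchor, a rule assumed
    here for illustration. Requires more than k anchors.
    """
    d = ((X[:, None, :] - anchors[None, :, :]) ** 2).sum(axis=2)  # (n, m)
    idx = np.argsort(d, axis=1)[:, : k + 1]
    rows = np.arange(X.shape[0])[:, None]
    d_sorted = d[rows, idx]                      # (n, k+1), ascending
    w = d_sorted[:, [k]] - d_sorted[:, :k]       # nonnegative by construction
    w = w / np.maximum(w.sum(axis=1, keepdims=True), 1e-12)
    Z = np.zeros_like(d)
    Z[rows, idx[:, :k]] = w
    return Z
```

Calling `bkhk_anchors(X, depth=3)` on an `(n, d)` array returns 8 anchors, and `anchor_graph(X, anchors, k=3)` returns the `n x 8` matrix `Z` that plays the role of the bipartite graph between samples and anchors. Because each sample connects to only `k` anchors, `Z` is sparse, which is the property the abstract attributes to anchor-based methods.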

Similar articles

Fast Self-Supervised Clustering With Anchor Graph.

IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4199-4212. doi: 10.1109/TNNLS.2021.3056080. Epub 2022 Aug 31.

Progressive Self-Supervised Clustering With Novel Category Discovery.

IEEE Trans Cybern. 2022 Oct;52(10):10393-10406. doi: 10.1109/TCYB.2021.3069836. Epub 2022 Sep 19.

Fast Semisupervised Learning With Bipartite Graph for Large-Scale Data.

IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):626-638. doi: 10.1109/TNNLS.2019.2908504. Epub 2019 May 20.

Learning the consensus and complementary information for large-scale multi-view clustering.

Neural Netw. 2024 Apr;172:106103. doi: 10.1016/j.neunet.2024.106103. Epub 2024 Jan 5.

Large-Scale Clustering With Structured Optimal Bipartite Graph.

IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):9950-9963. doi: 10.1109/TPAMI.2023.3277532. Epub 2023 Jun 30.

Fast Clustering by Directly Solving Bipartite Graph Clustering Problem.

IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):9174-9185. doi: 10.1109/TNNLS.2022.3219131. Epub 2024 Jul 8.

Joint Sparse Representation and Embedding Propagation Learning: A Framework for Graph-Based Semisupervised Learning.

IEEE Trans Neural Netw Learn Syst. 2017 Dec;28(12):2949-2960. doi: 10.1109/TNNLS.2016.2609434. Epub 2016 Sep 28.

Sparse Low-Rank Multi-View Subspace Clustering With Consensus Anchors and Unified Bipartite Graph.

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1438-1452. doi: 10.1109/TNNLS.2023.3332335. Epub 2025 Jan 7.

Joint Structured Bipartite Graph and Row-Sparse Projection for Large-Scale Feature Selection.

IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6911-6924. doi: 10.1109/TNNLS.2024.3389029. Epub 2025 Apr 4.

Co-Clustering by Directly Solving Bipartite Spectral Graph Partitioning.

IEEE Trans Cybern. 2024 Dec;54(12):7590-7601. doi: 10.1109/TCYB.2024.3451292. Epub 2024 Nov 27.

Cited by

scAGCI: an anchor graph-based method for cell clustering from integrated scRNA-seq and scATAC-seq data.

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf244.

Robust self supervised symmetric nonnegative matrix factorization to the graph clustering.

Sci Rep. 2025 Mar 1;15(1):7350. doi: 10.1038/s41598-025-92564-x.
