基于 O(n) 二分图卷积的非图数据聚类。

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8729-8742. doi: 10.1109/TPAMI.2022.3231470. Epub 2023 Jun 5.

Since the representative capacity of graph-based clustering methods is usually limited by the graph constructed on the original features, it is attractive to find whether graph neural networks (GNNs), a strong extension of neural networks to graphs, can be applied to augment the capacity of graph-based clustering methods. The core problems mainly come from two aspects. On the one hand, the graph is unavailable in the most general clustering scenes so that how to construct graph on the non-graph data and the quality of graph is usually the most important part. On the other hand, given n samples, the graph-based clustering methods usually consume at least O(n) time to build graphs and the graph convolution requires nearly O(n) for a dense graph and O(|E|) for a sparse one with |E| edges. Accordingly, both graph-based clustering and GNNs suffer from the severe inefficiency problem. To tackle these problems, we propose a novel clustering method, AnchorGAE, with the self-supervised estimation of graph and efficient graph convolution. We first show how to convert a non-graph dataset into a graph dataset, by introducing the generative graph model and anchors. A bipartite graph is built via generating anchors and estimating the connectivity distributions of original points and anchors. We then show that the constructed bipartite graph can reduce the computational complexity of graph convolution from O(n) and O(|E|) to O(n). The succeeding steps for clustering can be easily designed as O(n) operations. Interestingly, the anchors naturally lead to siamese architecture with the help of the Markov process. Furthermore, the estimated bipartite graph is updated dynamically according to the features extracted by GNN modules, to promote the quality of the graph by exploiting the high-level information by GNNs. However, we theoretically prove that the self-supervised paradigm frequently results in a collapse that often occurs after 2-3 update iterations in experiments, especially when the model is well-trained. A specific strategy is accordingly designed to prevent the collapse. The experiments support the theoretical analysis and show the superiority of AnchorGAE.

由于基于图的聚类方法的表示能力通常受到原始特征上构建的图的限制，因此寻找图神经网络（GNN）是否可以应用于增强基于图的聚类方法的能力是很有吸引力的。核心问题主要来自两个方面。一方面，在最一般的聚类场景中，图是不可用的，因此如何在非图数据上构建图以及图的质量通常是最重要的部分。另一方面，对于 n 个样本，基于图的聚类方法通常至少需要 O(n)时间来构建图，而图卷积对于密集图需要近 O(n)时间，对于稀疏图（边数为|E|）需要 O(|E|)时间。因此，基于图的聚类和 GNN 都存在严重的效率问题。为了解决这些问题，我们提出了一种新的聚类方法 AnchorGAE，它具有图的自监督估计和高效图卷积。我们首先展示如何通过引入生成图模型和锚点将非图数据集转换为图数据集。通过生成锚点并估计原始点和锚点的连通性分布，构建了一个二分图。然后，我们展示了所构建的二分图可以将图卷积的计算复杂度从 O(n)和 O(|E|)降低到 O(n)。随后的聚类步骤可以很容易地设计为 O(n)操作。有趣的是，锚点在马尔可夫过程的帮助下自然地导致了孪生结构。此外，根据 GNN 模块提取的特征，动态更新估计的二分图，通过利用 GNN 的高级信息来提高图的质量。然而，我们从理论上证明了自监督范式经常导致崩溃，尤其是在模型训练良好的情况下，在实验中经常在 2-3 次更新迭代后发生崩溃。因此，设计了一种特定的策略来防止崩溃。实验支持了理论分析，并展示了 AnchorGAE 的优越性。

相似文献

Non-Graph Data Clustering via O(n) Bipartite Graph Convolution.基于 O(n) 二分图卷积的非图数据聚类。

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8729-8742. doi: 10.1109/TPAMI.2022.3231470. Epub 2023 Jun 5.

Adaptive Graph Auto-Encoder for General Data Clustering.用于通用数据聚类的自适应图自动编码器

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9725-9732. doi: 10.1109/TPAMI.2021.3125687. Epub 2022 Nov 7.

Anchor Graph Network for Incomplete Multiview Clustering.用于不完全多视图聚类的锚图网络

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3708-3719. doi: 10.1109/TNNLS.2024.3349405. Epub 2025 Feb 6.

Sparse Low-Rank Multi-View Subspace Clustering With Consensus Anchors and Unified Bipartite Graph.具有一致性锚点和统一二分图的稀疏低秩多视图子空间聚类

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1438-1452. doi: 10.1109/TNNLS.2023.3332335. Epub 2025 Jan 7.

Multiphysical graph neural network (MP-GNN) for COVID-19 drug design.多物理图神经网络（MP-GNN）在 COVID-19 药物设计中的应用。

Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac231.

Fast Self-Supervised Clustering With Anchor Graph.基于锚图的快速自监督聚类

IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4199-4212. doi: 10.1109/TNNLS.2021.3056080. Epub 2022 Aug 31.

Co-Clustering on Bipartite Graphs for Robust Model Fitting.用于稳健模型拟合的二分图上的协同聚类

IEEE Trans Image Process. 2022;31:6605-6620. doi: 10.1109/TIP.2022.3214073. Epub 2022 Oct 26.

Kemeny Constant-Based Optimization of Network Clustering Using Graph Neural Networks.基于凯梅尼常数的图神经网络网络聚类优化

J Phys Chem B. 2024 Aug 29;128(34):8103-8115. doi: 10.1021/acs.jpcb.3c08213. Epub 2024 Aug 15.

Fast Haar Transforms for Graph Neural Networks.快速 Haar 变换用于图神经网络。

Neural Netw. 2020 Aug;128:188-198. doi: 10.1016/j.neunet.2020.04.028. Epub 2020 May 4.

Efficient Multi-View Clustering via Unified and Discrete Bipartite Graph Learning.通过统一和离散二分图学习实现高效多视图聚类

IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):11436-11447. doi: 10.1109/TNNLS.2023.3261460. Epub 2024 Aug 5.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Non-Graph Data Clustering via O(n) Bipartite Graph Convolution.基于 O(n) 二分图卷积的非图数据聚类。

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8729-8742. doi: 10.1109/TPAMI.2022.3231470. Epub 2023 Jun 5.

Adaptive Graph Auto-Encoder for General Data Clustering.用于通用数据聚类的自适应图自动编码器

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9725-9732. doi: 10.1109/TPAMI.2021.3125687. Epub 2022 Nov 7.

Anchor Graph Network for Incomplete Multiview Clustering.用于不完全多视图聚类的锚图网络

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3708-3719. doi: 10.1109/TNNLS.2024.3349405. Epub 2025 Feb 6.

Sparse Low-Rank Multi-View Subspace Clustering With Consensus Anchors and Unified Bipartite Graph.具有一致性锚点和统一二分图的稀疏低秩多视图子空间聚类

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1438-1452. doi: 10.1109/TNNLS.2023.3332335. Epub 2025 Jan 7.

Multiphysical graph neural network (MP-GNN) for COVID-19 drug design.多物理图神经网络（MP-GNN）在 COVID-19 药物设计中的应用。

Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac231.

Fast Self-Supervised Clustering With Anchor Graph.基于锚图的快速自监督聚类

IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4199-4212. doi: 10.1109/TNNLS.2021.3056080. Epub 2022 Aug 31.

Co-Clustering on Bipartite Graphs for Robust Model Fitting.用于稳健模型拟合的二分图上的协同聚类

IEEE Trans Image Process. 2022;31:6605-6620. doi: 10.1109/TIP.2022.3214073. Epub 2022 Oct 26.

Kemeny Constant-Based Optimization of Network Clustering Using Graph Neural Networks.基于凯梅尼常数的图神经网络网络聚类优化

J Phys Chem B. 2024 Aug 29;128(34):8103-8115. doi: 10.1021/acs.jpcb.3c08213. Epub 2024 Aug 15.

Fast Haar Transforms for Graph Neural Networks.快速 Haar 变换用于图神经网络。

Neural Netw. 2020 Aug;128:188-198. doi: 10.1016/j.neunet.2020.04.028. Epub 2020 May 4.

Efficient Multi-View Clustering via Unified and Discrete Bipartite Graph Learning.通过统一和离散二分图学习实现高效多视图聚类

IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):11436-11447. doi: 10.1109/TNNLS.2023.3261460. Epub 2024 Aug 5.

Non-Graph Data Clustering via O(n) Bipartite Graph Convolution.

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献