• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面向实际大规模复杂网络的多级分层核谱聚类

Multilevel hierarchical kernel spectral clustering for real-life large scale complex networks.

作者信息

Mall Raghvendra, Langone Rocco, Suykens Johan A K

机构信息

ESAT-STADIUS, KU Leuven, Leuven, Belgium.

出版信息

PLoS One. 2014 Jun 20;9(6):e99966. doi: 10.1371/journal.pone.0099966. eCollection 2014.

DOI:10.1371/journal.pone.0099966
PMID:24949877
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4065034/
Abstract

Kernel spectral clustering corresponds to a weighted kernel principal component analysis problem in a constrained optimization framework. The primal formulation leads to an eigen-decomposition of a centered Laplacian matrix at the dual level. The dual formulation allows to build a model on a representative subgraph of the large scale network in the training phase and the model parameters are estimated in the validation stage. The KSC model has a powerful out-of-sample extension property which allows cluster affiliation for the unseen nodes of the big data network. In this paper we exploit the structure of the projections in the eigenspace during the validation stage to automatically determine a set of increasing distance thresholds. We use these distance thresholds in the test phase to obtain multiple levels of hierarchy for the large scale network. The hierarchical structure in the network is determined in a bottom-up fashion. We empirically showcase that real-world networks have multilevel hierarchical organization which cannot be detected efficiently by several state-of-the-art large scale hierarchical community detection techniques like the Louvain, OSLOM and Infomap methods. We show that a major advantage of our proposed approach is the ability to locate good quality clusters at both the finer and coarser levels of hierarchy using internal cluster quality metrics on 7 real-life networks.

摘要

核谱聚类对应于约束优化框架中的加权核主成分分析问题。原始公式在对偶层面导致中心拉普拉斯矩阵的特征分解。对偶公式允许在训练阶段基于大规模网络的代表性子图构建模型,并在验证阶段估计模型参数。KSC模型具有强大的样本外扩展属性,可对大数据网络中未见过的节点进行聚类归属。在本文中,我们在验证阶段利用特征空间中投影的结构来自动确定一组递增的距离阈值。我们在测试阶段使用这些距离阈值为大规模网络获得多个层次级别。网络中的层次结构以自底向上的方式确定。我们通过实验表明,现实世界的网络具有多级层次组织,而诸如Louvain、OSLOM和Infomap方法等几种先进的大规模层次社区检测技术无法有效地检测到这种组织。我们表明,我们提出的方法的一个主要优点是能够使用7个真实网络上的内部聚类质量指标在层次结构的精细和粗糙级别上定位高质量的聚类。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/f102d0436dc1/pone.0099966.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/d3acd3db1a14/pone.0099966.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/780b146b7bc3/pone.0099966.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/909c248684ae/pone.0099966.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/e5d3fbb79a7b/pone.0099966.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/893d03c4e47d/pone.0099966.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/27601bcf9f94/pone.0099966.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/9121365d8c64/pone.0099966.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/e968ea231a77/pone.0099966.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/f1e01ceadc11/pone.0099966.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/d2355a243c7f/pone.0099966.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/42fcec562140/pone.0099966.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/f102d0436dc1/pone.0099966.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/d3acd3db1a14/pone.0099966.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/780b146b7bc3/pone.0099966.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/909c248684ae/pone.0099966.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/e5d3fbb79a7b/pone.0099966.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/893d03c4e47d/pone.0099966.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/27601bcf9f94/pone.0099966.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/9121365d8c64/pone.0099966.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/e968ea231a77/pone.0099966.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/f1e01ceadc11/pone.0099966.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/d2355a243c7f/pone.0099966.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/42fcec562140/pone.0099966.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9e0/4065034/f102d0436dc1/pone.0099966.g012.jpg

相似文献

1
Multilevel hierarchical kernel spectral clustering for real-life large scale complex networks.面向实际大规模复杂网络的多级分层核谱聚类
PLoS One. 2014 Jun 20;9(6):e99966. doi: 10.1371/journal.pone.0099966. eCollection 2014.
2
Hierarchical kernel spectral clustering.层次核谱聚类。
Neural Netw. 2012 Nov;35:21-30. doi: 10.1016/j.neunet.2012.06.007. Epub 2012 Jul 16.
3
Multiway spectral clustering with out-of-sample extensions through weighted kernel PCA.基于加权核主成分分析的离样本扩展的多路谱聚类。
IEEE Trans Pattern Anal Mach Intell. 2010 Feb;32(2):335-47. doi: 10.1109/TPAMI.2008.292.
4
Analysis of Network Clustering Algorithms and Cluster Quality Metrics at Scale.大规模网络聚类算法与聚类质量指标分析
PLoS One. 2016 Jul 8;11(7):e0159161. doi: 10.1371/journal.pone.0159161. eCollection 2016.
5
Hierarchical organization in complex networks.复杂网络中的层次组织。
Phys Rev E Stat Nonlin Soft Matter Phys. 2003 Feb;67(2 Pt 2):026112. doi: 10.1103/PhysRevE.67.026112. Epub 2003 Feb 14.
6
Resolving the structure of interactomes with hierarchical agglomerative clustering.利用层次凝聚聚类解析互作组学结构。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S44. doi: 10.1186/1471-2105-12-S1-S44.
7
Weighted graph cuts without eigenvectors a multilevel approach.无需特征向量的加权图割:一种多级方法。
IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):1944-57. doi: 10.1109/TPAMI.2007.1115.
8
Finding statistically significant communities in networks.在网络中发现具有统计学意义的社区。
PLoS One. 2011 Apr 29;6(4):e18961. doi: 10.1371/journal.pone.0018961.
9
A community detection algorithm based on topology potential and spectral clustering.一种基于拓扑势和谱聚类的社区检测算法。
ScientificWorldJournal. 2014;2014:329325. doi: 10.1155/2014/329325. Epub 2014 Jul 22.
10
Spectral Clustering Community Detection Algorithm Based on Point-Wise Mutual Information Graph Kernel.基于逐点互信息图核的谱聚类社区检测算法
Entropy (Basel). 2023 Dec 3;25(12):1617. doi: 10.3390/e25121617.

引用本文的文献

1
VISH-Pred: an ensemble of fine-tuned ESM models for protein toxicity prediction.VISH-Pred:一种用于蛋白质毒性预测的微调 ESM 模型的集成。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae270.
2
Pancancer transcriptomic profiling identifies key PANoptosis markers as therapeutic targets for oncology.泛癌转录组分析确定关键的PAN凋亡标志物作为肿瘤学的治疗靶点。
NAR Cancer. 2022 Nov 1;4(4):zcac033. doi: 10.1093/narcan/zcac033. eCollection 2022 Dec.
3
Detection of statistically significant network changes in complex biological networks.

本文引用的文献

1
Hierarchical kernel spectral clustering.层次核谱聚类。
Neural Netw. 2012 Nov;35:21-30. doi: 10.1016/j.neunet.2012.06.007. Epub 2012 Jul 16.
2
Limits of modularity maximization in community detection.社区检测中模块化最大化的局限性。
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Dec;84(6 Pt 2):066122. doi: 10.1103/PhysRevE.84.066122. Epub 2011 Dec 27.
3
Finding statistically significant communities in networks.在网络中发现具有统计学意义的社区。
复杂生物网络中具有统计学意义的网络变化检测。
BMC Syst Biol. 2017 Mar 4;11(1):32. doi: 10.1186/s12918-017-0412-6.
4
Single Molecule Cluster Analysis dissects splicing pathway conformational dynamics.单分子簇分析剖析剪接途径的构象动力学。
Nat Methods. 2015 Nov;12(11):1077-84. doi: 10.1038/nmeth.3602. Epub 2015 Sep 28.
5
Netgram: Visualizing Communities in Evolving Networks.Netgram:可视化动态网络中的社区
PLoS One. 2015 Sep 10;10(9):e0137502. doi: 10.1371/journal.pone.0137502. eCollection 2015.
PLoS One. 2011 Apr 29;6(4):e18961. doi: 10.1371/journal.pone.0018961.
4
Performance of modularity maximization in practical contexts.模块化最大化在实际环境中的性能。
Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Apr;81(4 Pt 2):046106. doi: 10.1103/PhysRevE.81.046106. Epub 2010 Apr 15.
5
Community detection algorithms: a comparative analysis.社区检测算法:一项比较分析。
Phys Rev E Stat Nonlin Soft Matter Phys. 2009 Nov;80(5 Pt 2):056117. doi: 10.1103/PhysRevE.80.056117. Epub 2009 Nov 30.
6
Multiway spectral clustering with out-of-sample extensions through weighted kernel PCA.基于加权核主成分分析的离样本扩展的多路谱聚类。
IEEE Trans Pattern Anal Mach Intell. 2010 Feb;32(2):335-47. doi: 10.1109/TPAMI.2008.292.
7
Maps of random walks on complex networks reveal community structure.复杂网络上随机游走的图谱揭示了群落结构。
Proc Natl Acad Sci U S A. 2008 Jan 29;105(4):1118-23. doi: 10.1073/pnas.0706851105. Epub 2008 Jan 23.
8
Weighted graph cuts without eigenvectors a multilevel approach.无需特征向量的加权图割:一种多级方法。
IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):1944-57. doi: 10.1109/TPAMI.2007.1115.
9
Resolution limit in community detection.社区检测中的分辨率极限。
Proc Natl Acad Sci U S A. 2007 Jan 2;104(1):36-41. doi: 10.1073/pnas.0605965104. Epub 2006 Dec 26.
10
Finding community structure in very large networks.在超大型网络中寻找社区结构。
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Dec;70(6 Pt 2):066111. doi: 10.1103/PhysRevE.70.066111. Epub 2004 Dec 6.