• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于链接的聚类集成问题方法。

A Link-Based Approach to the Cluster Ensemble Problem.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2011 Dec;33(12):2396-409. doi: 10.1109/TPAMI.2011.84. Epub 2011 May 12.

DOI:10.1109/TPAMI.2011.84
PMID:21576752
Abstract

Cluster ensembles have recently emerged as a powerful alternative to standard cluster analysis, aggregating several input data clusterings to generate a single output clustering, with improved robustness and stability. From the early work, these techniques held great promise; however, most of them generate the final solution based on incomplete information of a cluster ensemble. The underlying ensemble-information matrix reflects only cluster-data point relations, while those among clusters are generally overlooked. This paper presents a new link-based approach to improve the conventional matrix. It achieves this using the similarity between clusters that are estimated from a link network model of the ensemble. In particular, three new link-based algorithms are proposed for the underlying similarity assessment. The final clustering result is generated from the refined matrix using two different consensus functions of feature-based and graph-based partitioning. This approach is the first to address and explicitly employ the relationship between input partitions, which has not been emphasized by recent studies of matrix refinement. The effectiveness of the link-based approach is empirically demonstrated over 10 data sets (synthetic and real) and three benchmark evaluation measures. The results suggest the new approach is able to efficiently extract information embedded in the input clusterings, and regularly illustrate higher clustering quality in comparison to several state-of-the-art techniques.

摘要

聚类集成最近作为一种强大的替代标准聚类分析的方法出现,它聚合了几个输入数据聚类,以生成单个输出聚类,从而提高了鲁棒性和稳定性。从早期的工作来看,这些技术具有很大的潜力;然而,它们中的大多数都是基于聚类集成的不完整信息生成最终解决方案的。基础的集成信息矩阵仅反映了聚类-数据点关系,而聚类之间的关系通常被忽略。本文提出了一种新的基于链接的方法来改进传统的矩阵。它通过使用从集合的链接网络模型估计的聚类之间的相似性来实现这一点。特别是,提出了三种新的基于链接的算法来进行基础相似性评估。最终的聚类结果是从使用基于特征和基于图的分区的两种不同共识函数的改进矩阵中生成的。这种方法是第一个解决并明确利用输入分区之间关系的方法,这在最近的矩阵细化研究中没有得到强调。基于链接的方法在 10 个数据集(合成和真实)和三个基准评估指标上的实证结果表明了其有效性。结果表明,新方法能够有效地提取输入聚类中嵌入的信息,并经常显示出比几种最先进的技术更高的聚类质量。

相似文献

1
A Link-Based Approach to the Cluster Ensemble Problem.基于链接的聚类集成问题方法。
IEEE Trans Pattern Anal Mach Intell. 2011 Dec;33(12):2396-409. doi: 10.1109/TPAMI.2011.84. Epub 2011 May 12.
2
LCE: a link-based cluster ensemble method for improved gene expression data analysis.LCE:一种基于链接的聚类集成方法,用于改进基因表达数据分析。
Bioinformatics. 2010 Jun 15;26(12):1513-9. doi: 10.1093/bioinformatics/btq226. Epub 2010 May 5.
3
Knowledge based cluster ensemble for cancer discovery from biomolecular data.基于知识的聚类集成在生物分子数据中的癌症发现。
IEEE Trans Nanobioscience. 2011 Jun;10(2):76-85. doi: 10.1109/TNB.2011.2144997. Epub 2011 Jul 7.
4
Locally Weighted Ensemble Clustering.局部加权集成聚类。
IEEE Trans Cybern. 2018 May;48(5):1460-1473. doi: 10.1109/TCYB.2017.2702343. Epub 2017 May 23.
5
Clustering ensembles: models of consensus and weak partitions.聚类集成:共识模型与弱划分
IEEE Trans Pattern Anal Mach Intell. 2005 Dec;27(12):1866-81. doi: 10.1109/TPAMI.2005.237.
6
Fuzzy ensemble clustering based on random projections for DNA microarray data analysis.基于随机投影的模糊集成聚类用于DNA微阵列数据分析
Artif Intell Med. 2009 Feb-Mar;45(2-3):173-83. doi: 10.1016/j.artmed.2008.07.014. Epub 2008 Sep 17.
7
Combining multiple clusterings using evidence accumulation.使用证据积累合并多个聚类。
IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):835-50. doi: 10.1109/TPAMI.2005.113.
8
Cumulative voting consensus method for partitions with variable number of clusters.具有可变聚类数的分区的累积投票共识方法。
IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):160-73. doi: 10.1109/TPAMI.2007.1138.
9
Evaluation of stability of k-means cluster ensembles with respect to random initialization.关于随机初始化的k均值聚类集成稳定性评估。
IEEE Trans Pattern Anal Mach Intell. 2006 Nov;28(11):1798-808. doi: 10.1109/TPAMI.2006.226.
10
A Validity Index for Prototype-Based Clustering of Data Sets With Complex Cluster Structures.一种用于具有复杂聚类结构数据集的基于原型聚类的有效性指标。
IEEE Trans Syst Man Cybern B Cybern. 2011 Aug;41(4):1039-53. doi: 10.1109/TSMCB.2010.2104319. Epub 2011 Feb 4.

引用本文的文献

1
Dual-level clustering ensemble algorithm with three consensus strategies.具有三种一致性策略的双层聚类集成算法
Sci Rep. 2023 Dec 18;13(1):22617. doi: 10.1038/s41598-023-49947-9.
2
Projecting genetic associations through gene expression patterns highlights disease etiology and drug mechanisms.通过基因表达模式预测遗传关联,突出了疾病的病因和药物机制。
Nat Commun. 2023 Sep 9;14(1):5562. doi: 10.1038/s41467-023-41057-4.