自适应双权值法在基于 HMM 的混合元聚类集成自动初始化和模型选择中的应用。

Adaptive Bi-Weighting Toward Automatic Initialization and Model Selection for HMM-Based Hybrid Meta-Clustering Ensembles.

出版信息

IEEE Trans Cybern. 2019 May;49(5):1657-1668. doi: 10.1109/TCYB.2018.2809562. Epub 2018 Mar 27.

DOI:10.1109/TCYB.2018.2809562

Abstract

Temporal data clustering can provide underpinning techniques for the discovery of intrinsic structures, which proved important in condensing or summarizing information demanded in various fields of information sciences, ranging from time series analysis to sequential data understanding. In this paper, we propose a novel hidden Markov model (HMM)-based hybrid meta-clustering ensemble with bi-weighting scheme to solve the problems of initialization and model selection associated with temporal data clustering. To improve the performance of the ensemble techniques, the proposed bi-weighting scheme adaptively examines the partition process and hence optimizes the fusion of consensus functions. Specifically, three consensus functions are used to combine the input partitions, generated by HMM-based K -models under different initializations, into a robust consensus partition. An optimal consensus partition is then selected from the three candidates by a normalized mutual information-based objective function. Finally, the optimal consensus partition is further refined by the HMM-based agglomerative clustering algorithm in association with dendrogram-based similarity partitioning algorithm, leading to the advantage that the number of clusters can be automatically and adaptively determined. Extensive experiments on synthetic data, time series, and real-world motion trajectory datasets illustrate that our proposed approach outperforms all the selected benchmarks and hence providing promising potentials for developing improved clustering tools for information analysis and management.

摘要

时间数据聚类可以为发现内在结构提供基础技术，这在信息科学的各个领域中都很重要，从时间序列分析到顺序数据理解，这些内在结构可以用来压缩或总结信息。在本文中，我们提出了一种新的基于隐马尔可夫模型（HMM）的混合元聚类集成，具有双加权方案，以解决与时间数据聚类相关的初始化和模型选择问题。为了提高集成技术的性能，所提出的双加权方案自适应地检查分区过程，从而优化共识函数的融合。具体来说，使用三个共识函数将由基于 HMM 的 K-模型在不同初始化下生成的输入分区合并为一个稳健的共识分区。然后，通过基于归一化互信息的目标函数从三个候选者中选择最佳共识分区。最后，通过与基于树状图的相似性分区算法相关联的基于 HMM 的凝聚聚类算法进一步细化最佳共识分区，从而可以自动且自适应地确定聚类的数量。在合成数据、时间序列和真实运动轨迹数据集上的广泛实验表明，我们提出的方法优于所有选定的基准，因此为开发用于信息分析和管理的改进聚类工具提供了有希望的潜力。

相似文献

Adaptive Bi-Weighting Toward Automatic Initialization and Model Selection for HMM-Based Hybrid Meta-Clustering Ensembles.自适应双权值法在基于 HMM 的混合元聚类集成自动初始化和模型选择中的应用。

IEEE Trans Cybern. 2019 May;49(5):1657-1668. doi: 10.1109/TCYB.2018.2809562. Epub 2018 Mar 27.

Hybrid Sampling-Based Clustering Ensemble With Global and Local Constitutions.基于混合采样的聚类集成算法，具有全局和局部结构。

IEEE Trans Neural Netw Learn Syst. 2016 May;27(5):952-65. doi: 10.1109/TNNLS.2015.2430821.

Creating Discriminative Models for Time Series Classification and Clustering by HMM Ensembles.基于 HMM 集成的时间序列分类和聚类判别模型的构建。

IEEE Trans Cybern. 2016 Dec;46(12):2899-2910. doi: 10.1109/TCYB.2015.2492920. Epub 2015 Oct 30.

Cumulative voting consensus method for partitions with variable number of clusters.具有可变聚类数的分区的累积投票共识方法。

IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):160-73. doi: 10.1109/TPAMI.2007.1138.

Clustering Hidden Markov Models With Variational Bayesian Hierarchical EM.变分贝叶斯层次 EM 聚类隐马尔可夫模型。

IEEE Trans Neural Netw Learn Syst. 2023 Mar;34(3):1537-1551. doi: 10.1109/TNNLS.2021.3105570. Epub 2023 Feb 28.

A Link-Based Approach to the Cluster Ensemble Problem.基于链接的聚类集成问题方法。

IEEE Trans Pattern Anal Mach Intell. 2011 Dec;33(12):2396-409. doi: 10.1109/TPAMI.2011.84. Epub 2011 May 12.

Locally Weighted Ensemble Clustering.局部加权集成聚类。

IEEE Trans Cybern. 2018 May;48(5):1460-1473. doi: 10.1109/TCYB.2017.2702343. Epub 2017 May 23.

Adaptive Fuzzy Consensus Clustering Framework for Clustering Analysis of Cancer Data.用于癌症数据聚类分析的自适应模糊共识聚类框架

IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug;12(4):887-901. doi: 10.1109/TCBB.2014.2359433.

Variational Beta Process Hidden Markov Models with Shared Hidden States for Trajectory Recognition.具有共享隐藏状态的变分贝塔过程隐马尔可夫模型用于轨迹识别

Entropy (Basel). 2021 Sep 30;23(10):1290. doi: 10.3390/e23101290.

Combining multiple clusterings using evidence accumulation.使用证据积累合并多个聚类。

IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):835-50. doi: 10.1109/TPAMI.2005.113.

引用本文的文献

Using Multi-Scale Convolutional Neural Network Based on Multi-Instance Learning to Predict the Efficacy of Neoadjuvant Chemoradiotherapy for Rectal Cancer.基于多实例学习的多尺度卷积神经网络预测直肠癌新辅助放化疗疗效。

IEEE J Transl Eng Health Med. 2022 Mar 3;10:4300108. doi: 10.1109/JTEHM.2022.3156851. eCollection 2022.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

自适应双权值法在基于 HMM 的混合元聚类集成自动初始化和模型选择中的应用。

Adaptive Bi-Weighting Toward Automatic Initialization and Model Selection for HMM-Based Hybrid Meta-Clustering Ensembles.

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献