基于混合模型的深度生成模型的聚类分析

Clustering Analysis via Deep Generative Models With Mixture Models.

作者信息

Yang Lin, Fan Wentao, Bouguila Nizar

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):340-350. doi: 10.1109/TNNLS.2020.3027761. Epub 2022 Jan 5.

DOI:10.1109/TNNLS.2020.3027761

Abstract

Clustering is a fundamental problem that frequently arises in many fields, such as pattern recognition, data mining, and machine learning. Although various clustering algorithms have been developed in the past, traditional clustering algorithms with shallow structures cannot excavate the interdependence of complex data features in latent space. Recently, deep generative models, such as autoencoder (AE), variational AE (VAE), and generative adversarial network (GAN), have achieved remarkable success in many unsupervised applications thanks to their capabilities for learning promising latent representations from original data. In this work, first we propose a novel clustering approach based on both Wasserstein GAN with gradient penalty (WGAN-GP) and VAE with a Gaussian mixture prior. By combining the WGAN-GP with VAE, the generator of WGAN-GP is formulated by drawing samples from the probabilistic decoder of VAE. Moreover, to provide more robust clustering and generation performance when outliers are encountered in data, a variant of the proposed deep generative model is developed based on a Student's-t mixture prior. The effectiveness of our deep generative models is validated though experiments on both clustering analysis and samples generation. Through the comparison with other state-of-art clustering approaches based on deep generative models, the proposed approach can provide more stable training of the model, improve the accuracy of clustering, and generate realistic samples.

摘要

聚类是一个在许多领域经常出现的基本问题，如模式识别、数据挖掘和机器学习。尽管过去已经开发了各种聚类算法，但结构简单的传统聚类算法无法挖掘潜在空间中复杂数据特征的相互依赖性。最近，深度生成模型，如自动编码器（AE）、变分自动编码器（VAE）和生成对抗网络（GAN），由于其能够从原始数据中学习有前景的潜在表示，在许多无监督应用中取得了显著成功。在这项工作中，首先我们提出了一种基于带梯度惩罚的瓦瑟斯坦生成对抗网络（WGAN-GP）和具有高斯混合先验的VAE的新型聚类方法。通过将WGAN-GP与VAE相结合，WGAN-GP的生成器是通过从VAE的概率解码器中采样来构建的。此外，为了在数据中遇到离群值时提供更稳健的聚类和生成性能，基于学生t混合先验开发了所提出的深度生成模型的一个变体。通过聚类分析和样本生成实验验证了我们深度生成模型的有效性。通过与其他基于深度生成模型的先进聚类方法进行比较，所提出的方法可以为模型提供更稳定的训练，提高聚类的准确性，并生成逼真的样本。

相似文献

Clustering Analysis via Deep Generative Models With Mixture Models.基于混合模型的深度生成模型的聚类分析

IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):340-350. doi: 10.1109/TNNLS.2020.3027761. Epub 2022 Jan 5.

Data augmentation for enhancing EEG-based emotion recognition with deep generative models.基于深度生成模型的数据增强以增强基于 EEG 的情绪识别。

J Neural Eng. 2020 Oct 14;17(5):056021. doi: 10.1088/1741-2552/abb580.

Deep Clustering Analysis via Dual Variational Autoencoder With Spherical Latent Embeddings.基于具有球形潜在嵌入的对偶变分自编码器的深度聚类分析

IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):6303-6312. doi: 10.1109/TNNLS.2021.3135460. Epub 2023 Sep 1.

VAE-WACGAN: An Improved Data Augmentation Method Based on VAEGAN for Intrusion Detection.变分自编码器- Wasserstein对抗生成网络：一种基于变分自编码器-生成对抗网络的改进型入侵检测数据增强方法

Sensors (Basel). 2024 Sep 18;24(18):6035. doi: 10.3390/s24186035.

Data generation for connected and automated vehicle tests using deep learning models.利用深度学习模型进行车联网和自动驾驶车辆测试的数据生成。

Accid Anal Prev. 2023 Sep;190:107192. doi: 10.1016/j.aap.2023.107192. Epub 2023 Jun 26.

Enhancing classification of cells procured from bone marrow aspirate smears using generative adversarial networks and sequential convolutional neural network.利用生成对抗网络和序列卷积神经网络增强骨髓穿刺涂片获取的细胞分类。

Comput Methods Programs Biomed. 2022 Sep;224:107019. doi: 10.1016/j.cmpb.2022.107019. Epub 2022 Jul 10.

Generative AI with WGAN-GP for boosting seizure detection accuracy.用于提高癫痫发作检测准确性的带有 Wasserstein 生成对抗网络梯度惩罚的生成式人工智能。

Front Artif Intell. 2024 Oct 2;7:1437315. doi: 10.3389/frai.2024.1437315. eCollection 2024.

Generative models struggle with kirigami metamaterials.生成模型在kirigami超材料方面存在困难。

Sci Rep. 2024 Aug 20;14(1):19397. doi: 10.1038/s41598-024-70364-z.

Robust Semisupervised Deep Generative Model Under Compound Noise.复合噪声下的稳健半监督深度生成模型

IEEE Trans Neural Netw Learn Syst. 2023 Mar;34(3):1179-1193. doi: 10.1109/TNNLS.2021.3105080. Epub 2023 Feb 28.

Improving Skin Cancer Classification Using Heavy-Tailed Student T-Distribution in Generative Adversarial Networks (TED-GAN).在生成对抗网络（TED-GAN）中使用重尾学生T分布改进皮肤癌分类

Diagnostics (Basel). 2021 Nov 19;11(11):2147. doi: 10.3390/diagnostics11112147.

引用本文的文献

scFocus: Detecting branching probabilities in single-cell data with SAC.scFocus：使用SAC检测单细胞数据中的分支概率。

Comput Struct Biotechnol J. 2025 May 20;27:2243-2263. doi: 10.1016/j.csbj.2025.04.036. eCollection 2025.

A deep multiple self-supervised clustering model based on autoencoder networks.一种基于自动编码器网络的深度多重自监督聚类模型。

Sci Rep. 2025 May 26;15(1):18372. doi: 10.1038/s41598-025-00349-z.

Private measures, random walks, and synthetic data.私人措施、随机游走与合成数据。

Probab Theory Relat Fields. 2024;189(1-2):569-611. doi: 10.1007/s00440-024-01279-z. Epub 2024 Apr 20.

Tea Chrysanthemum Detection by Leveraging Generative Adversarial Networks and Edge Computing.利用生成对抗网络和边缘计算进行茶菊花检测

Front Plant Sci. 2022 Apr 7;13:850606. doi: 10.3389/fpls.2022.850606. eCollection 2022.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于混合模型的深度生成模型的聚类分析

Clustering Analysis via Deep Generative Models With Mixture Models.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献