• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过使用变分自编码器和基于相似度的损失来实现深度聚类。

Achieving deep clustering through the use of variational autoencoders and similarity-based loss.

作者信息

Ma He

机构信息

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150000, China.

出版信息

Math Biosci Eng. 2022 Jul 22;19(10):10344-10360. doi: 10.3934/mbe.2022484.

DOI:10.3934/mbe.2022484
PMID:36031997
Abstract

Clustering is an important and challenging research topic in many fields. Although various clustering algorithms have been developed in the past, traditional shallow clustering algorithms cannot mine the underlying structural information of the data. Recent advances have shown that deep clustering can achieve excellent performance on clustering tasks. In this work, a novel variational autoencoder-based deep clustering algorithm is proposed. It treats the Gaussian mixture model as the prior latent space and uses an additional classifier to distinguish different clusters in the latent space accurately. A similarity-based loss function is proposed consisting specifically of the cross-entropy of the predicted transition probabilities of clusters and the Wasserstein distance of the predicted posterior distributions. The new loss encourages the model to learn meaningful cluster-oriented representations to facilitate clustering tasks. The experimental results show that our method consistently achieves competitive results on various data sets.

摘要

聚类是许多领域中一个重要且具有挑战性的研究课题。尽管过去已经开发了各种聚类算法,但传统的浅层聚类算法无法挖掘数据的潜在结构信息。最近的进展表明,深度聚类在聚类任务上可以取得优异的性能。在这项工作中,提出了一种基于变分自编码器的新型深度聚类算法。它将高斯混合模型视为先验潜在空间,并使用额外的分类器在潜在空间中准确区分不同的聚类。提出了一种基于相似度的损失函数,具体由聚类预测转移概率的交叉熵和预测后验分布的 Wasserstein 距离组成。新的损失鼓励模型学习有意义的面向聚类的表示,以促进聚类任务。实验结果表明,我们的方法在各种数据集上始终取得有竞争力的结果。

相似文献

1
Achieving deep clustering through the use of variational autoencoders and similarity-based loss.通过使用变分自编码器和基于相似度的损失来实现深度聚类。
Math Biosci Eng. 2022 Jul 22;19(10):10344-10360. doi: 10.3934/mbe.2022484.
2
Clustering Analysis via Deep Generative Models With Mixture Models.基于混合模型的深度生成模型的聚类分析
IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):340-350. doi: 10.1109/TNNLS.2020.3027761. Epub 2022 Jan 5.
3
A Decoder-Free Variational Deep Embedding for Unsupervised Clustering.一种用于无监督聚类的无解码器变分深度嵌入
IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5681-5693. doi: 10.1109/TNNLS.2021.3071275. Epub 2022 Oct 5.
4
Mixture-of-Experts Variational Autoencoder for clustering and generating from similarity-based representations on single cell data.基于相似性表示的单细胞数据聚类和生成的专家混合变分自动编码器。
PLoS Comput Biol. 2021 Jun 30;17(6):e1009086. doi: 10.1371/journal.pcbi.1009086. eCollection 2021 Jun.
5
Research on load clustering algorithm based on variational autoencoder and hierarchical clustering.基于变分自编码器和层次聚类的负荷聚类算法研究
PLoS One. 2024 Jun 13;19(6):e0303977. doi: 10.1371/journal.pone.0303977. eCollection 2024.
6
Deep clustering of small molecules at large-scale via variational autoencoder embedding and K-means.通过变分自编码器嵌入和K均值实现小分子的大规模深度聚类。
BMC Bioinformatics. 2022 Apr 15;23(Suppl 4):132. doi: 10.1186/s12859-022-04667-1.
7
Deep Clustering Analysis via Dual Variational Autoencoder With Spherical Latent Embeddings.基于具有球形潜在嵌入的对偶变分自编码器的深度聚类分析
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):6303-6312. doi: 10.1109/TNNLS.2021.3135460. Epub 2023 Sep 1.
8
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.新型多组学去混淆变分自动编码器可获得有意义的疾病亚型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae512.
9
ScInfoVAE: interpretable dimensional reduction of single cell transcription data with variational autoencoders and extended mutual information regularization.ScInfoVAE:使用变分自编码器和扩展互信息正则化对单细胞转录数据进行可解释的降维。
BioData Min. 2023 Jun 10;16(1):17. doi: 10.1186/s13040-023-00333-1.
10
A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations.基于变分推理和图自动编码器的 lncRNA-疾病关联预测的表示学习模型。
BMC Bioinformatics. 2021 Mar 21;22(1):136. doi: 10.1186/s12859-021-04073-z.