• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于矩阵数据聚类的惩罚矩阵正态混合模型。

A Penalized Matrix Normal Mixture Model for Clustering Matrix Data.

作者信息

Heo Jinwon, Baek Jangsun

机构信息

Department of Mathematics and Statistics, Chonnam National University, 77 Yongbong-ro, Buk-gu, Gwangju 61186, Korea.

出版信息

Entropy (Basel). 2021 Sep 26;23(10):1249. doi: 10.3390/e23101249.

DOI:10.3390/e23101249
PMID:34681973
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8534904/
Abstract

Along with advances in technology, matrix data, such as medical/industrial images, have emerged in many practical fields. These data usually have high dimensions and are not easy to cluster due to their intrinsic correlated structure among rows and columns. Most approaches convert matrix data to multi dimensional vectors and apply conventional clustering methods to them, and thus, suffer from an extreme high-dimensionality problem as well as a lack of interpretability of the correlated structure among row/column variables. Recently, a regularized model was proposed for clustering matrix-valued data by imposing a sparsity structure for the mean signal of each cluster. We extend their approach by regularizing further on the covariance to cope better with the curse of dimensionality for large size images. A penalized matrix normal mixture model with lasso-type penalty terms in both mean and covariance matrices is proposed, and then an expectation maximization algorithm is developed to estimate the parameters. The proposed method has the competence of both parsimonious modeling and reflecting the proper conditional correlation structure. The estimators are consistent, and their limiting distributions are derived. We applied the proposed method to simulated data as well as real datasets and measured its clustering performance with the clustering accuracy (ACC) and the adjusted rand index (ARI). The experiment results show that the proposed method performed better with higher ACC and ARI than those of conventional methods.

摘要

随着技术的进步,矩阵数据,如医学/工业图像,已出现在许多实际领域。这些数据通常具有高维度,并且由于其行与列之间固有的相关结构而不易聚类。大多数方法将矩阵数据转换为多维向量并对其应用传统的聚类方法,因此,会面临极高维度问题以及行/列变量之间相关结构缺乏可解释性的问题。最近,有人提出了一种正则化模型,通过对每个聚类的均值信号施加稀疏结构来对矩阵值数据进行聚类。我们通过对协方差进一步正则化来扩展他们的方法,以更好地应对大尺寸图像的维度灾难。提出了一种在均值矩阵和协方差矩阵中都带有套索型惩罚项的惩罚矩阵正态混合模型,然后开发了一种期望最大化算法来估计参数。所提出的方法具有简约建模和反映适当条件相关结构的能力。估计量是一致的,并推导了它们的极限分布。我们将所提出的方法应用于模拟数据以及真实数据集,并使用聚类准确率(ACC)和调整兰德指数(ARI)来衡量其聚类性能。实验结果表明,所提出的方法在ACC和ARI方面比传统方法表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/bcf935ef301d/entropy-23-01249-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/edfc742dad74/entropy-23-01249-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/471d43d46b77/entropy-23-01249-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/82b508c81450/entropy-23-01249-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/d49a512ea18c/entropy-23-01249-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/9cd466fabb36/entropy-23-01249-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/a16cc65d4ab9/entropy-23-01249-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/21c578e4d1e1/entropy-23-01249-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/bcf935ef301d/entropy-23-01249-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/edfc742dad74/entropy-23-01249-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/471d43d46b77/entropy-23-01249-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/82b508c81450/entropy-23-01249-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/d49a512ea18c/entropy-23-01249-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/9cd466fabb36/entropy-23-01249-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/a16cc65d4ab9/entropy-23-01249-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/21c578e4d1e1/entropy-23-01249-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1d2/8534904/bcf935ef301d/entropy-23-01249-g008.jpg

相似文献

1
A Penalized Matrix Normal Mixture Model for Clustering Matrix Data.一种用于矩阵数据聚类的惩罚矩阵正态混合模型。
Entropy (Basel). 2021 Sep 26;23(10):1249. doi: 10.3390/e23101249.
2
Regularized matrix data clustering and its application to image analysis.正则化矩阵数据聚类及其在图像分析中的应用。
Biometrics. 2021 Sep;77(3):890-902. doi: 10.1111/biom.13354. Epub 2020 Aug 24.
3
Regularized parameter estimation in high-dimensional gaussian mixture models.高维高斯混合模型中的正则化参数估计。
Neural Comput. 2011 Jun;23(6):1605-22. doi: 10.1162/NECO_a_00128. Epub 2011 Mar 11.
4
Penalized model-based clustering with unconstrained covariance matrices.具有无约束协方差矩阵的基于惩罚模型的聚类
Electron J Stat. 2009 Jan 1;3:1473-1496. doi: 10.1214/09-EJS487.
5
2D-EM clustering approach for high-dimensional data through folding feature vectors.通过折叠特征向量的 2D-EM 聚类方法对高维数据进行聚类。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):547. doi: 10.1186/s12859-017-1970-8.
6
Sparse estimation of a covariance matrix.协方差矩阵的稀疏估计。
Biometrika. 2011 Dec;98(4):807-820. doi: 10.1093/biomet/asr054.
7
Regularized Gaussian Mixture Model for High-Dimensional Clustering.用于高维聚类的正则化高斯混合模型
IEEE Trans Cybern. 2019 Oct;49(10):3677-3688. doi: 10.1109/TCYB.2018.2846404. Epub 2018 Jun 27.
8
Sparse Count Data Clustering Using an Exponential Approximation to Generalized Dirichlet Multinomial Distributions.使用广义狄利克雷多项分布的指数近似进行稀疏计数数据聚类
IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):89-102. doi: 10.1109/TNNLS.2020.3027539. Epub 2022 Jan 5.
9
Parsimonious mixtures of multivariate contaminated normal distributions.多元受污染正态分布的简约混合
Biom J. 2016 Nov;58(6):1506-1537. doi: 10.1002/bimj.201500144. Epub 2016 Aug 11.
10
A batch rival penalized expectation-maximization algorithm for Gaussian mixture clustering with automatic model selection.批量竞争期望最大化算法在自动模型选择下用于高斯混合聚类。
Comput Math Methods Med. 2012;2012:425730. doi: 10.1155/2012/425730. Epub 2012 Jan 30.

本文引用的文献

1
Regularized matrix data clustering and its application to image analysis.正则化矩阵数据聚类及其在图像分析中的应用。
Biometrics. 2021 Sep;77(3):890-902. doi: 10.1111/biom.13354. Epub 2020 Aug 24.
2
Regularized parameter estimation in high-dimensional gaussian mixture models.高维高斯混合模型中的正则化参数估计。
Neural Comput. 2011 Jun;23(6):1605-22. doi: 10.1162/NECO_a_00128. Epub 2011 Mar 11.
3
Penalized model-based clustering with unconstrained covariance matrices.具有无约束协方差矩阵的基于惩罚模型的聚类
Electron J Stat. 2009 Jan 1;3:1473-1496. doi: 10.1214/09-EJS487.
4
Sparse inverse covariance estimation with the graphical lasso.使用图模型选择法进行稀疏逆协方差估计。
Biostatistics. 2008 Jul;9(3):432-41. doi: 10.1093/biostatistics/kxm045. Epub 2007 Dec 12.