矩阵变量数据的模态聚类

Modal clustering of matrix-variate data.

作者信息

Ferraccioli Federico, Menardi Giovanna

机构信息

Padua, Italy Dipartimento di Scienze Statistiche, Università degli Studi di Padova.

出版信息

Adv Data Anal Classif. 2023;17(2):323-345. doi: 10.1007/s11634-022-00501-x. Epub 2022 May 5.

DOI:10.1007/s11634-022-00501-x

PMID:35529071

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9069429/

Abstract

The nonparametric formulation of density-based clustering, known as modal clustering, draws a correspondence between groups and the attraction domains of the modes of the density function underlying the data. Its probabilistic foundation allows for a natural, yet not trivial, generalization of the approach to the matrix-valued setting, increasingly widespread, for example, in longitudinal and multivariate spatio-temporal studies. In this work we introduce nonparametric estimators of matrix-variate distributions based on kernel methods, and analyze their asymptotic properties. Additionally, we propose a generalization of the mean-shift procedure for the identification of the modes of the estimated density. Given the intrinsic high dimensionality of matrix-variate data, we discuss some locally adaptive solutions to handle the problem. We test the procedure via extensive simulations, also with respect to some competitors, and illustrate its performance through two high-dimensional real data applications.

摘要

基于密度的聚类的非参数公式，即模态聚类，在数据底层密度函数的模式吸引域与组之间建立了对应关系。其概率基础允许将该方法自然但并非平凡地推广到矩阵值设置，例如在纵向和多变量时空研究中越来越普遍。在这项工作中，我们基于核方法引入了矩阵变量分布的非参数估计器，并分析了它们的渐近性质。此外，我们提出了一种均值漂移过程的推广，用于识别估计密度的模式。鉴于矩阵变量数据固有的高维度，我们讨论了一些局部自适应解决方案来处理该问题。我们通过广泛的模拟对该过程进行了测试，也与一些竞争对手进行了比较，并通过两个高维实际数据应用说明了其性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d914/9069429/f2d784a2cdf2/11634_2022_501_Fig1_HTML.jpg

相似文献

Modal clustering of matrix-variate data.矩阵变量数据的模态聚类

Adv Data Anal Classif. 2023;17(2):323-345. doi: 10.1007/s11634-022-00501-x. Epub 2022 May 5.

Clustering of longitudinal interval-valued data via mixture distribution under covariance separability.协方差可分性下基于混合分布的纵向区间值数据聚类

J Appl Stat. 2019 Nov 17;47(10):1739-1756. doi: 10.1080/02664763.2019.1692795. eCollection 2020.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Provable Convex Co-clustering of Tensors.张量的可证凸共聚类

J Mach Learn Res. 2020;21.

SimpleMKKM: Simple Multiple Kernel K-Means.SimpleMKKM：简单多核 K-Means。

IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):5174-5186. doi: 10.1109/TPAMI.2022.3198638. Epub 2023 Mar 7.

A Bayes optimal matrix-variate LDA for extraction of spatio-spectral features from EEG signals.一种用于从脑电图信号中提取时空频谱特征的贝叶斯最优矩阵变量线性判别分析方法。

Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:3955-8. doi: 10.1109/EMBC.2012.6346832.

Finite mixtures of matrix variate Poisson-log normal distributions for three-way count data.三向计数数据的矩阵变量泊松对数正态分布的有限混合。

Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad167.

Semi-Supervised Kernel Mean Shift Clustering.半监督核均值漂移聚类。

IEEE Trans Pattern Anal Mach Intell. 2014 Jun;36(6):1201-15. doi: 10.1109/TPAMI.2013.190.

Fast Nonparametric Density-Based Clustering of Large Data Sets Using a Stochastic Approximation Mean-Shift Algorithm.使用随机近似均值漂移算法对大数据集进行快速非参数基于密度的聚类

J Comput Graph Stat. 2016;25(3):899-916. doi: 10.1080/10618600.2015.1051625. Epub 2016 Aug 5.

Kernel Clustering: Density Biases and Solutions.核聚类：密度偏差与解决方案

IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):136-147. doi: 10.1109/TPAMI.2017.2780166. Epub 2017 Dec 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

矩阵变量数据的模态聚类

Modal clustering of matrix-variate data.

作者信息

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献