Clustering of longitudinal interval-valued data via mixture distribution under covariance separability.

Author information

Park Seongoh, Lim Johan, Choi Hyejeong, Kwak Minjung

Affiliations

Department of Statistics, Seoul National University, Seoul, Korea.

Department of Statistics, Yeungnam University, Gyeongsan, Korea.

Publication information

J Appl Stat. 2019 Nov 17;47(10):1739-1756. doi: 10.1080/02664763.2019.1692795. eCollection 2020.

DOI:10.1080/02664763.2019.1692795

PMID:35707136

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9042104/

Abstract

We consider the clustering of repeatedly measured 'min-max' type interval-valued data. We read the data as matrix variate data and assume the covariance matrix is separable for the model-based clustering (M-clustering). The use of a separable covariance matrix introduces several advantages in M-clustering, which include fewer samples required for a valid procedure. In addition, the numerical study shows that this structured matrix allows us to find the correct number of clusters more accurately compared to other commonly assumed covariance matrices. We apply the M-clustering with various covariance structures to clustering the longitudinal blood pressure data from the National Heart, Lung, and Blood Institute Growth and Health Study (NGHS).
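To make the sample-size advantage concrete, the following is a minimal sketch, not the authors' implementation. It assumes each subject contributes a p x q matrix, with p = 2 rows (interval minimum and maximum) and q repeated measurements, and that separability means the covariance of the vectorized matrix is a Kronecker product of a p x p matrix `U` and a q x q matrix `V`. The names `kron`, `n_params_unstructured`, `n_params_separable`, the example matrices, and the choice q = 10 are all illustrative assumptions. The ordering of the Kronecker factors depends on the vectorization convention.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of lists."""
    return [
        [a_ij * b_kl for a_ij in row_a for b_kl in row_b]
        for row_a in a
        for row_b in b
    ]


def n_params_unstructured(p, q):
    """Free parameters in an unstructured (pq x pq) symmetric covariance."""
    d = p * q
    return d * (d + 1) // 2


def n_params_separable(p, q):
    """Free parameters in U (p x p) and V (q x q) for Sigma = U kron V.

    One parameter is subtracted because (c * U) kron (V / c) gives the
    same Sigma for any c > 0, so the overall scale is shared.
    """
    return p * (p + 1) // 2 + q * (q + 1) // 2 - 1


# Hypothetical example: covariance between the interval minimum and
# maximum (U) and between two time points (V).
U = [[1.0, 0.5],
     [0.5, 1.0]]
V = [[1.0, 0.3],
     [0.3, 1.0]]
sigma = kron(U, V)  # 4 x 4 separable covariance

# Parameter counts for p = 2 (min, max) and q = 10 time points (assumed).
unstructured = n_params_unstructured(2, 10)
separable = n_params_separable(2, 10)
print(len(sigma), unstructured, separable)
```

Under these assumptions the separable model has far fewer covariance parameters per cluster than the unstructured one, which is consistent with the abstract's claim that fewer samples are needed for a valid procedure.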
