Tensor envelope mixture model for simultaneous clustering and multiway dimension reduction.

Affiliation

Department of Statistics, Florida State University, Tallahassee, Florida, USA.

Publication information

Biometrics. 2022 Sep;78(3):1067-1079. doi: 10.1111/biom.13486. Epub 2021 May 26.

DOI:10.1111/biom.13486
PMID:34010459
Abstract

In the form of multidimensional arrays, tensor data have become increasingly prevalent in modern scientific studies and biomedical applications such as computational biology, brain imaging analysis, and process monitoring systems. These data are intrinsically heterogeneous, with complex dependencies and structure. Therefore, ad hoc dimension reduction methods on tensor data may lack statistical efficiency and can obscure essential findings. Model-based clustering is a cornerstone of multivariate statistics and unsupervised learning; however, existing methods and algorithms are not designed for tensor-variate samples. In this article, we propose a tensor envelope mixture model (TEMM) for simultaneous clustering and multiway dimension reduction of tensor data. TEMM incorporates tensor-structure-preserving dimension reduction into mixture modeling and drastically reduces the number of free parameters and estimative variability. An expectation-maximization-type algorithm is developed to obtain likelihood-based estimators of the cluster means and covariances, which are jointly parameterized and constrained onto a series of lower dimensional subspaces known as the tensor envelopes. We demonstrate the encouraging empirical performance of the proposed method in extensive simulation studies and a real data application in comparison with existing vector and tensor clustering methods.
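
To give a feel for why preserving tensor structure reduces the number of free parameters, the following minimal sketch counts parameters for two simpler Gaussian mixtures. It is not the TEMM parameter count, which the abstract does not state. The function names, the example dimensions, and the assumption of a single covariance shared across clusters are all illustrative choices, not taken from the article. The first model vectorizes each tensor and fits an unstructured covariance; the second assumes a separable (Kronecker-product) covariance with one factor per tensor mode.

```python
from math import prod


def vectorized_gmm_params(dims, n_clusters):
    """Free parameters of a Gaussian mixture on vectorized tensors.

    Assumes (for illustration only) one unstructured covariance matrix
    shared by all clusters.
    """
    p = prod(dims)                      # length of the vectorized tensor
    weights = n_clusters - 1            # mixing proportions sum to one
    means = n_clusters * p              # one mean vector per cluster
    covariance = p * (p + 1) // 2       # symmetric p x p matrix
    return weights + means + covariance


def separable_gmm_params(dims, n_clusters):
    """Same mixture, but with a Kronecker-product (separable) covariance.

    Each tensor mode gets its own symmetric covariance factor; M - 1 scale
    constants are not identifiable in a Kronecker product of M factors.
    """
    p = prod(dims)
    weights = n_clusters - 1
    means = n_clusters * p
    covariance = sum(d * (d + 1) // 2 for d in dims) - (len(dims) - 1)
    return weights + means + covariance


if __name__ == "__main__":
    dims = (10, 10, 10)   # hypothetical 10 x 10 x 10 tensor samples
    n_clusters = 3        # hypothetical number of clusters
    print("vectorized:", vectorized_gmm_params(dims, n_clusters))
    print("separable: ", separable_gmm_params(dims, n_clusters))
```

In this toy setting the covariance alone drops from 500,500 parameters to 163. Per the abstract, TEMM goes further by constraining the means and covariances jointly onto lower dimensional subspaces (the tensor envelopes); that step is not modeled here.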

Similar articles

1
Tensor envelope mixture model for simultaneous clustering and multiway dimension reduction.
Biometrics. 2022 Sep;78(3):1067-1079. doi: 10.1111/biom.13486. Epub 2021 May 26.
2
Provable Convex Co-clustering of Tensors.
J Mach Learn Res. 2020;21.
3
Regularized matrix data clustering and its application to image analysis.
Biometrics. 2021 Sep;77(3):890-902. doi: 10.1111/biom.13354. Epub 2020 Aug 24.
4
caBIG VISDA: modeling, visualization, and discovery for cluster analysis of genomic data.
BMC Bioinformatics. 2008 Sep 18;9:383. doi: 10.1186/1471-2105-9-383.
5
A multiple kernel density clustering algorithm for incomplete datasets in bioinformatics.
BMC Syst Biol. 2018 Nov 22;12(Suppl 6):111. doi: 10.1186/s12918-018-0630-6.
6
Automated gating of flow cytometry data via robust model-based clustering.
Cytometry A. 2008 Apr;73(4):321-32. doi: 10.1002/cyto.a.20531.
7
Epitope profiling via mixture modeling of ranked data.
Stat Med. 2014 Sep 20;33(21):3738-58. doi: 10.1002/sim.6224. Epub 2014 Jun 5.
8
Generic, network schema agnostic sparse tensor factorization for single-pass clustering of heterogeneous information networks.
PLoS One. 2017 Feb 28;12(2):e0172323. doi: 10.1371/journal.pone.0172323. eCollection 2017.
9
Bayesian model averaging of naive Bayes for clustering.
IEEE Trans Syst Man Cybern B Cybern. 2006 Oct;36(5):1149-61. doi: 10.1109/tsmcb.2006.874132.
10
Performance Evaluation of Missing-Value Imputation Clustering Based on a Multivariate Gaussian Mixture Model.
PLoS One. 2016 Aug 23;11(8):e0161112. doi: 10.1371/journal.pone.0161112. eCollection 2016.

Cited by

1
Optimal variable clustering for high-dimensional matrix valued data.
Inf inference. 2025 Mar 12;14(1):iaaf001. doi: 10.1093/imaiai/iaaf001. eCollection 2025 Mar.