• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

张量分解与稀疏对数线性模型

TENSOR DECOMPOSITIONS AND SPARSE LOG-LINEAR MODELS.

作者信息

Johndrow James E, Bhattacharya Anirban, Dunson David B

机构信息

Duke University.

Texas A&M University.

出版信息

Ann Stat. 2017;45(1):1-38. doi: 10.1214/15-AOS1414. Epub 2017 Feb 21.

DOI:10.1214/15-AOS1414
PMID:29332971
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5764221/
Abstract

Contingency table analysis routinely relies on log-linear models, with latent structure analysis providing a common alternative. Latent structure models lead to a reduced rank tensor factorization of the probability mass function for multivariate categorical data, while log-linear models achieve dimensionality reduction through sparsity. Little is known about the relationship between these notions of dimensionality reduction in the two paradigms. We derive several results relating the support of a log-linear model to nonnegative ranks of the associated probability tensor. Motivated by these findings, we propose a new collapsed Tucker class of tensor decompositions, which bridge existing PARAFAC and Tucker decompositions, providing a more flexible framework for parsimoniously characterizing multivariate categorical data. Taking a Bayesian approach to inference, we illustrate empirical advantages of the new decompositions.

摘要

列联表分析通常依赖于对数线性模型,潜在结构分析则提供了一种常见的替代方法。潜在结构模型导致多变量分类数据的概率质量函数的秩降低张量分解,而对数线性模型通过稀疏性实现降维。对于这两种范式中这些降维概念之间的关系,人们知之甚少。我们得出了几个将对数线性模型的支撑与相关概率张量的非负秩联系起来的结果。受这些发现的启发,我们提出了一种新的折叠塔克张量分解类,它连接了现有的PARAFAC和塔克分解,为简洁地表征多变量分类数据提供了一个更灵活的框架。采用贝叶斯推理方法,我们展示了新分解的实证优势。

相似文献

1
TENSOR DECOMPOSITIONS AND SPARSE LOG-LINEAR MODELS.张量分解与稀疏对数线性模型
Ann Stat. 2017;45(1):1-38. doi: 10.1214/15-AOS1414. Epub 2017 Feb 21.
2
Algorithms for sparse nonnegative Tucker decompositions.稀疏非负塔克分解算法。
Neural Comput. 2008 Aug;20(8):2112-31. doi: 10.1162/neco.2008.11-06-407.
3
Bayesian factorizations of big sparse tensors.大稀疏张量的贝叶斯因式分解
J Am Stat Assoc. 2015;110(512):1562-1576. doi: 10.1080/01621459.2014.983233. Epub 2016 Jan 15.
4
Efficient Nonnegative Tucker Decompositions: Algorithms and Uniqueness.高效非负 Tucker 分解:算法与唯一性。
IEEE Trans Image Process. 2015 Dec;24(12):4990-5003. doi: 10.1109/TIP.2015.2478396. Epub 2015 Sep 14.
5
Simplex Factor Models for Multivariate Unordered Categorical Data.多元无序分类数据的单纯形因子模型
J Am Stat Assoc. 2012 Mar 1;107(497):362-377. doi: 10.1080/01621459.2011.646934.
6
Bayesian modeling of temporal dependence in large sparse contingency tables.大型稀疏列联表中时间依赖性的贝叶斯建模。
J Am Stat Assoc. 2013 Jan 1;108(504):1324-1338. doi: 10.1080/01621459.2013.823866.
7
Kronecker-Basis-Representation Based Tensor Sparsity and Its Applications to Tensor Recovery.基于克罗内克基表示的张量稀疏性及其在张量恢复中的应用
IEEE Trans Pattern Anal Mach Intell. 2018 Aug;40(8):1888-1902. doi: 10.1109/TPAMI.2017.2734888. Epub 2017 Aug 2.
8
Bayesian Robust Tensor Factorization for Incomplete Multiway Data.贝叶斯稳健张量分解在不完全多路数据中的应用。
IEEE Trans Neural Netw Learn Syst. 2016 Apr;27(4):736-48. doi: 10.1109/TNNLS.2015.2423694. Epub 2015 Jun 9.
9
Tucker Tensor Regression and Neuroimaging Analysis.塔克张量回归与神经影像分析
Stat Biosci. 2018 Dec;10(3):520-545. doi: 10.1007/s12561-018-9215-6. Epub 2018 Mar 7.
10
Bayesian CP Factorization of Incomplete Tensors with Automatic Rank Determination.贝叶斯 CP 因子分解具有自动秩确定的不完全张量。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1751-63. doi: 10.1109/TPAMI.2015.2392756.

引用本文的文献

1
Dimension-Grouped Mixed Membership Models for Multivariate Categorical Data.多元分类数据的维度分组混合成员模型
J Mach Learn Res. 2023 Feb;24.
2
COMPOSITE MIXTURE OF LOG-LINEAR MODELS WITH APPLICATION TO PSYCHIATRIC STUDIES.对数线性模型的复合混合物及其在精神病学研究中的应用
Ann Appl Stat. 2022 Jun;16(2):765-790. doi: 10.1214/21-aoas1515. Epub 2022 Jun 13.
3
Bayesian nonparametric multiway regression for clustered binomial data.用于聚类二项式数据的贝叶斯非参数多元回归

本文引用的文献

1
Bayesian factorizations of big sparse tensors.大稀疏张量的贝叶斯因式分解
J Am Stat Assoc. 2015;110(512):1562-1576. doi: 10.1080/01621459.2014.983233. Epub 2016 Jan 15.
2
Bayesian modeling of temporal dependence in large sparse contingency tables.大型稀疏列联表中时间依赖性的贝叶斯建模。
J Am Stat Assoc. 2013 Jan 1;108(504):1324-1338. doi: 10.1080/01621459.2013.823866.
3
Simplex Factor Models for Multivariate Unordered Categorical Data.多元无序分类数据的单纯形因子模型
Stat (Int Stat Inst). 2021 Dec;10(1). doi: 10.1002/sta4.378. Epub 2021 Mar 24.
4
Integrating sample similarities into latent class analysis: a tree-structured shrinkage approach.将样本相似度纳入潜在类别分析:一种树状收缩方法。
Biometrics. 2023 Mar;79(1):264-279. doi: 10.1111/biom.13580. Epub 2021 Nov 10.
5
Theoretical limits of microclustering for record linkage.记录链接微聚类的理论极限
Biometrika. 2018 Jun;105(2):431-446. doi: 10.1093/biomet/asy003. Epub 2018 Mar 19.
6
Exploring dependence between categorical variables: Benefits and limitations of using variable selection within Bayesian clustering in relation to log-linear modelling with interaction terms.探索分类变量之间的依赖性:在贝叶斯聚类中使用变量选择相对于带有交互项的对数线性建模的优点和局限性。
J Stat Plan Inference. 2016 Jun;173:47-63. doi: 10.1016/j.jspi.2016.01.002.
J Am Stat Assoc. 2012 Mar 1;107(497):362-377. doi: 10.1080/01621459.2011.646934.
4
Nonparametric Bayes Modeling of Multivariate Categorical Data.多变量分类数据的非参数贝叶斯建模
J Am Stat Assoc. 2012 Jan 1;104(487):1042-1051. doi: 10.1198/jasa.2009.tm08439.
5
Decomposition and model selection for large contingency tables.大型列联表的分解与模型选择
Biom J. 2010 Apr;52(2):233-52. doi: 10.1002/bimj.200900083.
6
Log-Linear Models for Gene Association.基因关联的对数线性模型
J Am Stat Assoc. 2009;104(486):597-607. doi: 10.1198/jasa.2009.0025.
7
Some mathematical notes on three-mode factor analysis.关于三模式因子分析的一些数学注释。
Psychometrika. 1966 Sep;31(3):279-311. doi: 10.1007/BF02289464.