基于稀疏典型相关分析的支持恢复：信息论与计算限制

On Support Recovery with Sparse CCA: Information Theoretic and Computational Limits.

作者信息

Laha Nilanjana, Mukherjee Rajarshi

机构信息

Department of Statistics, Texas A&M University, College Station, TX 77843.

Department of Biostatistics, Harvard T. H. Chan School of Public Health, 677 Huntington Ave, Boston, MA 02115.

出版信息

IEEE Trans Inf Theory. 2023 Mar;69(3):1695-1738. doi: 10.1109/tit.2022.3214201. Epub 2022 Oct 17.

DOI:10.1109/tit.2022.3214201

PMID:37842015

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10569110/

Abstract

In this paper, we consider asymptotically exact support recovery in the context of high dimensional and sparse Canonical Correlation Analysis (CCA). Our main results describe four regimes of interest based on information theoretic and computational considerations. In regimes of "low" sparsity we describe a simple, general, and computationally easy method for support recovery, whereas in a regime of "high" sparsity, it turns out that support recovery is information theoretically impossible. For the sake of information theoretic lower bounds, our results also demonstrate a non-trivial requirement on the "minimal" size of the nonzero elements of the canonical vectors that is required for asymptotically consistent support recovery. Subsequently, the regime of "moderate" sparsity is further divided into two subregimes. In the lower of the two sparsity regimes, we show that polynomial time support recovery is possible by using a sharp analysis of a co-ordinate thresholding [1] type method. In contrast, in the higher end of the moderate sparsity regime, appealing to the "Low Degree Polynomial" Conjecture [2], we provide evidence that polynomial time support recovery methods are inconsistent. Finally, we carry out numerical experiments to compare the efficacy of various methods discussed.

摘要

在本文中，我们考虑高维稀疏典型相关分析（CCA）背景下的渐近精确支持恢复。我们的主要结果基于信息论和计算方面的考虑描述了四种感兴趣的情况。在“低”稀疏度情况下，我们描述了一种简单、通用且计算简便的支持恢复方法，而在“高”稀疏度情况下，事实证明从信息论角度来看支持恢复是不可能的。为了得到信息论下界，我们的结果还表明了对于渐近一致支持恢复所需的典型向量非零元素“最小”规模的一个重要要求。随后，“中等”稀疏度情况进一步分为两个子情况。在两个稀疏度情况中较低的那个情况下，我们表明通过对一种坐标阈值化[1]类型方法进行精确分析，多项式时间支持恢复是可行的。相比之下，在中等稀疏度情况的较高端，借助“低次多项式”猜想[2]，我们提供证据表明多项式时间支持恢复方法是不一致的。最后，我们进行数值实验以比较所讨论的各种方法的有效性。

相似文献

On Support Recovery with Sparse CCA: Information Theoretic and Computational Limits.基于稀疏典型相关分析的支持恢复：信息论与计算限制

IEEE Trans Inf Theory. 2023 Mar;69(3):1695-1738. doi: 10.1109/tit.2022.3214201. Epub 2022 Oct 17.

On statistical inference with high-dimensional sparse CCA.关于高维稀疏典型相关分析的统计推断

Inf inference. 2023 Nov 17;12(4):iaad040. doi: 10.1093/imaiai/iaad040. eCollection 2023 Dec.

Sparse canonical correlation analysis from a predictive point of view.从预测角度看稀疏典型相关分析。

Biom J. 2015 Sep;57(5):834-51. doi: 10.1002/bimj.201400226. Epub 2015 Jul 6.

Optimal Feature Selection in High-Dimensional Discriminant Analysis.高维判别分析中的最优特征选择

IEEE Trans Inf Theory. 2015 Feb;61(2):1063-1083. doi: 10.1109/TIT.2014.2381241.

HYPOTHESIS TESTING FOR HIGH-DIMENSIONAL SPARSE BINARY REGRESSION.高维稀疏二元回归的假设检验

Ann Stat. 2015 Feb;43(1):352-381. doi: 10.1214/14-AOS1279.

An iterative penalized least squares approach to sparse canonical correlation analysis.一种用于稀疏典型相关分析的迭代惩罚最小二乘法。

Biometrics. 2019 Sep;75(3):734-744. doi: 10.1111/biom.13043. Epub 2019 Apr 9.

Sparse multiway canonical correlation analysis for multimodal stroke recovery data.稀疏多向典范相关分析在多模态中风康复数据中的应用。

Biom J. 2024 Mar;66(2):e2300037. doi: 10.1002/bimj.202300037.

Sparsity estimation from compressive projections via sparse random matrices.通过稀疏随机矩阵从压缩投影中进行稀疏性估计。

EURASIP J Adv Signal Process. 2018;2018(1):56. doi: 10.1186/s13634-018-0578-0. Epub 2018 Sep 10.

Dimension-wise sparse low-rank approximation of a matrix with application to variable selection in high-dimensional integrative analyzes of association.矩阵的维度稀疏低秩逼近及其在高维关联综合分析中的变量选择应用

J Appl Stat. 2021 Aug 19;49(15):3889-3907. doi: 10.1080/02664763.2021.1967892. eCollection 2022.

EXACT MINIMAX ESTIMATION OF THE PREDICTIVE DENSITY IN SPARSE GAUSSIAN MODELS.稀疏高斯模型中预测密度的精确极小极大估计

Ann Stat. 2015;43(3):937-961. doi: 10.1214/14-AOS1251.

引用本文的文献

On statistical inference with high-dimensional sparse CCA.关于高维稀疏典型相关分析的统计推断

Inf inference. 2023 Nov 17;12(4):iaad040. doi: 10.1093/imaiai/iaad040. eCollection 2023 Dec.

本文引用的文献

On statistical inference with high-dimensional sparse CCA.关于高维稀疏典型相关分析的统计推断

Inf inference. 2023 Nov 17;12(4):iaad040. doi: 10.1093/imaiai/iaad040. eCollection 2023 Dec.

Sparse canonical correlation to identify breast cancer related genes regulated by copy number aberrations.稀疏正则化典型相关分析鉴定拷贝数异常调控的乳腺癌相关基因。

PLoS One. 2022 Dec 30;17(12):e0276886. doi: 10.1371/journal.pone.0276886. eCollection 2022.

STRUCTURED CORRELATION DETECTION WITH APPLICATION TO COLOCALIZATION ANALYSIS IN DUAL-CHANNEL FLUORESCENCE MICROSCOPIC IMAGING.结构化相关性检测及其在双通道荧光显微镜成像共定位分析中的应用

Stat Sin. 2021 Jan;31(1):333-360. doi: 10.5705/ss.202018.0230.

An iterative penalized least squares approach to sparse canonical correlation analysis.一种用于稀疏典型相关分析的迭代惩罚最小二乘法。

Biometrics. 2019 Sep;75(3):734-744. doi: 10.1111/biom.13043. Epub 2019 Apr 9.

JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.用于多数据类型综合分析的联合与个体变异解释（JIVE）

Ann Appl Stat. 2013 Mar 1;7(1):523-542. doi: 10.1214/12-AOAS597.

On Consistency and Sparsity for Principal Components Analysis in High Dimensions.高维主成分分析中的一致性与稀疏性

J Am Stat Assoc. 2009 Jun 1;104(486):682-693. doi: 10.1198/jasa.2009.0121.

Dementia induces correlated reductions in white matter integrity and cortical thickness: a multivariate neuroimaging study with sparse canonical correlation analysis.痴呆症导致白质完整性和皮质厚度的相关降低：使用稀疏典型相关分析的多变量神经影像学研究。

Neuroimage. 2010 Apr 15;50(3):1004-16. doi: 10.1016/j.neuroimage.2010.01.041. Epub 2010 Jan 18.

An online multi-channel SSVEP-based brain-computer interface using a canonical correlation analysis method.一种基于典型相关分析方法的在线多通道稳态视觉诱发电位脑机接口。

J Neural Eng. 2009 Aug;6(4):046002. doi: 10.1088/1741-2560/6/4/046002. Epub 2009 Jun 3.

A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis.一种惩罚矩阵分解及其在稀疏主成分分析和典型相关分析中的应用。

Biostatistics. 2009 Jul;10(3):515-34. doi: 10.1093/biostatistics/kxp008. Epub 2009 Apr 17.

Sparse canonical methods for biological data integration: application to a cross-platform study.用于生物数据整合的稀疏典型方法：在一项跨平台研究中的应用

BMC Bioinformatics. 2009 Jan 26;10:34. doi: 10.1186/1471-2105-10-34.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于稀疏典型相关分析的支持恢复：信息论与计算限制

On Support Recovery with Sparse CCA: Information Theoretic and Computational Limits.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献