Suppr超能文献

稀疏指数族主成分分析

Sparse Exponential Family Principal Component Analysis.

作者信息

Lu Meng, Huang Jianhua Z, Qian Xiaoning

机构信息

Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, US, 77840.

Department of Statistics, Texas A&M University, College Station, TX, US, 77840.

出版信息

Pattern Recognit. 2016 Dec;60:681-691. doi: 10.1016/j.patcog.2016.05.024. Epub 2016 May 21.

Abstract

We propose a Sparse exponential family Principal Component Analysis (SePCA) method suitable for any type of data following exponential family distributions, to achieve simultaneous dimension reduction and variable selection for better interpretation of the results. Because of the generality of exponential family distributions, the method can be applied to a wide range of applications, in particular when analyzing high dimensional next-generation sequencing data and genetic mutation data in genomics. The use of sparsity-inducing penalty helps produce sparse principal component loading vectors such that the principal components can focus on informative variables. By using an equivalent dual form of the formulated optimization problem for SePCA, we derive optimal solutions with efficient iterative closed-form updating rules. The results from both simulation experiments and real-world applications have demonstrated the superiority of our SePCA in reconstruction accuracy and computational efficiency over traditional exponential family PCA (ePCA), the existing Sparse PCA (SPCA) and Sparse Logistic PCA (SLPCA) algorithms.

摘要

我们提出了一种适用于任何遵循指数族分布的数据类型的稀疏指数族主成分分析(SePCA)方法,以实现同时降维和变量选择,从而更好地解释结果。由于指数族分布具有一般性,该方法可应用于广泛的应用场景,特别是在分析基因组学中的高维下一代测序数据和基因突变数据时。使用稀疏诱导惩罚有助于产生稀疏的主成分载荷向量,使得主成分能够聚焦于信息变量。通过使用SePCA公式化优化问题的等效对偶形式,我们推导出了具有高效迭代闭式更新规则的最优解。模拟实验和实际应用的结果均表明,我们的SePCA在重构精度和计算效率方面优于传统的指数族主成分分析(ePCA)、现有的稀疏主成分分析(SPCA)和稀疏逻辑主成分分析(SLPCA)算法。

相似文献

1
Sparse Exponential Family Principal Component Analysis.稀疏指数族主成分分析
Pattern Recognit. 2016 Dec;60:681-691. doi: 10.1016/j.patcog.2016.05.024. Epub 2016 May 21.
2
Simple exponential family PCA.简单指数族主成分分析。
IEEE Trans Neural Netw Learn Syst. 2013 Mar;24(3):485-97. doi: 10.1109/TNNLS.2012.2234134.
3
Stochastic convex sparse principal component analysis.随机凸稀疏主成分分析
EURASIP J Bioinform Syst Biol. 2016 Sep 9;2016(1):15. doi: 10.1186/s13637-016-0045-x. eCollection 2016 Dec.
4
Structured Sparse Principal Components Analysis With the TV-Elastic Net Penalty.基于 TV-弹性网络罚项的结构稀疏主成分分析。
IEEE Trans Med Imaging. 2018 Feb;37(2):396-407. doi: 10.1109/TMI.2017.2749140. Epub 2017 Sep 4.
5
Sparse Principal Component Analysis With Preserved Sparsity Pattern.具有保留稀疏模式的稀疏主成分分析
IEEE Trans Image Process. 2019 Jul;28(7):3274-3285. doi: 10.1109/TIP.2019.2895464. Epub 2019 Jan 25.
6
SPARSE LOGISTIC PRINCIPAL COMPONENTS ANALYSIS FOR BINARY DATA.二元数据的稀疏逻辑主成分分析
Ann Appl Stat. 2010 Sep 1;4(3):1579-1601. doi: 10.1214/10-AOAS327SUPP.
8
Sparse Principal Component Analysis via Rotation and Truncation.基于旋转和截断的稀疏主成分分析。
IEEE Trans Neural Netw Learn Syst. 2016 Apr;27(4):875-90. doi: 10.1109/TNNLS.2015.2427451. Epub 2015 Dec 22.

本文引用的文献

1
SPARSE LOGISTIC PRINCIPAL COMPONENTS ANALYSIS FOR BINARY DATA.二元数据的稀疏逻辑主成分分析
Ann Appl Stat. 2010 Sep 1;4(3):1579-1601. doi: 10.1214/10-AOAS327SUPP.
2
A haplotype map of the human genome.人类基因组单倍型图谱。
Nature. 2005 Oct 27;437(7063):1299-320. doi: 10.1038/nature04226.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验