• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

二元数据的稀疏逻辑主成分分析

SPARSE LOGISTIC PRINCIPAL COMPONENTS ANALYSIS FOR BINARY DATA.

作者信息

Lee Seokho, Huang Jianhua Z, Hu Jianhua

机构信息

Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA,

出版信息

Ann Appl Stat. 2010 Sep 1;4(3):1579-1601. doi: 10.1214/10-AOAS327SUPP.

DOI:10.1214/10-AOAS327SUPP
PMID:21116451
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2992445/
Abstract

We develop a new principal components analysis (PCA) type dimension reduction method for binary data. Different from the standard PCA which is defined on the observed data, the proposed PCA is defined on the logit transform of the success probabilities of the binary observations. Sparsity is introduced to the principal component (PC) loading vectors for enhanced interpretability and more stable extraction of the principal components. Our sparse PCA is formulated as solving an optimization problem with a criterion function motivated from penalized Bernoulli likelihood. A Majorization-Minimization algorithm is developed to efficiently solve the optimization problem. The effectiveness of the proposed sparse logistic PCA method is illustrated by application to a single nucleotide polymorphism data set and a simulation study.

摘要

我们为二元数据开发了一种新的主成分分析(PCA)类型的降维方法。与基于观测数据定义的标准PCA不同,所提出的PCA是基于二元观测成功概率的对数变换来定义的。在主成分(PC)载荷向量中引入稀疏性,以增强可解释性并更稳定地提取主成分。我们的稀疏PCA被表述为求解一个具有基于惩罚伯努利似然的准则函数的优化问题。开发了一种主元化-最小化算法来有效求解该优化问题。通过应用于一个单核苷酸多态性数据集和一项模拟研究,说明了所提出的稀疏逻辑PCA方法的有效性。

相似文献

1
SPARSE LOGISTIC PRINCIPAL COMPONENTS ANALYSIS FOR BINARY DATA.二元数据的稀疏逻辑主成分分析
Ann Appl Stat. 2010 Sep 1;4(3):1579-1601. doi: 10.1214/10-AOAS327SUPP.
2
Stochastic convex sparse principal component analysis.随机凸稀疏主成分分析
EURASIP J Bioinform Syst Biol. 2016 Sep 9;2016(1):15. doi: 10.1186/s13637-016-0045-x. eCollection 2016 Dec.
3
Sparse Principal Component Analysis With Preserved Sparsity Pattern.具有保留稀疏模式的稀疏主成分分析
IEEE Trans Image Process. 2019 Jul;28(7):3274-3285. doi: 10.1109/TIP.2019.2895464. Epub 2019 Jan 25.
4
Sparse Exponential Family Principal Component Analysis.稀疏指数族主成分分析
Pattern Recognit. 2016 Dec;60:681-691. doi: 10.1016/j.patcog.2016.05.024. Epub 2016 May 21.
5
Structured Sparse Principal Components Analysis With the TV-Elastic Net Penalty.基于 TV-弹性网络罚项的结构稀疏主成分分析。
IEEE Trans Med Imaging. 2018 Feb;37(2):396-407. doi: 10.1109/TMI.2017.2749140. Epub 2017 Sep 4.
6
Sparsity-based signal extraction using dual Q-factors for gearbox fault detection.基于双 Q 因子的稀疏信号提取在齿轮箱故障检测中的应用。
ISA Trans. 2018 Aug;79:147-160. doi: 10.1016/j.isatra.2018.05.009. Epub 2018 May 26.
7
Edge-group sparse PCA for network-guided high dimensional data analysis.基于边缘群稀疏 PCA 的网络引导高维数据分析。
Bioinformatics. 2018 Oct 15;34(20):3479-3487. doi: 10.1093/bioinformatics/bty362.
8
A Guide for Sparse PCA: Model Comparison and Applications.稀疏 PCA 指南:模型比较与应用。
Psychometrika. 2021 Dec;86(4):893-919. doi: 10.1007/s11336-021-09773-2. Epub 2021 Jun 29.
9
Sparse Principal Component Analysis via Rotation and Truncation.基于旋转和截断的稀疏主成分分析。
IEEE Trans Neural Netw Learn Syst. 2016 Apr;27(4):875-90. doi: 10.1109/TNNLS.2015.2427451. Epub 2015 Dec 22.
10
Supervised Discriminative Sparse PCA for Com-Characteristic Gene Selection and Tumor Classification on Multiview Biological Data.基于多视图生物数据的共特征基因选择和肿瘤分类的有监督判别稀疏 PCA
IEEE Trans Neural Netw Learn Syst. 2019 Oct;30(10):2926-2937. doi: 10.1109/TNNLS.2019.2893190. Epub 2019 Feb 22.

引用本文的文献

1
Bayesian inference on high-dimensional multivariate binary responses.高维多元二元响应的贝叶斯推断。
J Am Stat Assoc. 2024;119(548):2560-2571. doi: 10.1080/01621459.2023.2260053. Epub 2023 Nov 9.
2
Functional Multivariable Logistic Regression With an Application to HIV Viral Suppression Prediction.用于HIV病毒抑制预测的功能多变量逻辑回归
Biom J. 2024 Jul;66(5):e202300081. doi: 10.1002/bimj.202300081.
3
Learning from Binary Multiway Data: Probabilistic Tensor Decomposition and its Statistical Optimality.从二元多路数据中学习:概率张量分解及其统计最优性。
J Mach Learn Res. 2020 Jul;21.
4
The "DOLPHINS" Project: A Low-Cost Real-Time Multivariate Process Control From Large Sensor Arrays Providing Sparse Binary Data.“海豚”项目:一种基于提供稀疏二进制数据的大型传感器阵列的低成本实时多变量过程控制。
Front Chem. 2021 Sep 3;9:734132. doi: 10.3389/fchem.2021.734132. eCollection 2021.
5
THREE-WAY CLUSTERING OF MULTI-TISSUE MULTI-INDIVIDUAL GENE EXPRESSION DATA USING SEMI-NONNEGATIVE TENSOR DECOMPOSITION.使用半非负张量分解对多组织多个体基因表达数据进行三路聚类
Ann Appl Stat. 2019 Jun;13(2):1103-1127. doi: 10.1214/18-aoas1228. Epub 2019 Jun 17.
6
Low Entropy Sub-Networks Prevent the Integration of Metabolomic and Transcriptomic Data.低熵子网阻碍代谢组学和转录组学数据的整合。
Entropy (Basel). 2020 Oct 31;22(11):1238. doi: 10.3390/e22111238.
7
Morbidity and Mortality After Acute Myocardial Infarction After Elective Major Noncardiac Surgery.择期非心脏大手术后急性心肌梗死后的发病率和死亡率。
J Cardiothorac Vasc Anesth. 2021 Mar;35(3):834-842. doi: 10.1053/j.jvca.2020.10.016. Epub 2020 Oct 15.
8
A Hierarchical Framework for State-Space Matrix Inference and Clustering.用于状态空间矩阵推理与聚类的分层框架
Ann Appl Stat. 2016 Sep;10(3):1348-1372. doi: 10.1214/16-AOAS938. Epub 2016 Sep 28.
9
Sparse Exponential Family Principal Component Analysis.稀疏指数族主成分分析
Pattern Recognit. 2016 Dec;60:681-691. doi: 10.1016/j.patcog.2016.05.024. Epub 2016 May 21.
10
Supervised categorical principal component analysis for genome-wide association analyses.监督类别主成分分析在全基因组关联分析中的应用。
BMC Genomics. 2014;15 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2164-15-S1-S10. Epub 2014 Jan 24.

本文引用的文献

1
Variable Selection using MM Algorithms.使用MM算法进行变量选择
Ann Stat. 2005;33(4):1617-1642. doi: 10.1214/009053605000000200.
2
Correction of population stratification in large multi-ethnic association studies.大型多民族关联研究中群体分层的校正
PLoS One. 2008 Jan 2;3(1):e1382. doi: 10.1371/journal.pone.0001382.
3
A haplotype map of the human genome.人类基因组单倍型图谱。
Nature. 2005 Oct 27;437(7063):1299-320. doi: 10.1038/nature04226.
4
Detect and adjust for population stratification in population-based association study using genomic control markers: an application of Affymetrix Genechip Human Mapping 10K array.利用基因组对照标记在基于人群的关联研究中检测并校正人群分层:Affymetrix基因芯片人类映射10K阵列的应用
Eur J Hum Genet. 2004 Dec;12(12):1001-6. doi: 10.1038/sj.ejhg.5201273.
5
Categorization of humans in biomedical research: genes, race and disease.生物医学研究中的人类分类:基因、种族与疾病。
Genome Biol. 2002 Jul 1;3(7):comment2007. doi: 10.1186/gb-2002-3-7-comment2007.
6
The essence of SNPs.单核苷酸多态性的本质。
Gene. 1999 Jul 8;234(2):177-86. doi: 10.1016/s0378-1119(99)00219-x.
7
Increasing the information content of STS-based genome maps: identifying polymorphisms in mapped STSs.增加基于序列标签位点(STS)的基因组图谱的信息含量:鉴定已定位STS中的多态性。
Genomics. 1996 Jan 1;31(1):123-6. doi: 10.1006/geno.1996.0019.
8
The transmission/disequilibrium test: history, subdivision, and admixture.传递/不平衡检验:历史、细分与混合
Am J Hum Genet. 1995 Aug;57(2):455-64.