Suppr超能文献

用于同时进行降维和变量选择的稀疏偏最小二乘回归。

Sparse partial least squares regression for simultaneous dimension reduction and variable selection.

作者信息

Chun Hyonho, Keleş Sündüz

机构信息

University of Wisconsin Madison, USA.

出版信息

J R Stat Soc Series B Stat Methodol. 2010 Jan;72(1):3-25. doi: 10.1111/j.1467-9868.2009.00723.x.

Abstract

Partial least squares regression has been an alternative to ordinary least squares for handling multicollinearity in several areas of scientific research since the 1960s. It has recently gained much attention in the analysis of high dimensional genomic data. We show that known asymptotic consistency of the partial least squares estimator for a univariate response does not hold with the very large p and small n paradigm. We derive a similar result for a multivariate response regression with partial least squares. We then propose a sparse partial least squares formulation which aims simultaneously to achieve good predictive performance and variable selection by producing sparse linear combinations of the original predictors. We provide an efficient implementation of sparse partial least squares regression and compare it with well-known variable selection and dimension reduction approaches via simulation experiments. We illustrate the practical utility of sparse partial least squares regression in a joint analysis of gene expression and genomewide binding data.

摘要

自20世纪60年代以来,偏最小二乘回归一直是普通最小二乘法的一种替代方法,用于处理多个科学研究领域中的多重共线性问题。最近,它在高维基因组数据分析中备受关注。我们表明,对于单变量响应,偏最小二乘估计量已知的渐近一致性在p非常大而n非常小的范式下并不成立。我们针对多变量响应回归与偏最小二乘法得出了类似结果。然后,我们提出了一种稀疏偏最小二乘公式,旨在通过生成原始预测变量的稀疏线性组合,同时实现良好的预测性能和变量选择。我们提供了稀疏偏最小二乘回归的有效实现,并通过模拟实验将其与著名的变量选择和降维方法进行比较。我们在基因表达与全基因组结合数据的联合分析中说明了稀疏偏最小二乘回归的实际效用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89e7/2810828/8baf4011caac/rssb0072-0003-f2.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验