• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于同时进行降维和变量选择的稀疏偏最小二乘回归。

Sparse partial least squares regression for simultaneous dimension reduction and variable selection.

作者信息

Chun Hyonho, Keleş Sündüz

机构信息

University of Wisconsin Madison, USA.

出版信息

J R Stat Soc Series B Stat Methodol. 2010 Jan;72(1):3-25. doi: 10.1111/j.1467-9868.2009.00723.x.

DOI:10.1111/j.1467-9868.2009.00723.x
PMID:20107611
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2810828/
Abstract

Partial least squares regression has been an alternative to ordinary least squares for handling multicollinearity in several areas of scientific research since the 1960s. It has recently gained much attention in the analysis of high dimensional genomic data. We show that known asymptotic consistency of the partial least squares estimator for a univariate response does not hold with the very large p and small n paradigm. We derive a similar result for a multivariate response regression with partial least squares. We then propose a sparse partial least squares formulation which aims simultaneously to achieve good predictive performance and variable selection by producing sparse linear combinations of the original predictors. We provide an efficient implementation of sparse partial least squares regression and compare it with well-known variable selection and dimension reduction approaches via simulation experiments. We illustrate the practical utility of sparse partial least squares regression in a joint analysis of gene expression and genomewide binding data.

摘要

自20世纪60年代以来,偏最小二乘回归一直是普通最小二乘法的一种替代方法,用于处理多个科学研究领域中的多重共线性问题。最近,它在高维基因组数据分析中备受关注。我们表明,对于单变量响应,偏最小二乘估计量已知的渐近一致性在p非常大而n非常小的范式下并不成立。我们针对多变量响应回归与偏最小二乘法得出了类似结果。然后,我们提出了一种稀疏偏最小二乘公式,旨在通过生成原始预测变量的稀疏线性组合,同时实现良好的预测性能和变量选择。我们提供了稀疏偏最小二乘回归的有效实现,并通过模拟实验将其与著名的变量选择和降维方法进行比较。我们在基因表达与全基因组结合数据的联合分析中说明了稀疏偏最小二乘回归的实际效用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89e7/2810828/ccb5547735ce/rssb0072-0003-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89e7/2810828/8baf4011caac/rssb0072-0003-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89e7/2810828/ccb5547735ce/rssb0072-0003-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89e7/2810828/8baf4011caac/rssb0072-0003-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89e7/2810828/ccb5547735ce/rssb0072-0003-f1.jpg

相似文献

1
Sparse partial least squares regression for simultaneous dimension reduction and variable selection.用于同时进行降维和变量选择的稀疏偏最小二乘回归。
J R Stat Soc Series B Stat Methodol. 2010 Jan;72(1):3-25. doi: 10.1111/j.1467-9868.2009.00723.x.
2
Expression quantitative trait loci mapping with multivariate sparse partial least squares regression.使用多变量稀疏偏最小二乘回归进行表达数量性状基因座定位。
Genetics. 2009 May;182(1):79-90. doi: 10.1534/genetics.109.100362. Epub 2009 Mar 6.
3
Sparse partial least-squares regression for high-throughput survival data analysis.用于高通量生存数据分析的稀疏偏最小二乘回归
Stat Med. 2013 Dec 30;32(30):5340-52. doi: 10.1002/sim.5975. Epub 2013 Sep 18.
4
Sparse partial least squares classification for high dimensional data.高维数据的稀疏偏最小二乘分类
Stat Appl Genet Mol Biol. 2010;9(1):Article17. doi: 10.2202/1544-6115.1492. Epub 2010 Mar 3.
5
Envelope-based partial partial least squares with application to cytokine-based biomarker analysis for COVID-19.基于信封的偏最小二乘部分法及其在基于细胞因子的 COVID-19 生物标志物分析中的应用。
Stat Med. 2022 Oct 15;41(23):4578-4592. doi: 10.1002/sim.9526. Epub 2022 Jul 15.
6
Sparse Regression by Projection and Sparse Discriminant Analysis.基于投影的稀疏回归与稀疏判别分析
J Comput Graph Stat. 2015 Apr 1;24(2):416-438. doi: 10.1080/10618600.2014.907094.
7
Dimension reduction and variable selection for genomic selection: application to predicting milk yield in Holsteins.降维与变量选择在基因组选择中的应用:以荷斯坦奶牛产奶量预测为例
J Anim Breed Genet. 2011 Aug;128(4):247-57. doi: 10.1111/j.1439-0388.2011.00917.x. Epub 2011 Mar 28.
8
Integrative sparse partial least squares.综合稀疏偏最小二乘法。
Stat Med. 2021 Apr;40(9):2239-2256. doi: 10.1002/sim.8900. Epub 2021 Feb 8.
9
Sparse partial least squares with group and subgroup structure.稀疏偏最小二乘与分组和子分组结构。
Stat Med. 2018 Oct 15;37(23):3338-3356. doi: 10.1002/sim.7821. Epub 2018 Jun 11.
10
Fitting and Cross-Validating Cox Models to Censored Big Data With Missing Values Using Extensions of Partial Least Squares Regression Models.使用偏最小二乘回归模型的扩展方法对带有缺失值的删失大数据进行Cox模型拟合和交叉验证
Front Big Data. 2021 Nov 1;4:684794. doi: 10.3389/fdata.2021.684794. eCollection 2021.

引用本文的文献

1
Decoding longitudinal microbiome trajectories: an interpretable machine learning approach for biomarker discovery and prediction.解码纵向微生物组轨迹:一种用于生物标志物发现和预测的可解释机器学习方法。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf374.
2
A systematic benchmark of integrative strategies for microbiome-metabolome data.微生物组-代谢组数据整合策略的系统基准测试
Commun Biol. 2025 Jul 25;8(1):1100. doi: 10.1038/s42003-025-08515-9.
3
Disruption of gut microbiome and metabolome in treatment-naïve children with attention deficit hyperactivity disorder.

本文引用的文献

1
Expression quantitative trait loci mapping with multivariate sparse partial least squares regression.使用多变量稀疏偏最小二乘回归进行表达数量性状基因座定位。
Genetics. 2009 May;182(1):79-90. doi: 10.1534/genetics.109.100362. Epub 2009 Mar 6.
2
Group SCAD regression analysis for microarray time course gene expression data.用于微阵列时间进程基因表达数据的SCAD回归分析组。
Bioinformatics. 2007 Jun 15;23(12):1486-94. doi: 10.1093/bioinformatics/btm125. Epub 2007 Apr 26.
3
Partial least squares: a versatile tool for the analysis of high-dimensional genomic data.
未经治疗的注意力缺陷多动障碍儿童的肠道微生物组和代谢组紊乱。
BMC Microbiol. 2025 Jul 2;25(1):381. doi: 10.1186/s12866-025-04048-7.
4
Pathway-Specific Insights into Colorectal Cancer Through Comprehensive Multi-Omics Data Integration.通过综合多组学数据整合对结直肠癌进行特定通路的深入研究。
Biology (Basel). 2025 Apr 25;14(5):468. doi: 10.3390/biology14050468.
5
Low density marker-based effectiveness and efficiency of early-generation genomic selection relative to phenotype-based selection in dolichos bean (Lablab purpureus L. Sweet).基于低密度标记的菜豆(Lablab purpureus L. Sweet)早期基因组选择相对于基于表型选择的有效性和效率
Plant Genome. 2025 Jun;18(2):e70039. doi: 10.1002/tpg2.70039.
6
sTPLS: identifying common and specific correlated patterns under multiple biological conditions.sTPLS:识别多种生物学条件下的共同和特定相关模式。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf195.
7
An Integrative Multi-Omics Random Forest Framework for Robust Biomarker Discovery.一种用于稳健生物标志物发现的综合多组学随机森林框架。
bioRxiv. 2025 Mar 6:2025.03.05.641533. doi: 10.1101/2025.03.05.641533.
8
Image-based and ML-driven analysis for assessing blueberry fruit quality.基于图像和机器学习驱动的蓝莓果实品质评估分析。
Heliyon. 2025 Jan 27;11(3):e42288. doi: 10.1016/j.heliyon.2025.e42288. eCollection 2025 Feb 15.
9
Prioritizing chemicals of emerging concern in the Great Lakes Basin using covariance of chemical concentrations and diverse biological responses from a variety of species.利用化学物质浓度的协方差以及来自各种物种的多样生物反应,对五大湖流域新出现的关注化学品进行优先级排序。
Environ Toxicol Chem. 2025 Mar 1;44(3):764-776. doi: 10.1093/etojnl/vgae094.
10
Innovative Infrared Spectroscopic Technologies for the Prediction of Deoxynivalenol in Wheat.用于预测小麦中脱氧雪腐镰刀菌烯醇的创新红外光谱技术
ACS Food Sci Technol. 2025 Jan 8;5(1):209-217. doi: 10.1021/acsfoodscitech.4c00730. eCollection 2025 Jan 17.
偏最小二乘法:一种用于分析高维基因组数据的通用工具。
Brief Bioinform. 2007 Jan;8(1):32-44. doi: 10.1093/bib/bbl016. Epub 2006 May 26.
4
Predicting transcription factor activities from combined analysis of microarray and ChIP data: a partial least squares approach.通过微阵列和染色质免疫沉淀数据的联合分析预测转录因子活性:一种偏最小二乘法
Theor Biol Med Model. 2005 Jun 24;2:23. doi: 10.1186/1742-4682-2-23.
5
Modeling the relationship between LVAD support time and gene expression changes in the human heart by penalized partial least squares.通过惩罚偏最小二乘法建立左心室辅助装置支持时间与人类心脏基因表达变化之间的关系模型。
Bioinformatics. 2004 Apr 12;20(6):888-94. doi: 10.1093/bioinformatics/btg499. Epub 2004 Jan 29.
6
Transcriptional regulatory networks in Saccharomyces cerevisiae.酿酒酵母中的转录调控网络。
Science. 2002 Oct 25;298(5594):799-804. doi: 10.1126/science.1075090.
7
'Gene shaving' as a method for identifying distinct sets of genes with similar expression patterns.“基因消减”作为一种识别具有相似表达模式的不同基因集的方法。
Genome Biol. 2000;1(2):RESEARCH0003. doi: 10.1186/gb-2000-1-2-research0003. Epub 2000 Aug 4.
8
Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.通过微阵列杂交全面鉴定酿酒酵母细胞周期调控基因。
Mol Biol Cell. 1998 Dec;9(12):3273-97. doi: 10.1091/mbc.9.12.3273.