Suppr超能文献

综合稀疏偏最小二乘法。

Integrative sparse partial least squares.

机构信息

School of Statistics, Renmin University of China, Beijing, China.

School of Public Health, Yale University, New Haven, Connecticut, USA.

出版信息

Stat Med. 2021 Apr;40(9):2239-2256. doi: 10.1002/sim.8900. Epub 2021 Feb 8.

Abstract

Partial least squares, as a dimension reduction technique, has become increasingly important for its ability to deal with problems with a large number of variables. Since noisy variables may weaken estimation performance, the sparse partial least squares (SPLS) technique has been proposed to identify important variables and generate more interpretable results. However, the small sample size of a single dataset limits the performance of conventional methods. An effective solution comes from gathering information from multiple comparable studies. Integrative analysis has essential importance in multidatasets analysis. The main idea is to improve performance by assembling raw data from multiple independent datasets and analyzing them jointly. In this article, we develop an integrative SPLS (iSPLS) method using penalization based on the SPLS technique. The proposed approach consists of two penalties. The first penalty conducts variable selection under the context of integrative analysis. The second penalty, a contrasted penalty, is imposed to encourage the similarity of estimates across datasets and generate more sensible and accurate results. Computational algorithms are developed. Simulation experiments are conducted to compare iSPLS with alternative approaches. The practical utility of iSPLS is shown in the analysis of two TCGA gene expression data.

摘要

偏最小二乘法作为一种降维技术,因其能够处理大量变量的问题而变得越来越重要。由于噪声变量可能会削弱估计性能,因此提出了稀疏偏最小二乘法(SPLS)技术来识别重要变量并生成更具可解释性的结果。然而,单个数据集的小样本量限制了传统方法的性能。一个有效的解决方案是从多个可比研究中收集信息。综合分析在多数据集分析中具有重要意义。其主要思想是通过从多个独立数据集组装原始数据并联合分析来提高性能。在本文中,我们基于 SPLS 技术开发了一种使用惩罚的集成 SPLS(iSPLS)方法。所提出的方法包括两个惩罚项。第一个惩罚项在综合分析的背景下进行变量选择。第二个惩罚项,对比惩罚项,旨在鼓励数据集之间估计的相似性,并生成更合理和准确的结果。开发了计算算法。进行了模拟实验来比较 iSPLS 与替代方法。在对两个 TCGA 基因表达数据的分析中展示了 iSPLS 的实际效用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2058/8071349/3231d31052e6/nihms-1690455-f0001.jpg

相似文献

1
Integrative sparse partial least squares.综合稀疏偏最小二乘法。
Stat Med. 2021 Apr;40(9):2239-2256. doi: 10.1002/sim.8900. Epub 2021 Feb 8.
2
Integrative sparse principal component analysis of gene expression data.基因表达数据的整合稀疏主成分分析
Genet Epidemiol. 2017 Dec;41(8):844-865. doi: 10.1002/gepi.22089. Epub 2017 Nov 8.
3
Sparse partial least squares classification for high dimensional data.高维数据的稀疏偏最小二乘分类
Stat Appl Genet Mol Biol. 2010;9(1):Article17. doi: 10.2202/1544-6115.1492. Epub 2010 Mar 3.
4
Sparse partial least squares with group and subgroup structure.稀疏偏最小二乘与分组和子分组结构。
Stat Med. 2018 Oct 15;37(23):3338-3356. doi: 10.1002/sim.7821. Epub 2018 Jun 11.
5
iSFun: an R package for integrative dimension reduction analysis.iSFun:一个用于整合维度缩减分析的 R 包。
Bioinformatics. 2022 May 26;38(11):3134-3135. doi: 10.1093/bioinformatics/btac281.
8
Integrative Analysis of Cancer Diagnosis Studies with Composite Penalization.采用复合惩罚的癌症诊断研究综合分析
Scand Stat Theory Appl. 2014 Mar 1;41(1):87-103. doi: 10.1111/j.1467-9469.2012.00816.x.
10
Integrative Analysis of "-Omics" Data Using Penalty Functions.使用惩罚函数对“组学”数据进行综合分析。
Wiley Interdiscip Rev Comput Stat. 2015 Jan-Feb;7(1):99-108. doi: 10.1002/wics.1322.

本文引用的文献

4
Integrative Analysis of "-Omics" Data Using Penalty Functions.使用惩罚函数对“组学”数据进行综合分析。
Wiley Interdiscip Rev Comput Stat. 2015 Jan-Feb;7(1):99-108. doi: 10.1002/wics.1322.
5
: Coordinate Descent With Nonconvex Penalties.带非凸惩罚项的坐标下降法
J Am Stat Assoc. 2011;106(495):1125-1138. doi: 10.1198/jasa.2011.tm09738.
6
9
Identification of cancer genomic markers via integrative sparse boosting.通过集成稀疏提升识别癌症基因组标记物。
Biostatistics. 2012 Jul;13(3):509-22. doi: 10.1093/biostatistics/kxr033. Epub 2011 Oct 31.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验