• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

广义典型相关分析中的变量选择

Variable selection for generalized canonical correlation analysis.

作者信息

Tenenhaus Arthur, Philippe Cathy, Guillemot Vincent, Le Cao Kim-Anh, Grill Jacques, Frouin Vincent

机构信息

SUPELEC, Plateau de moulon, 3 rue Joliot-Curie, 91192 Gif-sur-Yvette Cedex, France

CNRS-IGR-Paris XI university, UMR8203, 94805 Villejuif cedex, France.

出版信息

Biostatistics. 2014 Jul;15(3):569-83. doi: 10.1093/biostatistics/kxu001. Epub 2014 Feb 17.

DOI:10.1093/biostatistics/kxu001
PMID:24550197
Abstract

Regularized generalized canonical correlation analysis (RGCCA) is a generalization of regularized canonical correlation analysis to 3 or more sets of variables. RGCCA is a component-based approach which aims to study the relationships between several sets of variables. The quality and interpretability of the RGCCA components are likely to be affected by the usefulness and relevance of the variables in each block. Therefore, it is an important issue to identify within each block which subsets of significant variables are active in the relationships between blocks. In this paper, RGCCA is extended to address the issue of variable selection. Specifically, sparse generalized canonical correlation analysis (SGCCA) is proposed to combine RGCCA with an [Formula: see text]-penalty in a unified framework. Within this framework, blocks are not necessarily fully connected, which makes SGCCA a flexible method for analyzing a wide variety of practical problems. Finally, the versatility and usefulness of SGCCA are illustrated on a simulated dataset and on a 3-block dataset which combine gene expression, comparative genomic hybridization, and a qualitative phenotype measured on a set of 53 children with glioma. SGCCA is available on CRAN as part of the RGCCA package.

摘要

正则化广义典型相关分析(RGCCA)是正则化典型相关分析对三组或更多组变量的推广。RGCCA是一种基于成分的方法,旨在研究多组变量之间的关系。RGCCA成分的质量和可解释性可能会受到每个模块中变量的有用性和相关性的影响。因此,确定每个模块中哪些显著变量子集在模块间关系中起作用是一个重要问题。本文对RGCCA进行扩展以解决变量选择问题。具体而言,提出了稀疏广义典型相关分析(SGCCA),将RGCCA与一个[公式:见原文]惩罚项在一个统一框架中相结合。在此框架内,模块不一定是完全连接的,这使得SGCCA成为分析各种实际问题的灵活方法。最后,在一个模拟数据集和一个由基因表达、比较基因组杂交以及对一组53例神经胶质瘤患儿测量的定性表型组成的三模块数据集上说明了SGCCA的通用性和实用性。SGCCA作为RGCCA包的一部分可在CRAN上获取。

相似文献

1
Variable selection for generalized canonical correlation analysis.广义典型相关分析中的变量选择
Biostatistics. 2014 Jul;15(3):569-83. doi: 10.1093/biostatistics/kxu001. Epub 2014 Feb 17.
2
Regularized Generalized Canonical Correlation Analysis: A Framework for Sequential Multiblock Component Methods.正则化广义典型相关分析:一种用于顺序多块成分方法的框架。
Psychometrika. 2017 May 23. doi: 10.1007/s11336-017-9573-x.
3
A strategy for multimodal data integration: application to biomarkers identification in spinocerebellar ataxia.一种多模态数据整合策略:在脊髓小脑共济失调中生物标志物识别的应用。
Brief Bioinform. 2018 Nov 27;19(6):1356-1369. doi: 10.1093/bib/bbx060.
4
Multiway generalized canonical correlation analysis.多路广义典型相关分析
Biostatistics. 2022 Jan 13;23(1):240-256. doi: 10.1093/biostatistics/kxaa010.
5
Sparse canonical correlation analysis with application to genomic data integration.应用于基因组数据整合的稀疏典型相关分析。
Stat Appl Genet Mol Biol. 2009;8:Article 1. doi: 10.2202/1544-6115.1406. Epub 2009 Jan 6.
6
Sparse canonical correlation analysis from a predictive point of view.从预测角度看稀疏典型相关分析。
Biom J. 2015 Sep;57(5):834-51. doi: 10.1002/bimj.201400226. Epub 2015 Jul 6.
7
Multiblock variable influence on orthogonal projections (MB-VIOP) for enhanced interpretation of total, global, local and unique variations in OnPLS models.多块变量对正交投影(MB-VIOP)的影响,用于增强 OnPLS 模型中总变异性、全局变异性、局部变异性和独特变异性的解释。
BMC Bioinformatics. 2021 Apr 3;22(1):176. doi: 10.1186/s12859-021-04015-9.
8
Canonical Measure of Correlation (CMC) and Canonical Measure of Distance (CMD) between sets of data Part 2. Variable reduction.数据集之间的典型相关性度量(CMC)和典型距离度量(CMD) 第2部分。变量约简。
Anal Chim Acta. 2009 Aug 19;648(1):52-9. doi: 10.1016/j.aca.2009.06.035. Epub 2009 Jun 21.
9
A unified approach to multiple-set canonical correlation analysis and principal components analysis.多组典范相关分析与主成分分析的统一方法。
Br J Math Stat Psychol. 2013 May;66(2):308-21. doi: 10.1111/j.2044-8317.2012.02052.x. Epub 2012 May 22.
10
Generalized covariance-adjusted canonical correlation analysis with application to psychiatry.广义协方差调整典型相关分析及其在精神病学中的应用。
Stat Med. 2003 Feb 28;22(4):595-610. doi: 10.1002/sim.1332.

引用本文的文献

1
Framework for Brain-Derived Dimensions of Psychopathology.精神病理学脑源性维度框架
JAMA Psychiatry. 2025 Jun 18. doi: 10.1001/jamapsychiatry.2025.1246.
2
Algorithms and tools for data-driven omics integration to achieve multilayer biological insights: a narrative review.用于数据驱动的组学整合以实现多层生物学见解的算法和工具:一篇综述
J Transl Med. 2025 Apr 10;23(1):425. doi: 10.1186/s12967-025-06446-x.
3
Benchtop Proton NMR Spectroscopy for High-Throughput Lipoprotein Quantification in Human Serum and Plasma.用于人血清和血浆中高通量脂蛋白定量的台式质子核磁共振波谱法。
Anal Chem. 2025 Apr 1;97(12):6399-6409. doi: 10.1021/acs.analchem.4c04660. Epub 2025 Mar 17.
4
Multimodal data integration in early-stage breast cancer.早期乳腺癌的多模态数据整合
Breast. 2025 Apr;80:103892. doi: 10.1016/j.breast.2025.103892. Epub 2025 Jan 28.
5
NMFProfiler: a multi-omics integration method for samples stratified in groups.NMFProfiler:一种用于对分组样本进行多组学整合的方法。
Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf066.
6
Using multiomic integration to improve blood biomarkers of major depressive disorder: a case-control study.利用多组学整合改善重度抑郁症的血液生物标志物:一项病例对照研究。
EBioMedicine. 2025 Mar;113:105569. doi: 10.1016/j.ebiom.2025.105569. Epub 2025 Feb 5.
7
Bioactivity Profiling of Chemical Mixtures for Hazard Characterization.用于危害特征描述的化学混合物生物活性分析
Environ Sci Technol. 2025 Jan 14;59(1):291-301. doi: 10.1021/acs.est.4c11095. Epub 2024 Dec 20.
8
Stress-Resilience Impacts Psychological Wellbeing: Evidence from Brain-Gut Microbiome Interactions.应激适应力对心理健康的影响:来自脑-肠-微生物组相互作用的证据。
Nat Ment Health. 2024 Aug;2(8):935-950. doi: 10.1038/s44220-024-00266-6. Epub 2024 Jun 21.
9
Methods for multi-omic data integration in cancer research.癌症研究中的多组学数据整合方法。
Front Genet. 2024 Sep 19;15:1425456. doi: 10.3389/fgene.2024.1425456. eCollection 2024.
10
An extension of latent unknown clustering integrating multi-omics data (LUCID) incorporating incomplete omics data.整合多组学数据的潜在未知聚类扩展(LUCID),纳入不完整的组学数据。
Bioinform Adv. 2024 Aug 24;4(1):vbae123. doi: 10.1093/bioadv/vbae123. eCollection 2024.