用于MRI与基因数据联合分析的典型相关分析和偏最小二乘变体比较。

Comparison of variants of canonical correlation analysis and partial least squares for combined analysis of MRI and genetic data.

作者信息

Grellmann Claudia, Bitzer Sebastian, Neumann Jane, Westlye Lars T, Andreassen Ole A, Villringer Arno, Horstmann Annette

机构信息

Department of Neurology, Max Planck Institute for Human Cognitive and Brain Sciences, Stephanstraße 1A, 04103 Leipzig, Germany; Leipzig University Hospital, IFB Adiposity Diseases, Philipp-Rosenthal-Straße 27, 04103 Leipzig, Germany.

Department of Neurology, Max Planck Institute for Human Cognitive and Brain Sciences, Stephanstraße 1A, 04103 Leipzig, Germany.

出版信息

Neuroimage. 2015 Feb 15;107:289-310. doi: 10.1016/j.neuroimage.2014.12.025. Epub 2014 Dec 17.

DOI:10.1016/j.neuroimage.2014.12.025

PMID:25527238

Abstract

The standard analysis approach in neuroimaging genetics studies is the mass-univariate linear modeling (MULM) approach. From a statistical view, however, this approach is disadvantageous, as it is computationally intensive, cannot account for complex multivariate relationships, and has to be corrected for multiple testing. In contrast, multivariate methods offer the opportunity to include combined information from multiple variants to discover meaningful associations between genetic and brain imaging data. We assessed three multivariate techniques, partial least squares correlation (PLSC), sparse canonical correlation analysis (sparse CCA) and Bayesian inter-battery factor analysis (Bayesian IBFA), with respect to their ability to detect multivariate genotype-phenotype associations. Our goal was to systematically compare these three approaches with respect to their performance and to assess their suitability for high-dimensional and multi-collinearly dependent data as is the case in neuroimaging genetics studies. In a series of simulations using both linearly independent and multi-collinear data, we show that sparse CCA and PLSC are suitable even for very high-dimensional collinear imaging data sets. Among those two, the predictive power was higher for sparse CCA when voxel numbers were below 400 times sample size and candidate SNPs were considered. Accordingly, we recommend Sparse CCA for candidate phenotype, candidate SNP studies. When voxel numbers exceeded 500 times sample size, the predictive power was the highest for PLSC. Therefore, PLSC can be considered a promising technique for multivariate modeling of high-dimensional brain-SNP-associations. In contrast, Bayesian IBFA cannot be recommended, since additional post-processing steps were necessary to detect causal relations. To verify the applicability of sparse CCA and PLSC, we applied them to an experimental imaging genetics data set provided for us. Most importantly, application of both methods replicated the findings of this data set.

摘要

神经影像遗传学研究中的标准分析方法是单变量线性建模（MULM）方法。然而，从统计学角度来看，这种方法存在劣势，因为它计算量很大，无法考虑复杂的多变量关系，并且必须针对多重检验进行校正。相比之下，多变量方法提供了整合多个变异体的综合信息以发现遗传数据与脑成像数据之间有意义关联的机会。我们评估了三种多变量技术，即偏最小二乘相关分析（PLSC）、稀疏典型相关分析（稀疏CCA）和贝叶斯电池间因子分析（贝叶斯IBFA），考察它们检测多变量基因型-表型关联的能力。我们的目标是系统比较这三种方法的性能，并评估它们对神经影像遗传学研究中出现的高维和多重共线性相关数据的适用性。在一系列使用线性独立数据和多重共线数据的模拟中，我们表明稀疏CCA和PLSC即使对于非常高维的共线成像数据集也适用。在这两种方法中，当体素数量低于样本量的400倍且考虑候选单核苷酸多态性（SNP）时，稀疏CCA的预测能力更高。因此，对于候选表型、候选SNP研究，我们推荐使用稀疏CCA。当体素数量超过样本量的500倍时，PLSC的预测能力最高。因此，PLSC可被视为一种用于高维脑-SNP关联多变量建模的有前景的技术。相比之下，贝叶斯IBFA不推荐使用，因为需要额外的后处理步骤来检测因果关系。为了验证稀疏CCA和PLSC的适用性，我们将它们应用于为我们提供的一个实验性影像遗传学数据集。最重要的是，这两种方法的应用都重复了该数据集的研究结果。

相似文献

Comparison of variants of canonical correlation analysis and partial least squares for combined analysis of MRI and genetic data.用于MRI与基因数据联合分析的典型相关分析和偏最小二乘变体比较。

Neuroimage. 2015 Feb 15;107:289-310. doi: 10.1016/j.neuroimage.2014.12.025. Epub 2014 Dec 17.

Significant correlation between a set of genetic polymorphisms and a functional brain network revealed by feature selection and sparse Partial Least Squares.通过特征选择和稀疏偏最小二乘法揭示了一组遗传多态性与功能大脑网络之间的显著相关性。

Neuroimage. 2012 Oct 15;63(1):11-24. doi: 10.1016/j.neuroimage.2012.06.061. Epub 2012 Jul 8.

Identifying Candidate Genetic Associations with MRI-Derived AD-Related ROI via Tree-Guided Sparse Learning.通过树引导稀疏学习识别与 MRI 衍生的 AD 相关 ROI 相关的候选遗传关联。

IEEE/ACM Trans Comput Biol Bioinform. 2019 Nov-Dec;16(6):1986-1996. doi: 10.1109/TCBB.2018.2833487. Epub 2018 May 7.

Multi-Task Learning and Sparse Discriminant Canonical Correlation Analysis for Identification of Diagnosis-Specific Genotype-Phenotype Association.多任务学习和稀疏判别典范相关分析在鉴定特定于诊断的基因型 - 表型关联中的应用。

IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1390-1402. doi: 10.1109/TCBB.2024.3386406. Epub 2024 Oct 9.

Partial Least Squares (PLS) methods for neuroimaging: a tutorial and review.偏最小二乘法（PLS）在神经影像学中的方法：教程与综述。

Neuroimage. 2011 May 15;56(2):455-75. doi: 10.1016/j.neuroimage.2010.07.034. Epub 2010 Jul 23.

Canonical correlation analysis for multilabel classification: a least-squares formulation, extensions, and analysis.多标签分类的典范相关分析：最小二乘法公式、扩展及分析。

IEEE Trans Pattern Anal Mach Intell. 2011 Jan;33(1):194-200. doi: 10.1109/TPAMI.2010.160.

Structured and Sparse Canonical Correlation Analysis as a Brain-Wide Multi-Modal Data Fusion Approach.结构稀疏典型相关分析作为一种全脑多模态数据融合方法。

IEEE Trans Med Imaging. 2017 Jul;36(7):1438-1448. doi: 10.1109/TMI.2017.2681966. Epub 2017 Mar 14.

Random Projection for Fast and Efficient Multivariate Correlation Analysis of High-Dimensional Data: A New Approach.用于高维数据快速高效多变量相关性分析的随机投影：一种新方法。

Front Genet. 2016 Jun 7;7:102. doi: 10.3389/fgene.2016.00102. eCollection 2016.

Correspondence between fMRI and SNP data by group sparse canonical correlation analysis.通过组稀疏典型相关分析实现功能磁共振成像（fMRI）数据与单核苷酸多态性（SNP）数据之间的对应关系。

Med Image Anal. 2014 Aug;18(6):891-902. doi: 10.1016/j.media.2013.10.010. Epub 2013 Oct 31.

Radial basis function-sparse partial least squares for application to brain imaging data.径向基函数-稀疏偏最小二乘法在脑成像数据中的应用。

Comput Math Methods Med. 2013;2013:591032. doi: 10.1155/2013/591032. Epub 2013 May 13.

引用本文的文献

FPLS-DC: functional partial least squares through distance covariance for imaging genetics.FPLS-DC：用于影像遗传学的基于距离协方差的功能偏最小二乘法

Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae173.

Sparse Hierarchical Representation Learning on Functional Brain Networks for Prediction of Autism Severity Levels.用于预测自闭症严重程度的功能性脑网络稀疏分层表示学习

Front Neurosci. 2022 Jul 7;16:935431. doi: 10.3389/fnins.2022.935431. eCollection 2022.

Improved Interpretability of Brain-Behavior CCA With Domain-Driven Dimension Reduction.通过领域驱动的降维提高脑行为典型相关分析的可解释性

Front Neurosci. 2022 Jun 23;16:851827. doi: 10.3389/fnins.2022.851827. eCollection 2022.

Joint Sparse Collaborative Regression on Imaging Genetics Study of Schizophrenia.精神分裂症影像学遗传学研究中的联合稀疏协同回归。

IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1137-1146. doi: 10.1109/TCBB.2022.3172289. Epub 2023 Apr 3.

Identification of multimodal brain imaging association via a parameter decomposition based sparse multi-view canonical correlation analysis method.基于参数分解的稀疏多视图典型相关分析方法识别多模态脑影像关联。

BMC Bioinformatics. 2022 Apr 12;23(Suppl 3):128. doi: 10.1186/s12859-022-04669-z.

The "Neural Shift" of Sleep Quality and Cognitive Aging: A Resting-State MEG Study of Transient Neural Dynamics.睡眠质量与认知衰老的“神经偏移”：一项关于瞬态神经动力学的静息态脑磁图研究

Front Aging Neurosci. 2022 Jan 31;13:746236. doi: 10.3389/fnagi.2021.746236. eCollection 2021.

Differences in Performance of ASD and ADHD Subjects Facing Cognitive Loads in an Innovative Reasoning Experiment.在一项创新性推理实验中，自闭症谱系障碍（ASD）和注意力缺陷多动障碍（ADHD）受试者面对认知负荷时的表现差异。

Brain Sci. 2021 Nov 18;11(11):1531. doi: 10.3390/brainsci11111531.

Multiscale neurobiological correlates of human neuroticism.人类神经质的多尺度神经生物学相关性。

Hum Brain Mapp. 2020 Nov;41(16):4730-4743. doi: 10.1002/hbm.25153. Epub 2020 Aug 17.

Permutation inference for canonical correlation analysis.典范相关分析的置换推断。

Neuroimage. 2020 Oct 15;220:117065. doi: 10.1016/j.neuroimage.2020.117065. Epub 2020 Jun 27.

A technical review of canonical correlation analysis for neuroscience applications.神经科学应用中的典型相关分析技术综述。

Hum Brain Mapp. 2020 Sep;41(13):3807-3833. doi: 10.1002/hbm.25090. Epub 2020 Jun 27.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于MRI与基因数据联合分析的典型相关分析和偏最小二乘变体比较。

Comparison of variants of canonical correlation analysis and partial least squares for combined analysis of MRI and genetic data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献