Suppr超能文献

测量多元关联及其他。

Measuring multivariate association and beyond.

作者信息

Josse Julie, Holmes Susan

机构信息

Department of Statistics, Agrocampus Ouest - INRIA, Saclay Paris Sud University, France.

Department of Statistics, Stanford University, California, USA.

出版信息

Stat Surv. 2016;10:132-167. doi: 10.1214/16-SS116. Epub 2016 Nov 17.

Abstract

Simple correlation coefficients between two variables have been generalized to measure association between two matrices in many ways. Coefficients such as the RV coefficient, the distance covariance (dCov) coefficient and kernel based coefficients are being used by different research communities. Scientists use these coefficients to test whether two random vectors are linked. Once it has been ascertained that there is such association through testing, then a next step, often ignored, is to explore and uncover the association's underlying patterns. This article provides a survey of various measures of dependence between random vectors and tests of independence and emphasizes the connections and differences between the various approaches. After providing definitions of the coefficients and associated tests, we present the recent improvements that enhance their statistical properties and ease of interpretation. We summarize multi-table approaches and provide scenarii where the indices can provide useful summaries of heterogeneous multi-block data. We illustrate these different strategies on several examples of real data and suggest directions for future research.

摘要

两个变量之间的简单相关系数已被广泛推广,用于以多种方式度量两个矩阵之间的关联。不同的研究群体使用诸如RV系数、距离协方差(dCov)系数和基于核的系数等。科学家们使用这些系数来检验两个随机向量是否相关联。一旦通过检验确定存在这种关联,那么接下来一个常常被忽视的步骤就是探索并揭示这种关联的潜在模式。本文对随机向量之间各种相依性度量以及独立性检验进行了综述,并强调了各种方法之间的联系与差异。在给出系数及相关检验的定义之后,我们介绍了最近的改进,这些改进增强了它们的统计特性并便于解释。我们总结了多表方法,并给出了一些场景,在这些场景中这些指标可以为异构多块数据提供有用的汇总。我们在几个实际数据示例中说明了这些不同的策略,并提出了未来研究的方向。

相似文献

1
Measuring multivariate association and beyond.
Stat Surv. 2016;10:132-167. doi: 10.1214/16-SS116. Epub 2016 Nov 17.
2
Beyond Pearson's Correlation: Modern Nonparametric Independence Tests for Psychological Research.
Multivariate Behav Res. 2024 Sep-Oct;59(5):957-977. doi: 10.1080/00273171.2024.2347960. Epub 2024 Aug 4.
3
A fast small-sample kernel independence test for microbiome community-level association analysis.
Biometrics. 2017 Dec;73(4):1453-1463. doi: 10.1111/biom.12684. Epub 2017 Mar 10.
4
Resampling-based approaches to study variation in morphological modularity.
PLoS One. 2013 Jul 16;8(7):e69376. doi: 10.1371/journal.pone.0069376. Print 2013.
5
Dependence and independence: Structure and inference.
Stat Methods Med Res. 2017 Oct;26(5):2114-2132. doi: 10.1177/0962280215594198. Epub 2015 Jul 29.
6
Using empirical Bayes predictors from generalized linear mixed models to test and visualize associations among longitudinal outcomes.
Stat Methods Med Res. 2019 May;28(5):1399-1411. doi: 10.1177/0962280218758357. Epub 2018 Feb 28.
7
CONDITIONAL DISTANCE CORRELATION.
J Am Stat Assoc. 2015;110(512):1726-1734. doi: 10.1080/01621459.2014.993081. Epub 2015 Jan 23.
8
Conditional independence test by generalized Kendall's tau with generalized odds ratio.
Stat Methods Med Res. 2018 Nov;27(11):3224-3235. doi: 10.1177/0962280217695345. Epub 2017 Feb 23.
9
Estimating Feature-Label Dependence Using Gini Distance Statistics.
IEEE Trans Pattern Anal Mach Intell. 2021 Jun;43(6):1947-1963. doi: 10.1109/TPAMI.2019.2960358. Epub 2021 May 11.
10
A Distance-Based Kernel Association Test Based on the Generalized Linear Mixed Model for Correlated Microbiome Studies.
Front Genet. 2019 May 16;10:458. doi: 10.3389/fgene.2019.00458. eCollection 2019.

引用本文的文献

2
Craniofacial integration and modularity in untreated cleft lip and palate.
Clin Oral Investig. 2025 Mar 31;29(4):218. doi: 10.1007/s00784-025-06296-3.
3
A powerful framework for differential co-expression analysis of general risk factors.
bioRxiv. 2024 Dec 3:2024.11.29.626006. doi: 10.1101/2024.11.29.626006.
4
Diagnostic checks in time series models based on a new correlation coefficient of residuals.
J Appl Stat. 2023 Dec 22;51(12):2402-2419. doi: 10.1080/02664763.2023.2297155. eCollection 2024.
5
Identifying latent genetic interactions in genome-wide association studies using multiple traits.
Genome Med. 2024 Apr 25;16(1):62. doi: 10.1186/s13073-024-01329-0.
6
Delving into gene-set multiplex networks facilitated by a k-nearest neighbor-based measure of similarity.
Comput Struct Biotechnol J. 2023 Oct 11;21:4988-5002. doi: 10.1016/j.csbj.2023.09.042. eCollection 2023.
7
An exploration of linkage fine-mapping on sequences from case-control studies.
Genet Epidemiol. 2023 Feb;47(1):78-94. doi: 10.1002/gepi.22502. Epub 2022 Sep 1.
8
Detecting the Guttman effect with the help of ordinal correspondence analysis in synchrotron X-ray diffraction data analysis.
J Appl Stat. 2020 Aug 26;49(2):291-316. doi: 10.1080/02664763.2020.1810644. eCollection 2022.
9
Comparative analysis of molecular fingerprints in prediction of drug combination effects.
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab291.

本文引用的文献

1
How well do multivariate data sets match? The advantages of a Procrustean superimposition approach over the Mantel test.
Oecologia. 2001 Oct;129(2):169-178. doi: 10.1007/s004420100720. Epub 2001 Oct 1.
2
Does nasal echolocation influence the modularity of the mammal skull?
J Evol Biol. 2013 Nov;26(11):2520-6. doi: 10.1111/jeb.12235. Epub 2013 Sep 10.
3
A class of multivariate distribution-free tests of independence based on graphs.
J Stat Plan Inference. 2012 Dec;142(12):3097-3106. doi: 10.1016/j.jspi.2012.06.003.
4
A distance-based test of association between paired heterogeneous genomic data.
Bioinformatics. 2013 Oct 15;29(20):2555-63. doi: 10.1093/bioinformatics/btt450. Epub 2013 Aug 5.
5
Resampling-based approaches to study variation in morphological modularity.
PLoS One. 2013 Jul 16;8(7):e69376. doi: 10.1371/journal.pone.0069376. Print 2013.
7
Exploratory analysis of multiple omics datasets using the adjusted RV coefficient.
Stat Appl Genet Mol Biol. 2011;10:Article 14. doi: 10.2202/1544-6115.1540.
8
10
Matrix correlations for high-dimensional data: the modified RV-coefficient.
Bioinformatics. 2009 Feb 1;25(3):401-5. doi: 10.1093/bioinformatics/btn634. Epub 2008 Dec 10.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验