Suppr超能文献

均值向量的高维非参数多元检验

A High-Dimensional Nonparametric Multivariate Test for Mean Vector.

作者信息

Wang Lan, Peng Bo, Li Runze

机构信息

Associate Professor, School of Statistics, University of Minnesota, Minneapolis, MN 55455.

Graduate student, School of Statistics, University of Minnesota, Minneapolis, MN 55455.

出版信息

J Am Stat Assoc. 2015;110(512):1658-1669. doi: 10.1080/01621459.2014.988215. Epub 2016 Jan 15.

Abstract

This work is concerned with testing the population mean vector of nonnormal high-dimensional multivariate data. Several tests for high-dimensional mean vector, based on modifying the classical Hotelling test, have been proposed in the literature. Despite their usefulness, they tend to have unsatisfactory power performance for heavy-tailed multivariate data, which frequently arise in genomics and quantitative finance. This paper proposes a novel high-dimensional nonparametric test for the population mean vector for a general class of multivariate distributions. With the aid of new tools in modern probability theory, we proved that the limiting null distribution of the proposed test is normal under mild conditions when is substantially larger than . We further study the local power of the proposed test and compare its relative efficiency with a modified Hotelling test for high-dimensional data. An interesting finding is that the newly proposed test can have even more substantial power gain with large than the traditional nonparametric multivariate test does with finite fixed . We study the finite sample performance of the proposed test via Monte Carlo simulations. We further illustrate its application by an empirical analysis of a genomics data set.

摘要

这项工作关注于检验非正态高维多元数据的总体均值向量。文献中已经提出了几种基于修改经典霍特林检验的高维均值向量检验方法。尽管它们很有用,但对于重尾多元数据,它们的功效表现往往不尽人意,而重尾多元数据在基因组学和定量金融中经常出现。本文针对一般类别的多元分布,提出了一种用于总体均值向量的新型高维非参数检验方法。借助现代概率论中的新工具,我们证明了在温和条件下,当 远大于 时,所提出检验的极限零分布是正态的。我们进一步研究了所提出检验的局部功效,并将其相对效率与用于高维数据的修改后的霍特林检验进行比较。一个有趣的发现是,新提出的检验在 较大时,相比于传统非参数多元检验在有限固定 时,能获得更大的功效提升。我们通过蒙特卡罗模拟研究了所提出检验的有限样本性能。我们还通过对一个基因组学数据集的实证分析进一步说明了它的应用。

相似文献

1
A High-Dimensional Nonparametric Multivariate Test for Mean Vector.
J Am Stat Assoc. 2015;110(512):1658-1669. doi: 10.1080/01621459.2014.988215. Epub 2016 Jan 15.
2
Multivariate nonparametric techniques for astigmatism analysis.
J Cataract Refract Surg. 2010 Apr;36(4):594-602. doi: 10.1016/j.jcrs.2009.11.002.
3
Multivariate tests based on interpoint distances with application to magnetic resonance imaging.
Stat Methods Med Res. 2016 Dec;25(6):2593-2610. doi: 10.1177/0962280214529104. Epub 2014 Apr 16.
5
HYPOTHESIS TESTING ON LINEAR STRUCTURES OF HIGH DIMENSIONAL COVARIANCE MATRIX.
Ann Stat. 2019;47(6):3300-3334. doi: 10.1214/18-AOS1779. Epub 2019 Oct 31.
6
Linear Hypothesis Testing in Linear Models With High-Dimensional Responses.
J Am Stat Assoc. 2022;117(540):1738-1750. doi: 10.1080/01621459.2021.1884561. Epub 2021 Apr 27.
7
Finite sample t-tests for high-dimensional means.
J Multivar Anal. 2023 Jul;196. doi: 10.1016/j.jmva.2023.105183. Epub 2023 Mar 28.
9
Multivariate test power approximations for balanced linear mixed models in studies with missing data.
Stat Med. 2016 Jul 30;35(17):2921-37. doi: 10.1002/sim.6811. Epub 2015 Nov 24.
10
A new powerful nonparametric rank test for ordered alternative problem.
PLoS One. 2014 Nov 18;9(11):e112924. doi: 10.1371/journal.pone.0112924. eCollection 2014.

引用本文的文献

1
A Novel Approach of High Dimensional Linear Hypothesis Testing Problem.
J Am Stat Assoc. 2025 Mar 3. doi: 10.1080/01621459.2024.2428467.
2
3
Nonparametric two-sample tests of high dimensional mean vectors via random integration.
J Am Stat Assoc. 2024;119(545):701-714. doi: 10.1080/01621459.2022.2141636. Epub 2022 Dec 12.
4
Finite sample t-tests for high-dimensional means.
J Multivar Anal. 2023 Jul;196. doi: 10.1016/j.jmva.2023.105183. Epub 2023 Mar 28.
5
Adaptive Huber Regression on Markov-dependent Data.
Stoch Process Their Appl. 2022 Aug;150:802-818. doi: 10.1016/j.spa.2019.09.004. Epub 2019 Sep 25.
7
Speech-Driven Spectrotemporal Receptive Fields Beyond the Auditory Cortex.
Hear Res. 2021 Sep 1;408:108307. doi: 10.1016/j.heares.2021.108307. Epub 2021 Jul 10.
8
Adaptive Huber Regression.
J Am Stat Assoc. 2020;115(529):254-265. doi: 10.1080/01621459.2018.1543124. Epub 2019 Apr 22.
9
Variable Selection via Partial Correlation.
Stat Sin. 2017 Jul;27(3):983-996. doi: 10.5705/ss.202015.0473.

本文引用的文献

1
Reproducibility of differential gene detection across multiple microarray studies.
Annu Int Conf IEEE Eng Med Biol Soc. 2007;2007:4231-4. doi: 10.1109/IEMBS.2007.4353270.
2
3
Error distribution for gene expression data.
Stat Appl Genet Mol Biol. 2005;4:Article16. doi: 10.2202/1544-6115.1070. Epub 2005 Jul 12.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验