P值的不均匀性可能在维度发散的早期出现。

Nonuniformity of P-values Can Occur Early in Diverging Dimensions.

作者信息

Fan Yingying, Demirkaya Emre, Lv Jinchi

机构信息

Data Sciences and Operations Department, University of Southern California, Los Angeles, CA 90089, USA.

Business Analytics & Statistics, The University of Tennessee, Knoxville, Knoxville, TN 37996-4140, USA.

出版信息

J Mach Learn Res. 2019;20.

PMID:32190012

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7079742/

Abstract

Evaluating the joint significance of covariates is of fundamental importance in a wide range of applications. To this end, p-values are frequently employed and produced by algorithms that are powered by classical large-sample asymptotic theory. It is well known that the conventional p-values in Gaussian linear model are valid even when the dimensionality is a non-vanishing fraction of the sample size, but can break down when the design matrix becomes singular in higher dimensions or when the error distribution deviates from Gaussianity. A natural question is when the conventional p-values in generalized linear models become invalid in diverging dimensions. We establish that such a breakdown can occur early in nonlinear models. Our theoretical characterizations are confirmed by simulation studies.

摘要

评估协变量的联合显著性在广泛的应用中至关重要。为此，p值经常被使用，并由基于经典大样本渐近理论的算法生成。众所周知，高斯线性模型中的传统p值即使在维度是样本量的非零比例时也是有效的，但当设计矩阵在高维中变得奇异或误差分布偏离高斯性时可能会失效。一个自然的问题是广义线性模型中的传统p值在维度发散时何时变得无效。我们证明这种失效可能在非线性模型中很早就会出现。我们的理论特征通过模拟研究得到了证实。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d472/7079742/0a11b2a80927/nihms-1566718-f0001.jpg

相似文献

Nonuniformity of P-values Can Occur Early in Diverging Dimensions.P值的不均匀性可能在维度发散的早期出现。

J Mach Learn Res. 2019;20.

Asymptotic Theory of Eigenvectors for Random Matrices with Diverging Spikes.具有发散尖峰的随机矩阵特征向量的渐近理论

J Am Stat Assoc. 2022;117(538):996-1009. doi: 10.1080/01621459.2020.1840990. Epub 2020 Dec 8.

Accurate and Efficient -value Calculation via Gaussian Approximation: a Novel Monte-Carlo Method.通过高斯近似进行准确高效的价值计算：一种新型蒙特卡罗方法。

J Am Stat Assoc. 2019;114(525):384-392. doi: 10.1080/01621459.2017.1407776. Epub 2018 Jun 28.

Testing for treatment effect in covariate-adaptive randomized trials with generalized linear models and omitted covariates.使用广义线性模型和遗漏协变量在协变量自适应随机试验中检验治疗效果。

Stat Methods Med Res. 2021 Sep;30(9):2148-2164. doi: 10.1177/09622802211008206. Epub 2021 Apr 26.

On singular values of large dimensional lag- sample auto-correlation matrices.关于大维滞后样本自相关矩阵的奇异值

J Multivar Anal. 2023 Sep;197. doi: 10.1016/j.jmva.2023.105205. Epub 2023 Jun 1.

Projection pursuit in high dimensions.高维中的投影寻踪。

Proc Natl Acad Sci U S A. 2018 Sep 11;115(37):9151-9156. doi: 10.1073/pnas.1801177115. Epub 2018 Aug 27.

ARE DISCOVERIES SPURIOUS? DISTRIBUTIONS OF MAXIMUM SPURIOUS CORRELATIONS AND THEIR APPLICATIONS.发现是虚假的吗？最大虚假相关性的分布及其应用。

Ann Stat. 2018 Jun;46(3):989-1017. doi: 10.1214/17-AOS1575. Epub 2018 May 3.

Finite sample t-tests for high-dimensional means.高维均值的有限样本t检验。

J Multivar Anal. 2023 Jul;196. doi: 10.1016/j.jmva.2023.105183. Epub 2023 Mar 28.

Testing hypotheses under adaptive randomization with continuous covariates in clinical trials.临床试验中具有连续协变量的适应性随机化下的假设检验。

Stat Methods Med Res. 2019 Jun;28(6):1609-1621. doi: 10.1177/0962280218770231. Epub 2018 May 17.

Infrared behavior in systems with a broken continuous symmetry: classical O(N) model versus interacting bosons.具有破缺连续对称性的系统中的红外行为：经典O(N)模型与相互作用玻色子

Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Mar;83(3 Pt 1):031120. doi: 10.1103/PhysRevE.83.031120. Epub 2011 Mar 18.

引用本文的文献

High-Dimensional Knockoffs Inference for Time Series Data.时间序列数据的高维仿冒品推断

J Am Stat Assoc. 2025 Feb 27. doi: 10.1080/01621459.2024.2431344.

DeepLINK: Deep learning inference using knockoffs with applications to genomics.DeepLINK：使用 Knockoffs 进行深度学习推断及其在基因组学中的应用。

Proc Natl Acad Sci U S A. 2021 Sep 7;118(36). doi: 10.1073/pnas.2104683118.

IPAD: Stable Interpretable Forecasting with Knockoffs Inference.IPAD：基于仿冒品推断的稳定可解释预测

J Am Stat Assoc. 2020;115(532):1822-1834. doi: 10.1080/01621459.2019.1654878. Epub 2019 Sep 17.

RANK: Large-Scale Inference with Graphical Nonlinear Knockoffs.RANK：基于图形非线性仿样的大规模推断

J Am Stat Assoc. 2020;115(529):362-379. doi: 10.1080/01621459.2018.1546589. Epub 2019 Apr 11.

A modern maximum-likelihood theory for high-dimensional logistic regression.一种高维逻辑回归的现代极大似然理论。

Proc Natl Acad Sci U S A. 2019 Jul 16;116(29):14516-14525. doi: 10.1073/pnas.1810420116. Epub 2019 Jul 1.

本文引用的文献

RANK: Large-Scale Inference with Graphical Nonlinear Knockoffs.RANK：基于图形非线性仿样的大规模推断

J Am Stat Assoc. 2020;115(529):362-379. doi: 10.1080/01621459.2018.1546589. Epub 2019 Apr 11.

A modern maximum-likelihood theory for high-dimensional logistic regression.一种高维逻辑回归的现代极大似然理论。

Proc Natl Acad Sci U S A. 2019 Jul 16;116(29):14516-14525. doi: 10.1073/pnas.1810420116. Epub 2019 Jul 1.

On robust regression with high-dimensional predictors.高维预测变量的鲁棒回归。

Proc Natl Acad Sci U S A. 2013 Sep 3;110(36):14557-62. doi: 10.1073/pnas.1307842110. Epub 2013 Aug 16.

Optimal M-estimation in high-dimensional regression.高维回归中的最优 M 估计。

Proc Natl Acad Sci U S A. 2013 Sep 3;110(36):14563-8. doi: 10.1073/pnas.1307845110. Epub 2013 Aug 16.

Non-Concave Penalized Likelihood with NP-Dimensionality.具有NP维数的非凹惩罚似然法

IEEE Trans Inf Theory. 2011 Aug;57(8):5467-5484. doi: 10.1109/TIT.2011.2158486.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

P值的不均匀性可能在维度发散的早期出现。

Nonuniformity of P-values Can Occur Early in Diverging Dimensions.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献