IPAD: Stable Interpretable Forecasting with Knockoffs Inference.

Authors

Fan Yingying, Lv Jinchi, Sharifvaghefi Mahrad, Uematsu Yoshimasa

Affiliations

University of Southern California.

Tohoku University.

Publication information

J Am Stat Assoc. 2020;115(532):1822-1834. doi: 10.1080/01621459.2019.1654878. Epub 2019 Sep 17.

Abstract

Interpretability and stability are two important features that are desired in many contemporary big data applications arising in statistics, economics, and finance. While the former is enjoyed to some extent by many existing forecasting approaches, the latter in the sense of controlling the fraction of wrongly discovered features which can enhance greatly the interpretability is still largely underdeveloped. To this end, in this paper we exploit the general framework of model-X knockoffs introduced recently in Candès, Fan, Janson and Lv (2018), which is nonconventional for reproducible large-scale inference in that the framework is completely free of the use of p-values for significance testing, and suggest a new method of intertwined probabilistic factors decoupling (IPAD) for stable interpretable forecasting with knockoffs inference in high-dimensional models. The recipe of the method is constructing the knockoff variables by assuming a latent factor model that is exploited widely in economics and finance for the association structure of covariates. Our method and work are distinct from the existing literature in that we estimate the covariate distribution from data instead of assuming that it is known when constructing the knockoff variables, our procedure does not require any sample splitting, we provide theoretical justifications on the asymptotic false discovery rate control, and the theory for the power analysis is also established. Several simulation examples and the real data analysis further demonstrate that the newly suggested method has appealing finite-sample performance with desired interpretability and stability compared to some popularly used forecasting methods.
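
The recipe described in the abstract (fit a latent factor model to the covariates, build knockoff variables from the fitted model, then select features with knockoffs inference) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the number of factors `r` is known, that the columns of `X` are centered, and that the idiosyncratic errors are i.i.d. Gaussian. The function names `factor_knockoffs`, `knockoff_plus_threshold`, and `select_features` are hypothetical, and the marginal-correlation statistic is a simple stand-in for whatever importance statistic the paper actually uses.

```python
import numpy as np


def factor_knockoffs(X, r, rng):
    """Sketch: knockoffs X_tilde = C_hat + E_tilde under an assumed rank-r factor model."""
    # Truncated SVD gives the rank-r principal-component fit C_hat of X.
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    C_hat = (U[:, :r] * s[:r]) @ Vt[:r]
    E_hat = X - C_hat
    # Assumption: idiosyncratic errors are i.i.d. N(0, sigma^2);
    # sigma is estimated from the pooled residuals.
    sigma_hat = E_hat.std()
    # Draw fresh errors from the estimated distribution, independent of y.
    E_tilde = rng.normal(0.0, sigma_hat, size=X.shape)
    return C_hat + E_tilde


def knockoff_plus_threshold(W, q):
    """Knockoff+ threshold: smallest t whose estimated FDP is at most q."""
    for t in np.sort(np.abs(W[W != 0])):
        fdp_hat = (1 + np.sum(W <= -t)) / max(1, np.sum(W >= t))
        if fdp_hat <= q:
            return t
    return np.inf  # no threshold meets the target level: select nothing


def select_features(X, y, r, q, seed=0):
    """Return indices selected at target FDR level q, plus the statistics W."""
    rng = np.random.default_rng(seed)
    X_tilde = factor_knockoffs(X, r, rng)
    yc = y - y.mean()
    # Stand-in statistic: difference of absolute marginal correlations.
    # A large positive W_j suggests the original beats its knockoff.
    W = np.abs(X.T @ yc) - np.abs(X_tilde.T @ yc)
    t = knockoff_plus_threshold(W, q)
    return np.flatnonzero(W >= t), W
```

Calling `select_features(X, y, r=3, q=0.2)` on an `n x p` design matrix returns the indices whose statistic clears the threshold. No sample splitting is involved, in line with the abstract, because the same data are used both to estimate the factor structure and to compute the statistics.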
