Suppr超能文献

非参数加法模型的核仿冒品选择

Kernel Knockoffs Selection for Nonparametric Additive Models.

作者信息

Dai Xiaowu, Lyu Xiang, Li Lexin

机构信息

University of California, Berkeley.

出版信息

J Am Stat Assoc. 2023;118(543):2158-2170. doi: 10.1080/01621459.2022.2039671. Epub 2022 Mar 14.

Abstract

Thanks to its fine balance between model flexibility and interpretability, the nonparametric additive model has been widely used, and variable selection for this type of model has been frequently studied. However, none of the existing solutions can control the false discovery rate (FDR) unless the sample size tends to infinity. The knockoff framework is a recent proposal that can address this issue, but few knockoff solutions are directly applicable to nonparametric models. In this article, we propose a novel kernel knockoffs selection procedure for the nonparametric additive model. We integrate three key components: the knockoffs, the subsampling for stability, and the random feature mapping for nonparametric function approximation. We show that the proposed method is guaranteed to control the FDR for any sample size, and achieves a power that approaches one as the sample size tends to infinity. We demonstrate the efficacy of our method through intensive simulations and comparisons with the alternative solutions. our proposal thus makes useful contributions to the methodology of nonparametric variable selection, FDR-based inference, as well as knockoffs.

摘要

由于非参数加法模型在模型灵活性和可解释性之间取得了良好平衡,它已被广泛使用,并且针对此类模型的变量选择也得到了频繁研究。然而,现有的解决方案都无法控制错误发现率(FDR),除非样本量趋于无穷大。仿冒框架是最近提出的一种可以解决此问题的方法,但很少有仿冒解决方案能直接应用于非参数模型。在本文中,我们为非参数加法模型提出了一种新颖的核仿冒选择程序。我们整合了三个关键组件:仿冒、用于稳定性的子采样以及用于非参数函数逼近的随机特征映射。我们表明,所提出的方法能够保证在任何样本量下都控制FDR,并且当样本量趋于无穷大时,其检验功效趋近于1。我们通过大量模拟以及与替代解决方案的比较来证明我们方法的有效性。因此,我们的提议为非参数变量选择、基于FDR的推断以及仿冒方法做出了有益贡献。

相似文献

1
Kernel Knockoffs Selection for Nonparametric Additive Models.
J Am Stat Assoc. 2023;118(543):2158-2170. doi: 10.1080/01621459.2022.2039671. Epub 2022 Mar 14.
2
Knockoff boosted tree for model-free variable selection.
Bioinformatics. 2021 May 17;37(7):976-983. doi: 10.1093/bioinformatics/btaa770.
3
DeepLINK: Deep learning inference using knockoffs with applications to genomics.
Proc Natl Acad Sci U S A. 2021 Sep 7;118(36). doi: 10.1073/pnas.2104683118.
4
RANK: Large-Scale Inference with Graphical Nonlinear Knockoffs.
J Am Stat Assoc. 2020;115(529):362-379. doi: 10.1080/01621459.2018.1546589. Epub 2019 Apr 11.
5
IPAD: Stable Interpretable Forecasting with Knockoffs Inference.
J Am Stat Assoc. 2020;115(532):1822-1834. doi: 10.1080/01621459.2019.1654878. Epub 2019 Sep 17.
6
Gene hunting with hidden Markov model knockoffs.
Biometrika. 2019 Mar;106(1):1-18. doi: 10.1093/biomet/asy033. Epub 2018 Aug 4.
7
Bayesian variable selection using Knockoffs with applications to genomics.
Comput Stat. 2022 Sep 18:1-20. doi: 10.1007/s00180-022-01283-8.
9
A general framework of nonparametric feature selection in high-dimensional data.
Biometrics. 2023 Jun;79(2):951-963. doi: 10.1111/biom.13664. Epub 2022 Apr 7.
10
Deep direct likelihood knockoffs.
Adv Neural Inf Process Syst. 2020 Dec;33:5036-5046.

引用本文的文献

1
Nonparametric estimation via partial derivatives.
J R Stat Soc Series B Stat Methodol. 2024 Sep 11;87(2):319-336. doi: 10.1093/jrsssb/qkae093. eCollection 2025 Apr.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验