• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

曲面估计、变量选择与非参数最优属性

Surface Estimation, Variable Selection, and the Nonparametric Oracle Property.

作者信息

Storlie Curtis B, Bondell Howard D, Reich Brian J, Zhang Hao Helen

出版信息

Stat Sin. 2011 Apr;21(2):679-705. doi: 10.5705/ss.2011.030a.

DOI:10.5705/ss.2011.030a
PMID:21603586
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3095957/
Abstract

Variable selection for multivariate nonparametric regression is an important, yet challenging, problem due, in part, to the infinite dimensionality of the function space. An ideal selection procedure should be automatic, stable, easy to use, and have desirable asymptotic properties. In particular, we define a selection procedure to be nonparametric oracle (np-oracle) if it consistently selects the correct subset of predictors and at the same time estimates the smooth surface at the optimal nonparametric rate, as the sample size goes to infinity. In this paper, we propose a model selection procedure for nonparametric models, and explore the conditions under which the new method enjoys the aforementioned properties. Developed in the framework of smoothing spline ANOVA, our estimator is obtained via solving a regularization problem with a novel adaptive penalty on the sum of functional component norms. Theoretical properties of the new estimator are established. Additionally, numerous simulated and real examples further demonstrate that the new approach substantially outperforms other existing methods in the finite sample setting.

摘要

多元非参数回归中的变量选择是一个重要但具有挑战性的问题,部分原因在于函数空间的无限维特性。一个理想的选择过程应该是自动的、稳定的、易于使用的,并且具有理想的渐近性质。特别地,如果一个选择过程在样本量趋于无穷时能够一致地选择正确的预测变量子集,同时以最优的非参数速率估计光滑曲面,我们就将其定义为非参数神谕(np - 神谕)。在本文中,我们提出了一种非参数模型的模型选择过程,并探讨了新方法具有上述性质的条件。我们的估计器是在平滑样条方差分析的框架下开发的,通过求解一个对函数分量范数之和施加新颖自适应惩罚的正则化问题得到。建立了新估计器的理论性质。此外,大量的模拟和实际例子进一步表明,在有限样本情况下,新方法显著优于其他现有方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f4/3095957/a73e8f5d878d/nihms192282f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f4/3095957/f14e72e41fad/nihms192282f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f4/3095957/593b4a710a04/nihms192282f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f4/3095957/a73e8f5d878d/nihms192282f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f4/3095957/f14e72e41fad/nihms192282f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f4/3095957/593b4a710a04/nihms192282f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f4/3095957/a73e8f5d878d/nihms192282f3.jpg

相似文献

1
Surface Estimation, Variable Selection, and the Nonparametric Oracle Property.曲面估计、变量选择与非参数最优属性
Stat Sin. 2011 Apr;21(2):679-705. doi: 10.5705/ss.2011.030a.
2
Sparse and Efficient Estimation for Partial Spline Models with Increasing Dimension.高维部分样条模型的稀疏有效估计
Ann Inst Stat Math. 2015 Feb 1;67(1):93-127. doi: 10.1007/s10463-013-0440-y.
3
Automatic Model Selection for Partially Linear Models.部分线性模型的自动模型选择
J Multivar Anal. 2009 Oct 1;100(9):2100-2111. doi: 10.1016/j.jmva.2009.06.009.
4
PENALIZED VARIABLE SELECTION PROCEDURE FOR COX MODELS WITH SEMIPARAMETRIC RELATIVE RISK.具有半参数相对风险的Cox模型的惩罚变量选择程序
Ann Stat. 2010 Aug 1;38(4):2092-2117. doi: 10.1214/09-AOS780.
5
Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements.用于重复测量分析的非参数变系数模型中的变量选择
J Am Stat Assoc. 2008 Dec 1;103(484):1556-1569. doi: 10.1198/016214508000000788.
6
Estimation and model selection for nonparametric function-on-function regression.非参数函数对函数回归的估计与模型选择
J Comput Graph Stat. 2022;31(3):835-845. doi: 10.1080/10618600.2022.2037434. Epub 2022 Mar 28.
7
Robust signed-rank estimation and variable selection for semi-parametric additive partial linear models.半参数可加部分线性模型的稳健符号秩估计与变量选择
J Appl Stat. 2019 Nov 27;47(10):1794-1819. doi: 10.1080/02664763.2019.1695759. eCollection 2020.
8
ADAPTIVE ROBUST VARIABLE SELECTION.自适应鲁棒变量选择
Ann Stat. 2014 Feb 1;42(1):324-351. doi: 10.1214/13-AOS1191.
9
A general framework of nonparametric feature selection in high-dimensional data.高维数据中非参数特征选择的一般框架。
Biometrics. 2023 Jun;79(2):951-963. doi: 10.1111/biom.13664. Epub 2022 Apr 7.
10
Robust learning for optimal treatment decision with NP-dimensionality.具有NP维数的最优治疗决策的稳健学习。
Electron J Stat. 2016;10:2894-2921. doi: 10.1214/16-EJS1178. Epub 2016 Oct 13.

引用本文的文献

1
Scalable Empirical Bayes Inference and Bayesian Sensitivity Analysis.可扩展的经验贝叶斯推断与贝叶斯敏感性分析。
Stat Sci. 2024 Nov;39(4):601-622. doi: 10.1214/24-sts936. Epub 2024 Oct 30.
2
Detection of Interaction Effects in a Nonparametric Concurrent Regression Model.非参数并发回归模型中交互效应的检测
Entropy (Basel). 2023 Sep 12;25(9):1327. doi: 10.3390/e25091327.
3
Spike-and-slab least absolute shrinkage and selection operator generalized additive models and scalable algorithms for high-dimensional data analysis.

本文引用的文献

1
Generalized additive models for medical research.医学研究中的广义相加模型。
Stat Methods Med Res. 1995 Sep;4(3):187-96. doi: 10.1177/096228029500400302.
基于 Spike-and-Slab 最小绝对收缩和选择算子的广义加性模型及其在高维数据分析中的可扩展算法。
Stat Med. 2022 Sep 10;41(20):3899-3914. doi: 10.1002/sim.9483. Epub 2022 Jun 5.
4
A comparison of covariate selection techniques applied to pre-exposure prophylaxis (PrEP) drug concentration data in men and transgender women at risk for HIV.比较应用于有 HIV 风险的男性和跨性别女性的暴露前预防(PrEP)药物浓度数据的协变量选择技术。
J Pharmacokinet Pharmacodyn. 2021 Oct;48(5):655-669. doi: 10.1007/s10928-021-09763-y. Epub 2021 May 19.
5
Variable selection methods for identifying predictor interactions in data with repeatedly measured binary outcomes.用于识别具有重复测量二元结局的数据中预测变量交互作用的变量选择方法。
J Clin Transl Sci. 2020 Nov 16;5(1):e59. doi: 10.1017/cts.2020.556.
6
Variable Selection in Kernel Regression Using Measurement Error Selection Likelihoods.使用测量误差选择似然性的核回归中的变量选择
J Am Stat Assoc. 2017;112(520):1587-1597. doi: 10.1080/01621459.2016.1222287. Epub 2017 Jul 19.
7
Variable Selection for Nonparametric Quantile Regression via Smoothing Spline AN OVA.基于样条方差分析的非参数分位数回归变量选择
Stat. 2013;2(1):255-268. doi: 10.1002/sta4.33.
8
Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models.线性还是非线性?部分线性模型的自动结构发现
J Am Stat Assoc. 2011 Sep 1;106(495):1099-1112. doi: 10.1198/jasa.2011.tm10281.
9
Buckley-James boosting for survival analysis with high-dimensional biomarker data.用于高维生物标志物数据生存分析的Buckley-James提升法。
Stat Appl Genet Mol Biol. 2010;9(1):Article24. doi: 10.2202/1544-6115.1550. Epub 2010 Jun 8.