• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于具有扩散先验的贝叶斯子集建模的部分线性模型的变量选择

Variable selection for partially linear models via Bayesian subset modeling with diffusing prior.

作者信息

Wang Jia, Cai Xizhen, Li Runze

机构信息

Department of Statistics, Pennsylvania State University, University Park, PA 16802, USA.

Department of Mathematics and Statistics, Williams College, Williamstown, MA 01267, USA.

出版信息

J Multivar Anal. 2021 May;183. doi: 10.1016/j.jmva.2021.104733. Epub 2021 Feb 13.

DOI:10.1016/j.jmva.2021.104733
PMID:33867594
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8046162/
Abstract

Most existing methods of variable selection in partially linear models (PLM) with ultrahigh dimensional covariates are based on partial residuals, which involve a two-step estimation procedure. While the estimation error produced in the first step may have an impact on the second step, multicollinearity among predictors adds additional challenges in the model selection procedure. In this paper, we propose a new Bayesian variable selection approach for PLM. This new proposal addresses those two issues simultaneously as (1) it is a one-step method which selects variables in PLM, even when the dimension of covariates increases at an exponential rate with the sample size, and (2) the method retains model selection consistency, and outperforms existing ones in the setting of highly correlated predictors. Distinguished from existing ones, our proposed procedure employs the difference-based method to reduce the impact from the estimation of the nonparametric component, and incorporates Bayesian subset modeling with diffusing prior (BSM-DP) to shrink the corresponding estimator in the linear component. The estimation is implemented by Gibbs sampling, and we prove that the posterior probability of the true model being selected converges to one asymptotically. Simulation studies support the theory and the efficiency of our methods as compared to other existing ones, followed by an application in a study of supermarket data.

摘要

在具有超高维协变量的部分线性模型(PLM)中,大多数现有的变量选择方法都是基于部分残差的,这涉及到一个两步估计过程。虽然第一步产生的估计误差可能会对第二步产生影响,但预测变量之间的多重共线性在模型选择过程中增加了额外的挑战。在本文中,我们提出了一种用于PLM的新的贝叶斯变量选择方法。这个新方法同时解决了这两个问题,因为(1)它是一种一步法,即使协变量的维度随着样本量以指数速率增加,它也能在PLM中选择变量;(2)该方法保持模型选择的一致性,并且在高度相关预测变量的设置中优于现有方法。与现有方法不同,我们提出的过程采用基于差异的方法来减少非参数分量估计的影响,并结合具有扩散先验的贝叶斯子集建模(BSM-DP)来收缩线性分量中的相应估计器。估计通过吉布斯抽样实现,并且我们证明了选择真实模型的后验概率渐近收敛到1。模拟研究支持了我们方法相对于其他现有方法的理论和效率,随后是在超市数据研究中的应用。

相似文献

1
Variable selection for partially linear models via Bayesian subset modeling with diffusing prior.基于具有扩散先验的贝叶斯子集建模的部分线性模型的变量选择
J Multivar Anal. 2021 May;183. doi: 10.1016/j.jmva.2021.104733. Epub 2021 Feb 13.
2
Variable selection for partially linear models via partial correlation.基于偏相关的部分线性模型变量选择
J Multivar Anal. 2018 Sep;167:18-434. doi: 10.1016/j.jmva.2018.06.005. Epub 2018 Jun 20.
3
Variable Selection for High-dimensional Nodal Attributes in Social Networks with Degree Heterogeneity.具有度异质性的社交网络中高维节点属性的变量选择
J Am Stat Assoc. 2024;119(546):1322-1335. doi: 10.1080/01621459.2023.2187815. Epub 2023 Apr 13.
4
Variable Selection via Partial Correlation.通过偏相关进行变量选择。
Stat Sin. 2017 Jul;27(3):983-996. doi: 10.5705/ss.202015.0473.
5
A ROBUST AND EFFICIENT APPROACH TO CAUSAL INFERENCE BASED ON SPARSE SUFFICIENT DIMENSION REDUCTION.一种基于稀疏充分降维的稳健且高效的因果推断方法。
Ann Stat. 2019 Jun;47(3):1505-1535. doi: 10.1214/18-AOS1722. Epub 2019 Feb 13.
6
Robust estimation for partially linear models with large-dimensional covariates.具有高维协变量的部分线性模型的稳健估计
Sci China Math. 2013 Oct 1;56(10):2069-2088. doi: 10.1007/s11425-013-4675-0.
7
Part 1. Statistical Learning Methods for the Effects of Multiple Air Pollution Constituents.第1部分. 多种空气污染成分影响的统计学习方法
Res Rep Health Eff Inst. 2015 Jun(183 Pt 1-2):5-50.
8
Variable Selection in Semiparametric Regression Modeling.半参数回归建模中的变量选择
Ann Stat. 2008;36(1):261-286. doi: 10.1214/009053607000000604.
9
Bayes variable selection in semiparametric linear models.半参数线性模型中的贝叶斯变量选择
J Am Stat Assoc. 2014 Mar 1;109(505):437-447. doi: 10.1080/01621459.2014.881153.
10
NEW EFFICIENT ESTIMATION AND VARIABLE SELECTION METHODS FOR SEMIPARAMETRIC VARYING-COEFFICIENT PARTIALLY LINEAR MODELS.半参数变系数部分线性模型的新有效估计与变量选择方法
Ann Stat. 2011 Feb 1;39(1):305-332. doi: 10.1214/10-AOS842.

引用本文的文献

1
Marginalized LASSO in the low-dimensional difference-based partially linear model for variable selection.用于变量选择的低维基于差异的部分线性模型中的边际化套索法
J Appl Stat. 2024 Jul 9;52(2):400-428. doi: 10.1080/02664763.2024.2372676. eCollection 2025.
2
Variable Selection for High-dimensional Nodal Attributes in Social Networks with Degree Heterogeneity.具有度异质性的社交网络中高维节点属性的变量选择
J Am Stat Assoc. 2024;119(546):1322-1335. doi: 10.1080/01621459.2023.2187815. Epub 2023 Apr 13.

本文引用的文献

1
Variable selection for partially linear models via partial correlation.基于偏相关的部分线性模型变量选择
J Multivar Anal. 2018 Sep;167:18-434. doi: 10.1016/j.jmva.2018.06.005. Epub 2018 Jun 20.
2
Error Variance Estimation in Ultrahigh-Dimensional Additive Models.超高维加法模型中的误差方差估计
J Am Stat Assoc. 2018;113(521):315-327. doi: 10.1080/01621459.2016.1251440. Epub 2017 Sep 26.
3
Robust estimation for partially linear models with large-dimensional covariates.具有高维协变量的部分线性模型的稳健估计
Sci China Math. 2013 Oct 1;56(10):2069-2088. doi: 10.1007/s11425-013-4675-0.
4
Bayesian Model Selection in High-Dimensional Settings.高维情形下的贝叶斯模型选择
J Am Stat Assoc. 2012;107(498). doi: 10.1080/01621459.2012.682536.
5
Variable Selection for Partially Linear Models with Measurement Errors.含测量误差的部分线性模型的变量选择
J Am Stat Assoc. 2009;104(485):234-248. doi: 10.1198/jasa.2009.0127.
6
Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.《超高维特征空间中的确定独立性筛选》讨论
J R Stat Soc Series B Stat Methodol. 2008 Nov;70(5):903. doi: 10.1111/j.1467-9868.2008.00674.x.