Suppr超能文献

用于重复测量分析的非参数变系数模型中的变量选择

Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements.

作者信息

Wang Lifeng, Li Hongzhe, Huang Jianhua Z

机构信息

Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Philadelphia, PA 19104,

出版信息

J Am Stat Assoc. 2008 Dec 1;103(484):1556-1569. doi: 10.1198/016214508000000788.

Abstract

Nonparametric varying-coefficient models are commonly used for analysis of data measured repeatedly over time, including longitudinal and functional responses data. While many procedures have been developed for estimating the varying-coefficients, the problem of variable selection for such models has not been addressed. In this article, we present a regularized estimation procedure for variable selection that combines basis function approximations and the smoothly clipped absolute deviation (SCAD) penalty. The proposed procedure simultaneously selects significant variables with time-varying effects and estimates the nonzero smooth coefficient functions. Under suitable conditions, we have established the theoretical properties of our procedure, including consistency in variable selection and the oracle property in estimation. Here the oracle property means that the asymptotic distribution of an estimated coefficient function is the same as that when it is known a priori which variables are in the model. The method is illustrated with simulations and two real data examples, one for identifying risk factors in the study of AIDS and one using microarray time-course gene expression data to identify the transcription factors related to the yeast cell cycle process.

摘要

非参数变系数模型常用于分析随时间重复测量的数据,包括纵向数据和函数响应数据。虽然已经开发了许多用于估计变系数的方法,但此类模型的变量选择问题尚未得到解决。在本文中,我们提出了一种用于变量选择的正则化估计方法,该方法结合了基函数逼近和平滑截断绝对偏差(SCAD)惩罚。所提出的方法同时选择具有时变效应的显著变量,并估计非零平滑系数函数。在适当的条件下,我们建立了该方法的理论性质,包括变量选择的一致性和估计中的神谕性质。这里的神谕性质是指估计系数函数的渐近分布与事先知道模型中哪些变量时的渐近分布相同。通过模拟和两个实际数据示例对该方法进行了说明,一个用于识别艾滋病研究中的风险因素,另一个使用微阵列时间序列基因表达数据来识别与酵母细胞周期过程相关的转录因子。

相似文献

8
Weighted Wilcoxon-type smoothly clipped absolute deviation method.加权威尔科克森型平滑截断绝对偏差法。
Biometrics. 2009 Jun;65(2):564-71. doi: 10.1111/j.1541-0420.2008.01099.x. Epub 2008 Jul 18.

引用本文的文献

本文引用的文献

3
Statistical methods for identifying yeast cell cycle transcription factors.鉴定酵母细胞周期转录因子的统计方法。
Proc Natl Acad Sci U S A. 2005 Sep 20;102(38):13532-7. doi: 10.1073/pnas.0505874102. Epub 2005 Sep 12.
5
Integrating regulatory motif discovery and genome-wide expression analysis.整合调控基序发现与全基因组表达分析。
Proc Natl Acad Sci U S A. 2003 Mar 18;100(6):3339-44. doi: 10.1073/pnas.0630591100. Epub 2003 Mar 7.
7
Transcriptional regulatory networks in Saccharomyces cerevisiae.酿酒酵母中的转录调控网络。
Science. 2002 Oct 25;298(5594):799-804. doi: 10.1126/science.1075090.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验