Kai Bo, Li Runze, Zou Hui
Department of Mathematics, College of Charleston, Charleston, South Carolina 29424, USA,
Ann Stat. 2011 Feb 1;39(1):305-332. doi: 10.1214/10-AOS842.
The complexity of semiparametric models poses new challenges to statistical inference and model selection that frequently arise from real applications. In this work, we propose new estimation and variable selection procedures for the semiparametric varying-coefficient partially linear model. We first study quantile regression estimates for the nonparametric varying-coefficient functions and the parametric regression coefficients. To achieve nice efficiency properties, we further develop a semiparametric composite quantile regression procedure. We establish the asymptotic normality of proposed estimators for both the parametric and nonparametric parts and show that the estimators achieve the best convergence rate. Moreover, we show that the proposed method is much more efficient than the least-squares-based method for many non-normal errors and that it only loses a small amount of efficiency for normal errors. In addition, it is shown that the loss in efficiency is at most 11.1% for estimating varying coefficient functions and is no greater than 13.6% for estimating parametric components. To achieve sparsity with high-dimensional covariates, we propose adaptive penalization methods for variable selection in the semiparametric varying-coefficient partially linear model and prove that the methods possess the oracle property. Extensive Monte Carlo simulation studies are conducted to examine the finite-sample performance of the proposed procedures. Finally, we apply the new methods to analyze the plasma beta-carotene level data.
半参数模型的复杂性给统计推断和模型选择带来了新的挑战,这些挑战在实际应用中经常出现。在这项工作中,我们针对半参数变系数部分线性模型提出了新的估计和变量选择方法。我们首先研究非参数变系数函数和参数回归系数的分位数回归估计。为了获得良好的效率性质,我们进一步开发了一种半参数复合分位数回归方法。我们建立了所提出的参数和非参数部分估计量的渐近正态性,并表明这些估计量达到了最佳收敛速度。此外,我们表明,对于许多非正态误差,所提出的方法比基于最小二乘法的方法效率更高,而对于正态误差,它仅损失少量效率。另外,结果表明,在估计变系数函数时效率损失至多为11.1%,在估计参数分量时不超过13.6%。为了在高维协变量情况下实现稀疏性,我们提出了半参数变系数部分线性模型中变量选择的自适应惩罚方法,并证明这些方法具有神谕性质。进行了广泛的蒙特卡罗模拟研究以检验所提出方法的有限样本性能。最后,我们应用新方法分析血浆β-胡萝卜素水平数据。