Zeng Xianbin, Ma Shuangge, Qin Yichen, Li Yang
School of Statistics, Statistical Consulting Center, Renmin University of China, Beijing 100872, P.R. China.
School of Statistics, Renmin University of China, Beijing 100872, P.R. China, Department of Biostatistics, Yale University, New Haven 06511, USA.
Stat Interface. 2015;8(3):355-365. doi: 10.4310/SII.2015.v8.n3.a9.
In this paper, we consider the variable selection problem in semiparametric additive partially linear models for longitudinal data. Our goal is to identify relevant main effects and corresponding interactions associated with the response variable. Meanwhile, we enforce the strong hierarchical restriction on the model, that is, an interaction can be included in the model only if both the associated main effects are included. Based on B-splines basis approximation for the nonparametric components, we propose an iterative estimation procedure for the model by penalizing the likelihood with a partial group minimax concave penalty (MCP), and use BIC to select the tuning parameter. To further improve the estimation efficiency, we specify the working covariance matrix by maximum likelihood estimation. Simulation studies indicate that the proposed method tends to consistently select the true model and works efficiently in estimation and prediction with finite samples, especially when the true model obeys the strong hierarchy. Finally, the China Stock Market data are fitted with the proposed model to illustrate its effectiveness.
在本文中,我们考虑纵向数据半参数加法部分线性模型中的变量选择问题。我们的目标是识别与响应变量相关的相关主效应和相应的交互作用。同时,我们对模型施加强层次限制,即只有当相关的两个主效应都包含在模型中时,才能将交互作用包含在模型中。基于非参数分量的B样条基逼近,我们通过使用部分组极小极大凹惩罚(MCP)对似然进行惩罚,提出了一种模型的迭代估计程序,并使用BIC选择调谐参数。为了进一步提高估计效率,我们通过最大似然估计指定工作协方差矩阵。模拟研究表明,所提出的方法倾向于一致地选择真实模型,并且在有限样本的估计和预测中有效地工作,特别是当真实模型服从强层次结构时。最后,将中国股票市场数据拟合到所提出的模型中以说明其有效性。