Jiang Fei, Zhou Yeqing, Liu Jianxuan, Ma Yanyuan
Department of Epidemiology and Biostatistics, The University of California, San Francisco.
School of Mathematical Sciences, Tongji University.
Ann Stat. 2023 Feb;51(1):233-259. doi: 10.1214/22-aos2248. Epub 2023 Mar 23.
We study estimation and testing in the Poisson regression model with noisy high dimensional covariates, which has wide applications in analyzing noisy big data. Correcting for the estimation bias due to the covariate noise leads to a non-convex target function to minimize. Treating the high dimensional issue further leads us to augment an amenable penalty term to the target function. We propose to estimate the regression parameter through minimizing the penalized target function. We derive the and convergence rates of the estimator and prove the variable selection consistency. We further establish the asymptotic normality of any subset of the parameters, where the subset can have infinitely many components as long as its cardinality grows sufficiently slow. We develop Wald and score tests based on the asymptotic normality of the estimator, which permits testing of linear functions of the members if the subset. We examine the finite sample performance of the proposed tests by extensive simulation. Finally, the proposed method is successfully applied to the Alzheimer's Disease Neuroimaging Initiative study, which motivated this work initially.
我们研究了具有噪声高维协变量的泊松回归模型中的估计和检验问题,该模型在分析噪声大数据方面有广泛应用。校正由于协变量噪声导致的估计偏差会产生一个非凸目标函数以进行最小化。进一步处理高维问题使我们在目标函数中增加一个合适的惩罚项。我们建议通过最小化惩罚目标函数来估计回归参数。我们推导了估计器的收敛速度并证明了变量选择一致性。我们进一步建立了参数任何子集的渐近正态性,只要该子集的基数增长足够缓慢,其子集可以有无限多个分量。我们基于估计器的渐近正态性开发了 Wald 检验和得分检验,这允许对该子集中成员的线性函数进行检验。我们通过广泛的模拟研究了所提出检验的有限样本性能。最后,所提出的方法成功应用于阿尔茨海默病神经影像倡议研究,该研究最初激发了这项工作。