Li Yi, Dicker Lee, Zhao Sihai Dave
University of Michigan.
Rutgers University.
Stat Sin. 2014 Jan 1;24(1):251-2568. doi: 10.5705/ss.2011.220.
The Dantzig variable selector has recently emerged as a powerful tool for fitting regularized regression models. To our knowledge, most work involving the Dantzig selector has been performed with fully-observed response variables. This paper proposes a new class of adaptive Dantzig variable selectors for linear regression models when the response variable is subject to right censoring. This is motivated by a clinical study to identify genes predictive of event-free survival in newly diagnosed multiple myeloma patients. Under some mild conditions, we establish the theoretical properties of our procedures, including consistency in model selection (i.e. the right subset model will be identified with a probability tending to 1) and the optimal efficiency of estimation (i.e. the asymptotic distribution of the estimates is the same as that when the true subset model is known a priori). The practical utility of the proposed adaptive Dantzig selectors is verified via extensive simulations. We apply our new methods to the aforementioned myeloma clinical trial and identify important predictive genes.
丹齐格变量选择器最近已成为拟合正则化回归模型的强大工具。据我们所知,大多数涉及丹齐格选择器的工作都是在响应变量完全可观测的情况下进行的。本文针对响应变量受到右删失的线性回归模型,提出了一类新的自适应丹齐格变量选择器。这是受一项临床研究的启发,该研究旨在识别新诊断的多发性骨髓瘤患者中预测无事件生存期的基因。在一些温和条件下,我们建立了我们方法的理论性质,包括模型选择的一致性(即正确的子集模型将以趋于1的概率被识别)和估计的最优效率(即估计的渐近分布与真实子集模型先验已知时的渐近分布相同)。通过广泛的模拟验证了所提出的自适应丹齐格选择器的实际效用。我们将新方法应用于上述骨髓瘤临床试验,并识别出重要的预测基因。