Zhao Junlong, Zhou Yang, Liu Yufeng
Professor, School of Statistics, Beijing Normal University, China.
Assistant Professor, School of Statistics, Beijing Normal University, Beijing China.
J Am Stat Assoc. 2024;119(546):1579-1591. doi: 10.1080/01621459.2023.2206084. Epub 2023 Jun 22.
High dimensional linear models are commonly used in practice. In many applications, one is interested in linear transformations of regression coefficients , where is a specific point and is not required to be identically distributed as the training data. One common approach is the plug-in technique which first estimates , then plugs the estimator in the linear transformation for prediction. Despite its popularity, estimation of can be difficult for high dimensional problems. Commonly used assumptions in the literature include that the signal of coefficients is sparse and predictors are weakly correlated. These assumptions, however, may not be easily verified, and can be violated in practice. When is non-sparse or predictors are strongly correlated, estimation of can be very difficult. In this paper, we propose a novel pointwise estimator for linear transformations of . This new estimator greatly relaxes the common assumptions for high dimensional problems, and is adaptive to the degree of sparsity of and strength of correlations among the predictors. In particular, can be sparse or non-sparse and predictors can be strongly or weakly correlated. The proposed method is simple for implementation. Numerical and theoretical results demonstrate the competitive advantages of the proposed method for a wide range of problems.
高维线性模型在实际中被广泛使用。在许多应用中,人们关注回归系数的线性变换,其中 是一个特定点,且不需要与训练数据具有相同的分布。一种常见的方法是插件技术,该技术首先估计 ,然后将估计值代入线性变换中进行预测。尽管它很受欢迎,但对于高维问题,估计 可能会很困难。文献中常用的假设包括系数 的信号是稀疏的,且预测变量之间是弱相关的。然而,这些假设可能不容易验证,并且在实际中可能会被违反。当 是非稀疏的或预测变量是强相关的时,估计 可能会非常困难。在本文中,我们提出了一种用于 的线性变换的新颖的逐点估计器。这种新的估计器极大地放宽了高维问题的常见假设,并且能够适应 的稀疏程度和预测变量之间的相关强度。特别地, 可以是稀疏的或非稀疏的,并且预测变量可以是强相关的或弱相关的。所提出的方法实现起来很简单。数值和理论结果证明了该方法在广泛问题上的竞争优势。