Fan Jianqing, Xue Lingzhou, Yao Jiawei
Princeton University.
Pennsylvania State University.
J Econom. 2017 Dec;201(2):292-306. doi: 10.1016/j.jeconom.2017.08.009. Epub 2017 Aug 26.
We consider forecasting a single time series when there is a large number of predictors and a possible nonlinear effect. The dimensionality was first reduced via a high-dimensional (approximate) factor model implemented by the principal component analysis. Using the extracted factors, we develop a novel forecasting method called the sufficient forecasting, which provides a set of sufficient predictive indices, inferred from high-dimensional predictors, to deliver additional predictive power. The projected principal component analysis will be employed to enhance the accuracy of inferred factors when a semi-parametric (approximate) factor model is assumed. Our method is also applicable to cross-sectional sufficient regression using extracted factors. The connection between the sufficient forecasting and the deep learning architecture is explicitly stated. The sufficient forecasting correctly estimates projection indices of the underlying factors even in the presence of a nonparametric forecasting function. The proposed method extends the sufficient dimension reduction to high-dimensional regimes by condensing the cross-sectional information through factor models. We derive asymptotic properties for the estimate of the central subspace spanned by these projection directions as well as the estimates of the sufficient predictive indices. We further show that the natural method of running multiple regression of target on estimated factors yields a linear estimate that actually falls into this central subspace. Our method and theory allow the number of predictors to be larger than the number of observations. We finally demonstrate that the sufficient forecasting improves upon the linear forecasting in both simulation studies and an empirical study of forecasting macroeconomic variables.
当存在大量预测变量以及可能的非线性效应时,我们考虑对单个时间序列进行预测。首先通过主成分分析实现的高维(近似)因子模型来降低维度。利用提取的因子,我们开发了一种名为充分预测的新颖预测方法,该方法从高维预测变量中推断出一组充分的预测指标,以提供额外的预测能力。当假设采用半参数(近似)因子模型时,将使用投影主成分分析来提高推断因子的准确性。我们的方法也适用于使用提取因子的横截面充分回归。明确阐述了充分预测与深度学习架构之间的联系。即使存在非参数预测函数,充分预测也能正确估计潜在因子的投影指标。所提出的方法通过因子模型压缩横截面信息,将充分降维扩展到高维情形。我们推导了由这些投影方向所跨越的中心子空间估计值以及充分预测指标估计值的渐近性质。我们进一步表明,对目标变量在估计因子上进行多元回归的自然方法会产生一个实际上落入该中心子空间的线性估计值。我们的方法和理论允许预测变量的数量大于观测值的数量。最后,我们在模拟研究和预测宏观经济变量的实证研究中均证明,充分预测优于线性预测。