Shi Runmin, Liang Faming, Song Qifan, Luo Ye, Ghosh Malay
Department of Statistics, University of Florida, Gainesville, FL 32611.
Department of Statistics, Purdue University, West Lafayette, IN 47906.
Sankhya Ser B. 2018 Dec;80(1 Suppl):179-223. doi: 10.1007/s13571-018-0183-0. Epub 2019 Feb 7.
The drastic improvement in data collection and acquisition technologies has enabled scientists to collect a great amount of data. With the growing dataset size, typically comes a growing complexity of data structures and of complex models to account for the data structures. How to estimate the parameters of complex models has put a great challenge on current statistical methods. This paper proposes a approach as a potential solution to the problem, which works by iteratively finding consistent estimates for each block of parameters conditional on the current estimates of the parameters in other blocks. The blockwise consistency approach decomposes the high-dimensional parameter estimation problem into a series of lower-dimensional parameter estimation problems, which often have much simpler structures than the original problem and thus can be easily solved. Moreover, under the framework provided by the blockwise consistency approach, a variety of methods, such as Bayesian and frequentist methods, can be jointly used to achieve a consistent estimator for the original high-dimensional complex model. The blockwise consistency approach is illustrated using two high-dimensional problems, variable selection and multivariate regression. The results of both problems show that the blockwise consistency approach can provide drastic improvements over the existing methods. Extension of the blockwise consistency approach to many other complex models is straightforward.
数据收集与获取技术的巨大进步使科学家能够收集大量数据。随着数据集规模的不断扩大,数据结构以及用于解释这些数据结构的复杂模型通常也会变得越来越复杂。如何估计复杂模型的参数给当前的统计方法带来了巨大挑战。本文提出了一种方法作为该问题的潜在解决方案,它通过在其他块参数的当前估计值条件下,对每个参数块迭代地找到一致估计值来工作。逐块一致性方法将高维参数估计问题分解为一系列低维参数估计问题,这些问题的结构通常比原始问题简单得多,因此可以轻松解决。此外,在逐块一致性方法提供的框架下,可以联合使用多种方法,如贝叶斯方法和频率论方法,来获得原始高维复杂模型的一致估计量。通过变量选择和多元回归这两个高维问题对逐块一致性方法进行了说明。两个问题的结果都表明,逐块一致性方法相比现有方法能带来显著改进。将逐块一致性方法扩展到许多其他复杂模型是很直接的。