Chaurasia Ashok, Harel Ofer
Department of Statistics, University of Connecticut, Storrs, CT, USA.
Health Serv Outcomes Res Methodol. 2012 Jun;12(2-3):219-233. doi: 10.1007/s10742-012-0088-8.
Many model selection criteria proposed over the years have become common procedures in applied research. However, these procedures were designed for complete data. Complete data is rare in applied statistics, in particular in medical, public health and health policy settings. Incomplete data, another common problem in applied statistics, introduces its own set of complications in light of which the task of model selection can get quite complicated. Recently, few have suggested model selection procedures for incomplete data with varying degrees of success. In this paper we explore model selection by the Akaike Information Criterion (AIC) in the multivariate regression setting with ignorable missing data accounted for via multiple imputation.
多年来提出的许多模型选择标准已成为应用研究中的常见程序。然而,这些程序是为完整数据设计的。完整数据在应用统计学中很少见,尤其是在医学、公共卫生和卫生政策环境中。不完整数据是应用统计学中的另一个常见问题,它带来了一系列自身的复杂情况,鉴于此,模型选择任务可能会变得相当复杂。最近,很少有人针对不完整数据提出模型选择程序,且取得了不同程度的成功。在本文中,我们探讨在多元回归设置下,通过对可忽略的缺失数据进行多重填补来使用赤池信息准则(AIC)进行模型选择。