Andridge Rebecca R, Little Roderick J A
Division of Biostatistics, The Ohio State University, Columbus, OH 43210, USA.
Int Stat Rev. 2010 Apr;78(1):40-64. doi: 10.1111/j.1751-5823.2010.00103.x.
Hot deck imputation is a method for handling missing data in which each missing value is replaced with an observed response from a "similar" unit. Despite being used extensively in practice, the theory is not as well developed as that of other imputation methods. We have found that no consensus exists as to the best way to apply the hot deck and obtain inferences from the completed data set. Here we review different forms of the hot deck and existing research on its statistical properties. We describe applications of the hot deck currently in use, including the U.S. Census Bureau's hot deck for the Current Population Survey (CPS). We also provide an extended example of variations of the hot deck applied to the third National Health and Nutrition Examination Survey (NHANES III). Some potential areas for future research are highlighted.
热卡插补是一种处理缺失数据的方法,其中每个缺失值都被来自“相似”单元的观测响应所取代。尽管在实践中被广泛使用,但其理论发展不如其他插补方法完善。我们发现,对于应用热卡插补的最佳方法以及从完整数据集中得出推断,目前尚无共识。在此,我们回顾热卡插补的不同形式及其统计特性的现有研究。我们描述了目前正在使用的热卡插补的应用,包括美国人口普查局用于当前人口调查(CPS)的热卡插补。我们还提供了一个扩展示例,展示了应用于第三次全国健康和营养检查调查(NHANES III)的热卡插补的变体。文中突出了一些未来研究的潜在领域。