Rubin D B, Schenker N
Department of Statistics, Harvard University, Cambridge, MA 02138.
Stat Med. 1991 Apr;10(4):585-98. doi: 10.1002/sim.4780100410.
Multiple imputation for non-response replaces each missing value by two or more plausible values. The values can be chosen to represent both uncertainty about the reasons for non-response and uncertainty about which values to impute assuming the reasons for non-response are known. This paper provides an overview of methods for creating and analysing multiply-imputed data sets, and illustrates the dramatic improvements possible when using multiple rather than single imputation. A major application of multiple imputation to public-use files from the 1970 census is discussed, and several exploratory studies related to health care that have used multiple imputation are described.
针对无应答情况的多重填补通过两个或更多似然值替换每个缺失值。这些值的选取既能体现无应答原因的不确定性,又能体现假设无应答原因已知时应填补哪些值的不确定性。本文概述了创建和分析多重填补数据集的方法,并说明了使用多重填补而非单一填补时可能实现的显著改进。文中讨论了多重填补在1970年人口普查公共使用文件中的一个主要应用,并描述了几项使用多重填补的与医疗保健相关的探索性研究。