Fox-Wasylyshyn Susan M, El-Masri Maher M
University of Windsor, Faculty of Nursing, 401 Sunset, Health Education Centre, Room 322, Windsor, Ontario, Canada N9B3P4.
Res Nurs Health. 2005 Dec;28(6):488-95. doi: 10.1002/nur.20100.
Self-report measures are extensively used in nursing research. Data derived from such reports can be compromised by the problem of missing data. To help ensure accurate parameter estimates and valid research results, the problem of missing data needs to be appropriately addressed. However, a review of nursing research literature revealed that issues such as the extent and pattern of missingness, and the approach used to handle missing data are seldom reported. The purpose of this article is to provide researchers with a conceptual overview of the issues associated with missing data, procedures used in determining the pattern of missingness, and techniques for handling missing data. The article also highlights the advantages and disadvantages of these techniques, and makes distinctions between data that are missing at the item versus variable levels. Missing data handling techniques addressed in this article include deletion approaches, mean substitution, regression-based imputation, hot-deck imputation, multiple imputation, and maximum likelihood imputation.
自我报告测量方法在护理研究中被广泛使用。从这类报告中得出的数据可能会因数据缺失问题而受到影响。为了帮助确保准确的参数估计和有效的研究结果,数据缺失问题需要得到妥善解决。然而,对护理研究文献的回顾发现,诸如缺失程度和模式以及处理缺失数据所采用的方法等问题很少被报告。本文的目的是为研究人员提供与缺失数据相关问题的概念性概述、确定缺失模式所使用的程序以及处理缺失数据的技术。本文还强调了这些技术的优缺点,并区分了在项目层面与变量层面缺失的数据。本文讨论的缺失数据处理技术包括删除法、均值替换、基于回归的插补、热卡插补、多重插补和最大似然插补。