Cuesta Izquierdo Marcelino, Fonseca Pedrero Eduardo
Universidad de Oviedo.
Psicothema. 2014;26(4):516-23. doi: 10.7334/psicothema2014.98.
The problem of missing values at the item level is common in studies using educational and psychological tests. The aim of the present work is to explore how the estimation of reliability is affected by missing values.
Using real data, we simulated missing values in accordance with a "missing at random mechanism". Four factors were manipulated with the aim of checking their effect on the estimation of the reliability of the instrument: missing data mechanism, percentage of missing data in the database, sample size, and procedure employed for the treatment of missing values.
The results show that the quality of estimations depends on the interaction of various factors. The general tendency is that the estimations are worse when the sample size is small and the percentage of missing values increases. Listwise is the worst procedure for treatment of the missing data in the simulated conditions.
It is concluded that with a small percentage of missing values one can obtain estimations that are acceptable from a practical point of view with all the procedures employed, except Listwise.
在使用教育和心理测试的研究中,项目层面的缺失值问题很常见。本研究的目的是探讨缺失值如何影响信度估计。
使用真实数据,我们根据“随机缺失机制”模拟缺失值。操纵了四个因素,以检验它们对工具信度估计的影响:缺失数据机制、数据库中缺失数据的百分比、样本量以及处理缺失值所采用的程序。
结果表明,估计的质量取决于各种因素的相互作用。一般趋势是,当样本量小且缺失值百分比增加时,估计会更差。在模拟条件下,删除法是处理缺失数据最差的程序。
得出的结论是,对于少量的缺失值,除了删除法外,使用所有程序从实际角度都能获得可接受的估计。