Department of Psychology, University of Notre Dame.
Psychol Methods. 2021 Oct;26(5):559-598. doi: 10.1037/met0000377. Epub 2021 Jun 28.
issing values that are missing not at random (MNAR) can result from a variety of missingness processes. However, two fundamental subtypes of MNAR values can be obtained from the definition of the MNAR mechanism itself. The distinction between them deserves consideration because they have characteristic differences in how they distort relationships in the data. This has implications for the validity of statistical results and generalizability of methodological findings that are based on data (empirical or generated) with MNAR values. However, these MNAR subtypes have largely gone unnoticed by the literature. As few studies have considered both subtypes, their relevance to methodological and substantive research has been overlooked. This article systematically introduces the two MNAR subtypes and gives them descriptive names. A case study demonstrates they are mechanically distinct from each other and from other missing-data mechanisms. Applied examples are given to help researchers conceptually identify MNAR subtypes in real data. Methods are provided to generate missing values from both subtypes in simulation studies. Simulation studies for regression and growth curve modeling contexts show MNAR subtypes consistently differ in the severity of their impact on statistical inference. This behavior is examined in light of how relationships in the data become characteristically distorted. The contents of this article are intended to provide a foundation and tools for organized consideration of MNAR subtypes. (PsycInfo Database Record (c) 2021 APA, all rights reserved).
缺失值是指在数据收集过程中丢失的数据,这些缺失值可能是随机的,也可能是非随机的。如果缺失值不是随机的(MNAR),那么可能是由于各种缺失机制导致的。然而,从 MNAR 机制的定义本身可以得到两种基本的 MNAR 值子类型。这两种子类型之间的区别值得考虑,因为它们在数据中扭曲关系的方式上存在特征差异。这对基于具有 MNAR 值的(经验或生成)数据的统计结果的有效性和方法学发现的可推广性有影响。然而,这些 MNAR 子类型在文献中基本上没有被注意到。由于很少有研究同时考虑这两种子类型,因此它们对方法学和实质性研究的相关性被忽视了。本文系统地介绍了这两种 MNAR 子类型,并为它们赋予了描述性的名称。一个案例研究表明,它们在机制上彼此不同,也与其他缺失数据机制不同。提供了应用实例,以帮助研究人员在实际数据中概念上识别 MNAR 子类型。提供了方法来在模拟研究中从这两种子类型生成缺失值。回归和增长曲线建模情境的模拟研究表明,MNAR 子类型在其对统计推断的影响的严重程度上始终存在差异。这种行为是根据数据中的关系如何变得特征性地扭曲来检查的。本文的内容旨在为有组织地考虑 MNAR 子类型提供基础和工具。(PsycInfo 数据库记录(c)2021 APA,保留所有权利)。