Chen Tian, Zhang Hui, Zhang Bo
Department of Mathematics and Statistics, University of Toledo, Toledo, OH 43606, U.S.A.
Department of Biostatistics, St. Jude Children's Research Hospital, TN 38105, U.S.A.
J Appl Stat. 2019;46(16):2862-2883. doi: 10.1080/02664763.2019.1620705. Epub 2019 May 22.
Zero-inflated count outcomes arise quite often in research and practice. Parametric models such as the zero-inflated Poisson and zero-inflated negative binomial are widely used to model such responses. However, interpretations of those models focus on the at-risk subpopulation of a two-component population mixture and fail to provide direct inference about marginal effects for the overall population. Recently, new approaches have been proposed to facilitate such marginal inferences for count responses with excess zeros. However, they are likelihood based and impose strong assumptions on data distributions. In this paper, we propose a new distribution-free, or semiparametric, alternative to provide robust inference for marginal effects when population mixtures are defined by zero-inflated count outcomes. The proposed method also applies to longitudinal studies with missing data following the general missing at random mechanism. The proposed approach is illustrated with both simulated and real study data.
零膨胀计数结果在研究和实践中经常出现。诸如零膨胀泊松模型和零膨胀负二项式模型等参数模型被广泛用于对此类响应进行建模。然而,这些模型的解释侧重于两成分总体混合中的风险子总体,未能提供关于总体边际效应的直接推断。最近,已经提出了新的方法来促进对具有过多零值的计数响应进行此类边际推断。然而,它们基于似然性,并且对数据分布施加了很强的假设。在本文中,我们提出了一种新的无分布或半参数替代方法,以便在总体混合由零膨胀计数结果定义时,为边际效应提供稳健的推断。所提出的方法也适用于遵循一般随机缺失机制的具有缺失数据的纵向研究。通过模拟数据和实际研究数据对所提出的方法进行了说明。