School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, China.
Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.
Stat Med. 2024 May 20;43(11):2096-2121. doi: 10.1002/sim.10052. Epub 2024 Mar 15.
Excessive zeros in multivariate count data are often observed in scenarios of biomedicine and public health. To provide a better analysis on this type of data, we first develop a marginalized multivariate zero-inflated Poisson (MZIP) regression model to directly interpret the overall exposure effects on marginal means. Then, we define a multiple Pearson residual for our newly developed MZIP regression model by simultaneously taking heterogeneity and correlation into consideration. Furthermore, a new model averaging prediction method is introduced based on the multiple Pearson residual, and the asymptotical optimality of this model averaging prediction is proved. Simulations and two empirical applications in medicine are used to illustrate the effectiveness of the proposed method.
多元计数数据中过度的零值在生物医学和公共卫生领域经常出现。为了对这类数据进行更好的分析,我们首先开发了一种边缘化多元零膨胀泊松(MZIP)回归模型,以直接解释总体暴露效应对边缘均值的影响。然后,我们通过同时考虑异质性和相关性,为我们新开发的 MZIP 回归模型定义了一个多重 Pearson 残差。此外,我们还基于多重 Pearson 残差引入了一种新的模型平均预测方法,并证明了该方法的渐近最优性。模拟和医学中的两个实证应用说明了所提出方法的有效性。