Joe Harry, Zhu Rong
Department of Statistics, University of British Columbia, Canada.
Biom J. 2005 Apr;47(2):219-29. doi: 10.1002/bimj.200410102.
We prove that the generalized Poisson distribution GP(theta, eta) (eta > or = 0) is a mixture of Poisson distributions; this is a new property for a distribution which is the topic of the book by Consul (1989). Because we find that the fits to count data of the generalized Poisson and negative binomial distributions are often similar, to understand their differences, we compare the probability mass functions and skewnesses of the generalized Poisson and negative binomial distributions with the first two moments fixed. They have slight differences in many situations, but their zero-inflated distributions, with masses at zero, means and variances fixed, can differ more. These probabilistic comparisons are helpful in selecting a better fitting distribution for modelling count data with long right tails. Through a real example of count data with large zero fraction, we illustrate how the generalized Poisson and negative binomial distributions as well as their zero-inflated distributions can be discriminated.
我们证明广义泊松分布GP(θ, η)(η≥0)是泊松分布的混合;这是一种分布的新特性,而该分布是康索尔(1989年)所著书籍的主题。由于我们发现广义泊松分布和负二项分布对计数数据的拟合通常相似,为了解它们之间的差异,我们在固定前两个矩的情况下比较广义泊松分布和负二项分布的概率质量函数与偏度。它们在许多情况下有细微差异,但在零膨胀分布中,在零处的质量、均值和方差固定时,差异可能更大。这些概率比较有助于选择更适合的分布来对具有长右尾的计数数据进行建模。通过一个零比例较大的计数数据实例,我们说明了如何区分广义泊松分布和负二项分布及其零膨胀分布。