Pal Suvra, Balakrishnan N
1 Department of Mathematics, University of Texas, Arlington, TX, USA.
2 Department of Mathematics and Statistics, McMaster University, Hamilton, ON, Canada.
Stat Methods Med Res. 2017 Oct;26(5):2093-2113. doi: 10.1177/0962280217708686. Epub 2017 Jun 28.
In this paper, we consider a competing cause scenario and assume the number of competing causes to follow a Conway-Maxwell Poisson distribution which can capture both over and under dispersion that is usually encountered in discrete data. Assuming the population of interest having a component cure and the form of the data to be interval censored, as opposed to the usually considered right-censored data, the main contribution is in developing the steps of the expectation maximization algorithm for the determination of the maximum likelihood estimates of the model parameters of the flexible Conway-Maxwell Poisson cure rate model with Weibull lifetimes. An extensive Monte Carlo simulation study is carried out to demonstrate the performance of the proposed estimation method. Model discrimination within the Conway-Maxwell Poisson distribution is addressed using the likelihood ratio test and information-based criteria to select a suitable competing cause distribution that provides the best fit to the data. A simulation study is also carried out to demonstrate the loss in efficiency when selecting an improper competing cause distribution which justifies the use of a flexible family of distributions for the number of competing causes. Finally, the proposed methodology and the flexibility of the Conway-Maxwell Poisson distribution are illustrated with two known data sets from the literature: smoking cessation data and breast cosmesis data.
在本文中,我们考虑一种竞争风险情形,并假设竞争风险的数量服从康威 - 麦克斯韦泊松分布,该分布能够捕捉离散数据中通常会遇到的过度离散和欠离散情况。假设感兴趣的总体具有部分治愈特性,且数据形式为区间删失,与通常考虑的右删失数据不同,主要贡献在于推导期望最大化算法的步骤,以确定具有威布尔寿命的灵活的康威 - 麦克斯韦泊松治愈率模型的模型参数的最大似然估计。进行了广泛的蒙特卡罗模拟研究以证明所提出估计方法的性能。使用似然比检验和基于信息的准则来解决康威 - 麦克斯韦泊松分布内的模型判别问题,以选择一个最适合数据的合适竞争风险分布。还进行了一项模拟研究,以证明选择不合适的竞争风险分布时效率的损失,这证明了对竞争风险数量使用灵活的分布族是合理的。最后,用文献中的两个已知数据集:戒烟数据和乳房美容数据,说明了所提出的方法以及康威 - 麦克斯韦泊松分布的灵活性。