Queensland University of Technology, Brisbane, Australia.
Value Health. 2009 Mar-Apr;12(2):309-14. doi: 10.1111/j.1524-4733.2008.00421.x.
To present a relatively novel method for modeling length-of-stay data and assess the role of covariates, some of which are related to adverse events. To undertake critical comparisons with alternative models based on the gamma and log-normal distributions. To demonstrate the effect of poorly fitting models on decision-making.
The model has the process of hospital stay organized into Markov phases/states that describe stay in hospital before discharge to an absorbing state. Admission is via state 1 and discharge from this first state would correspond to a short stay, with transitions to later states corresponding to longer stays. The resulting phase-type probability distributions provide a flexible modeling framework for length-of-stay data which are known to be awkward and difficult to fit to other distributions.
The dataset consisted of 1901 patients' lengths of stay and values for a number of covariates. The fitted model comprised six Markov phases, and provided a good fit to the data. Alternative gamma and log-normal models did not fit as well, gave different coefficient estimates, and statistical significance of covariate effects differed between the models.
Models that fit should generally be preferred over those that do not, as they will produce more statistically reliable coefficient estimates. Poor coefficient estimates may mislead decision-makers by either understating or overstating the cost of some event or the cost savings from preventing that event. There is no obvious way of identifying a priori when coefficient estimates from poorly fitting models might be misleading.
提出一种相对新颖的方法来对住院时间数据进行建模,并评估协变量的作用,其中一些与不良事件有关。与基于伽马和对数正态分布的替代模型进行批判性比较。证明拟合不良模型对决策的影响。
该模型将住院过程组织成马尔可夫阶段/状态,描述出院前在医院的停留情况,直至进入吸收状态。入院通过状态 1,从第一个状态出院将对应较短的停留时间,向较晚状态的转移将对应较长的停留时间。由此产生的相型概率分布为住院时间数据提供了一个灵活的建模框架,因为住院时间数据通常难以拟合到其他分布。
数据集包括 1901 名患者的住院时间和一些协变量的值。拟合模型由六个马尔可夫阶段组成,与数据拟合良好。替代的伽马和对数正态模型拟合效果不佳,给出了不同的系数估计,并且模型之间协变量效应的统计显著性也不同。
一般来说,应优先选择拟合良好的模型,而不是拟合不好的模型,因为它们将产生更具统计学可靠性的系数估计。不良的系数估计可能会误导决策者,要么低估某些事件的成本,要么高估预防该事件的成本节约。没有明显的方法可以事先确定拟合不良模型的系数估计可能会产生误导。