Cooper Nicola J, Lambert Paul C, Abrams Keith R, Sutton Alexander J
Centre for Biostatistics and Genetic Epidemiology, Department of Health Sciences, University of Leicester, UK.
Health Econ. 2007 Jan;16(1):37-56. doi: 10.1002/hec.1141.
This article focuses on the modelling and prediction of costs due to disease accrued over time, to inform the planning of future services and budgets. It is well documented that the modelling of cost data is often problematic due to the distribution of such data; for example, strongly right skewed with a significant percentage of zero-cost observations. An additional problem associated with modelling costs over time is that cost observations measured on the same individual at different time points will usually be correlated. In this study we compare the performance of four different multilevel/hierarchical models (which allow for both the within-subject and between-subject variability) for analysing healthcare costs in a cohort of individuals with early inflammatory polyarthritis (IP) who were followed-up annually over a 5-year time period from 1990/1991. The hierarchical models fitted included linear regression models and two-part models with log-transformed costs, and two-part model with gamma regression and a log link. The cohort was split into a learning sample, to fit the different models, and a test sample to assess the predictive ability of these models. To obtain predicted costs on the original cost scale (rather than the log-cost scale) two different retransformation factors were applied. All analyses were carried out using Bayesian Markov chain Monte Carlo (MCMC) simulation methods.
本文着重于对随时间累积的疾病成本进行建模和预测,以为未来服务和预算规划提供依据。有充分文献记载,由于成本数据的分布情况,成本数据建模往往存在问题;例如,数据严重右偏,且零成本观测值占相当大的比例。与随时间对成本进行建模相关的另一个问题是,在不同时间点对同一个体进行测量的成本观测值通常会相互关联。在本研究中,我们比较了四种不同的多水平/分层模型(其考虑了个体内部和个体间的变异性)在分析一组患有早期炎症性多关节炎(IP)的个体医疗成本方面的性能,这些个体在1990/1991年开始的5年时间里每年接受随访。所拟合的分层模型包括线性回归模型以及对成本进行对数变换的两部分模型,还有带伽马回归和对数链接的两部分模型。该队列被分为一个用于拟合不同模型的学习样本和一个用于评估这些模型预测能力的测试样本。为了在原始成本尺度(而非对数成本尺度)上获得预测成本,应用了两种不同的逆变换因子。所有分析均使用贝叶斯马尔可夫链蒙特卡罗(MCMC)模拟方法进行。