Austin Peter C
Institute for Clinical Evaluative Sciences, North York, Ontario M4N 3M5, Canada.
Value Health. 2002 Jul-Aug;5(4):329-37. doi: 10.1046/j.1524-4733.2002.54128.x.
Self-reported health status is often measured using psychometric or utility indices that provide a score intended to summarize an individual's health. Measurements of health status can be subject to a ceiling effect. Frequently, researchers want to examine relationships between determinants of health and measures of health status. Regression methods that ignore the censoring in the health status measurement can produce biased coefficient estimates. The authors examine the performance of three different models for assessing the relationship between demographic characteristics and health status.
Three methods that allow one to analyze data subject to a ceiling effect are compared. The first model is the classic Tobit model. The second and third models are robust variants of the Tobit model: symmetrically trimmed least squares and censored least absolute deviations (Censored LAD) regression. These models were fit to data from the Canadian National Population Health Survey. The results are compared to three models that ignore the presence of a ceiling effect.
The Censored LAD model produced coefficient estimates that tended to be shrunk toward 0, compared to the other two models. The three models produced conflicting evidence on the effect of gender on health status. Similarly, the rate of decay in health status with increasing age differed across the three models. The Censored LAD model produced results very similar to median regression. Furthermore, the censored LAD model had the lowest prediction error in an independent validation dataset.
Our results highlight the need for careful consideration about how best to model variation in health status. Based upon our study, we recommend the use of Censored LAD regression.
自我报告的健康状况通常使用心理测量或效用指数来衡量,这些指数提供一个分数以概括个人的健康状况。健康状况的测量可能会受到天花板效应的影响。研究人员经常希望研究健康决定因素与健康状况测量之间的关系。忽略健康状况测量中的删失情况的回归方法可能会产生有偏差的系数估计。作者研究了三种不同模型在评估人口统计学特征与健康状况之间关系时的表现。
比较了三种允许分析受天花板效应影响的数据的方法。第一个模型是经典的托比特模型。第二个和第三个模型是托比特模型的稳健变体:对称修剪最小二乘法和删失最小绝对偏差(Censored LAD)回归。这些模型被应用于加拿大国家人口健康调查的数据。将结果与忽略天花板效应存在的三种模型进行比较。
与其他两个模型相比,删失最小绝对偏差模型产生的系数估计往往向0收缩。这三种模型在性别对健康状况的影响方面提供了相互矛盾的证据。同样,随着年龄增长健康状况的衰退率在这三种模型中也有所不同。删失最小绝对偏差模型产生的结果与中位数回归非常相似。此外,在一个独立的验证数据集中,删失最小绝对偏差模型的预测误差最低。
我们的结果强调了需要仔细考虑如何最好地对健康状况的变化进行建模。基于我们的研究,我们建议使用删失最小绝对偏差回归。