Coca Perraillon Marcelo, Shih Ya-Chen Tina, Thisted Ronald A
Department of Public Health Sciences, University of Chicago, Chicago, IL (MCP)
Section of Cancer Economics and Policy, Department of Health Services Research, The University of Texas MD Anderson Cancer Center, Houston, TX (YCTS)
Med Decis Making. 2015 Oct;35(7):888-901. doi: 10.1177/0272989X15577362. Epub 2015 Apr 3.
. When data on preferences are not available, analysts rely on condition-specific or generic measures of health status like the SF-12 for predicting or mapping preferences. Such prediction is challenging because of the characteristics of preference data, which are bounded, have multiple modes, and have a large proportion of observations clustered at values of 1.
. We developed a finite mixture model for cross-sectional data that maps the SF-12 to the EQ-5D-3L preference index. Our model characterizes the observed EQ-5D-3L index as a mixture of 3 distributions: a degenerate distribution with mass at values indicating perfect health and 2 censored (Tobit) normal distributions. Using estimation and validation samples derived from the Medical Expenditure Panel Survey 2000 dataset, we compared the prediction performance of these mixture models to that of 2 previously proposed methods: ordinary least squares regression (OLS) and two-part models.
. Finite mixture models in which predictions are based on classification outperform two-part models and OLS regression based on mean absolute error, with substantial improvement for samples with fewer respondents in good health. The potential for misclassification is reflected on larger root mean square errors. Moreover, mixture models underperform around the center of the observed distribution.
. Finite mixtures offer a flexible modeling approach that can take into account idiosyncratic characteristics of the distribution of preferences. The use of mixture models allows researchers to obtain estimates of health utilities when only summary scores from the SF-12 and a limited number of demographic characteristics are available. Mixture models are particularly useful when the target sample does not have a large proportion of individuals in good health.
当偏好数据不可用时,分析人员依靠特定疾病或通用的健康状况测量指标(如SF - 12)来预测或映射偏好。由于偏好数据具有有界性、多峰性且大量观测值聚集在1值处等特征,这种预测具有挑战性。
我们为横断面数据开发了一种有限混合模型,该模型将SF - 12映射到EQ - 5D - 3L偏好指数。我们的模型将观察到的EQ - 5D - 3L指数表征为三种分布的混合:一种在表示完美健康的值处有质量的退化分布,以及两种截尾(托宾)正态分布。使用从2000年医疗支出面板调查数据集得出的估计样本和验证样本,我们将这些混合模型的预测性能与之前提出的两种方法进行了比较:普通最小二乘回归(OLS)和两部分模型。
基于分类的有限混合模型在平均绝对误差方面优于两部分模型和OLS回归,对于健康状况良好的受访者较少的样本有显著改进。误分类的可能性反映在较大的均方根误差上。此外,混合模型在观察到的分布中心附近表现不佳。
有限混合提供了一种灵活的建模方法,可以考虑偏好分布的特殊特征。当只有SF - 12的汇总分数和有限数量的人口统计学特征可用时,使用混合模型可以使研究人员获得健康效用的估计值。当目标样本中健康状况良好的个体比例不大时,混合模型特别有用。