Krabbe P F, Essink-Bot M L, Bonsel G J
Department of Public Health, Faculty of Medicine, Erasmus University Rotterdam, The Netherlands.
Soc Sci Med. 1997 Dec;45(11):1641-52. doi: 10.1016/s0277-9536(97)00099-3.
The objective of the study was to consider five methods for valuing health states with respect to their comparability (convergent validity, value functions) and reliability. Valuation tasks were performed by 104 student volunteers using five frequently used valuation methods: standard gamble (SG), time trade-off (TTO), rating scale (RS), willingness-to-pay (WTP) and the paired comparisons method (PC). Throughout the study, the EuroQol classification system was used to construct 13 health-state descriptions. Validity was investigated using the multitrait-multimethod (MTMM) methodology. The extent to which results of one method could be predicted by another was examined by transformations. Reliability of the methods was studied parametrically with Generalisability Theory (an ANOVA extension), as well as non-parametrically. Mean values for SG were slightly higher than TTO values. The RS could be distinguished from the other methods. After a simple power transformation, the RS values were found to be close to SG and TTO. Mean values of WTP were linearly related to SG and TTO, except at the extremes of the scale. However, the reliability of WTP was low and the number of inconsistencies substantial. Valuations made by the RS proved to be the most reliable. Paired comparisons did not provide stable results. In conclusion, the results of the parametric transformation function between RS and SG/TTO provide evidence to justify the current use of RS (with transformations) not only for reasons of feasibility and reliability but also for reasons of comparability. A definite judgement on PC requires data of a complete design. Due to the specific structure of the correlation matrix which is inherent in valuing health states, we believe that full MTMM is not applicable for the standard analysis of health-state valuations.
本研究的目的是考量五种评估健康状态的方法在可比性(收敛效度、价值函数)和可靠性方面的表现。104名学生志愿者使用五种常用的评估方法进行评估任务:标准博弈法(SG)、时间权衡法(TTO)、评分量表法(RS)、支付意愿法(WTP)和配对比较法(PC)。在整个研究过程中,使用欧洲五维健康量表分类系统构建了13种健康状态描述。采用多特质多方法(MTMM)方法研究效度。通过转换来检验一种方法的结果能在多大程度上被另一种方法预测。使用概化理论(方差分析的扩展)以及非参数方法对这些方法的可靠性进行了研究。SG的均值略高于TTO的值。RS能与其他方法区分开来。经过简单的幂转换后,发现RS值与SG和TTO接近。WTP的均值与SG和TTO呈线性相关,但在量表的两端除外。然而,WTP的可靠性较低,不一致的情况较多。RS所做的评估被证明是最可靠的。配对比较未提供稳定的结果。总之,RS与SG/TTO之间参数转换函数的结果提供了证据,不仅从可行性和可靠性方面,而且从可比性方面证明了当前使用RS(经过转换)的合理性。对PC的明确判断需要完整设计的数据。由于评估健康状态时固有的相关矩阵的特定结构,我们认为完整的MTMM不适用于健康状态评估的标准分析。