Walters Stephen J, Brazier John E
Health Economics and Decision Science, School of Health and Related Research, University of Sheffield, UK.
Qual Life Res. 2005 Aug;14(6):1523-32. doi: 10.1007/s11136-004-7713-0.
The SF-6D and EQ-5D are both preference-based measures of health. Empirical work is required to determine what the smallest change is in utility scores that can be regarded as important and whether this change in utility value is constant across measures and conditions.
To use distribution and anchor-based methods to determine and compare the minimally important difference (MID) for the SF-6D and EQ-5D for various datasets.
The SF-6D is scored on a 0.29-1.00 scale and the EQ-5D on a -0.59-1.00 scale, with a score of 1.00 on both, indicating 'full health'. Patients were followed for a period of time, then asked, using question 2 of the SF-36 as our anchor, if their general health is much better (5), somewhat better (4), stayed the same (3), somewhat worse (2) or much worse (1) compared to the last time they were assessed. We considered patients whose global rating score was 4 or 2 as having experienced some change equivalent to the MID. This paper describes and compares the MID and standardised response mean (SRM) for the SF-6D and EQ-5D from eight longitudinal studies in 11 patient groups that used both instruments.
From the 11 reviewed studies, the MID for the SF-6D ranged from 0.011 to 0.097, mean 0.041. The corresponding SRMs ranged from 0.12 to 0.87, mean 0.39 and were mainly in the 'small to moderate' range using Cohen's criteria, supporting the MID results. The mean MID for the EQ-5D was 0.074 (range -0.011-0.140) and the SRMs ranged from -0.05 to 0.43, mean 0.24. The mean MID for the EQ-SD was almost double that of the mean MID for the SF-6D.
There is evidence that the MID for these two utility measures are not equal and differ in absolute values. The EQ-5D scale has approximately twice the range of the SF-6D scale. Therefore, the estimates of the MID for each scale appear to be proportionally equivalent in the context of the range of utility scores for each scale. Further empirical work is required to see whether or not this holds true for other utility measures, patient groups and populations.
SF-6D和EQ-5D均为基于偏好的健康测量工具。需要开展实证研究来确定效用得分中可被视为重要的最小变化量,以及这种效用值变化在不同测量工具和条件下是否恒定。
运用分布法和基于锚定的方法来确定并比较不同数据集下SF-6D和EQ-5D的最小重要差异(MID)。
SF-6D的评分范围是0.29至1.00,EQ-5D的评分范围是-0.59至1.00,两者得分均为1.00表示“完全健康”。对患者进行一段时间的随访,然后以SF-36的问题2作为锚定,询问他们与上次评估相比,总体健康状况是好多了(5)、稍好些(4)、保持不变(3)、稍差些(2)还是差得多(1)。我们将总体评分得分为4或2的患者视为经历了相当于MID的某种变化。本文描述并比较了11个患者组中8项纵向研究里同时使用这两种工具时SF-6D和EQ-5D的MID及标准化反应均值(SRM)。
在11项综述研究中,SF-6D的MID范围为0.011至0.097,均值为0.041。相应的SRM范围为0.12至0.87,均值为0.39,根据科恩标准,主要处于 “小到中等” 范围,支持了MID结果。EQ-5D的平均MID为0.074(范围为-0.011至0.140),SRM范围为-0.05至0.43,均值为0.24。EQ-5D的平均MID几乎是SF-6D平均MID的两倍。
有证据表明这两种效用测量工具的MID不相等且绝对值不同。EQ-5D量表的范围约为SF-6D量表的两倍。因此,在每个量表的效用得分范围内,每个量表的MID估计值似乎成比例相等。还需要进一步的实证研究来确定这是否适用于其他效用测量工具、患者群体和人群。