Institute for Clinical Epidemiology and Applied Biometrics, Medical University of Tübingen, Silcherstraße 5 72076, Tübingen, Germany.
Institute for Community Medicine, University of Greifswald, Greifswald, Germany.
Qual Life Res. 2022 Jan;31(1):11-23. doi: 10.1007/s11136-021-02922-1. Epub 2021 Jul 8.
The EQ-5D-5L is a well-established health questionnaire that estimates health utilities by applying preference-based weights. Limited work has been done to examine alternative scoring approaches when utility weights are unavailable or inapplicable. We examined whether the Mokken scaling approach can elucidate 1) if the level summary score is appropriate for the EQ-5D-5L and 2) an interpretation of such a score.
The R package "mokken" was used to assess monotonicity (scaling coefficients H, automated item selection procedure) and manifest invariant item ordering (MIIO: paired item response functions [IRF], H). We used a rich dataset (the Multiple Instrument Comparison, MIC) which includes EQ-5D-5L data from six Western countries.
While all EQ-5D-5L items demonstrated monotonicity, the anxiety/depression (AD) item had weak scalability (H = 0.377). Without AD, scalability improved from H = 0.559 to H = 0.714. MIIO revealed that the 5 items can be ordered, and the ordering is moderately accurate in the MIC data (H = 0.463). Excluding AD, H improves to 0.743. Results were largely consistent across disease and country subgroups.
The 5 items of the EQ-5D-5L form a moderate to strong Mokken scale, enabling persons to be ordered using the level summary score. Item ordering suggests that the lower range of the score represents mainly problems with pain and anxiety/depression, the mid-range indicates additional problems with mobility and usual activities, and middle to higher range of scores reveals additional limitations with self-care. Scalability and item ordering are even stronger when the anxiety/depression item is not included in the scale.
EQ-5D-5L 是一种经过充分验证的健康问卷,通过应用基于偏好的权重来估计健康效用。当效用权重不可用或不适用时,很少有研究探讨替代评分方法。我们检验了莫肯分层分析法是否可以阐明:1)水平总结评分是否适用于 EQ-5D-5L;2)对这种评分的解释。
使用 R 包“mokken”来评估单调性(分级系数 H、自动项目选择程序)和明显不变的项目排序(MIIO:配对项目反应函数[IRF]、H)。我们使用了一个丰富的数据集(多仪器比较,MIC),其中包括来自六个西方国家的 EQ-5D-5L 数据。
虽然 EQ-5D-5L 的所有项目都表现出单调性,但焦虑/抑郁(AD)项目的可分级性较弱(H=0.377)。没有 AD,可分级性从 H=0.559 提高到 H=0.714。MIIO 表明,这 5 个项目可以排序,在 MIC 数据中排序的准确性中等(H=0.463)。排除 AD 后,H 值提高到 0.743。结果在疾病和国家亚组中基本一致。
EQ-5D-5L 的 5 个项目构成了一个从中等到较强的莫肯分层,使人们能够使用水平总结评分对其进行排序。项目排序表明,评分的较低范围主要代表疼痛和焦虑/抑郁方面的问题,中间范围表示移动和日常活动方面的额外问题,而中间到较高的评分范围则显示出自我护理方面的额外限制。当不包括焦虑/抑郁项目时,可分级性和项目排序更强。