Riazi Afsane, Cano Stefan J, Cooper J Mark, Bradley Jane L, Schapira Anthony H V, Hobart Jeremy C
Neurological Outcomes Measures Unit, Institute of Neurology, London, United Kingdom.
Mov Disord. 2006 Sep;21(9):1396-403. doi: 10.1002/mds.20985.
The objective of this study was to examine the psychometric properties of four widely used generic health status measures in Friedreich's ataxia (FA), to determine their suitability as outcome measures. Fifty-six people with genetically confirmed FA completed the Barthel Index (BI), General Health Questionnaire (GHQ-12), EuroQol (EQ-5D), and Medical Outcomes Study 36-item Short Form Health Survey (SF-36) by means of postal survey. Six psychometric properties (data quality, scaling assumptions, acceptability, reliability, validity, and responsiveness) were examined. The response rate was 97%. In general, the psychometric properties of the four measures satisfied recommended criteria. However, closer examination highlighted limitations restricting their use for treatment trials. For example, the BI had high levels of missing data, EQ-5D had poor discriminant ability, and five SF-36 scales had high floor and/or ceiling effects. Most scale scores did not span the entire scale range, had means that differed notably from the scale mid-point, and had wide confidence intervals. Effect sizes (ES) were small for all four measures raising questions about their ability to detect clinically significant change. Results highlight the potential limitations of these four scales for evaluating health outcomes in FA and suggest the need for new disease-specific patient-based measures of its impact.
本研究的目的是检验四种广泛使用的通用健康状况测量工具在弗里德赖希共济失调(FA)中的心理测量特性,以确定它们作为结局指标的适用性。56名经基因确诊为FA的患者通过邮寄调查完成了巴氏指数(BI)、一般健康问卷(GHQ-12)、欧洲五维度健康量表(EQ-5D)和医学结局研究简明健康调查36项量表(SF-36)。研究考察了六种心理测量特性(数据质量、量表假设、可接受性、信度、效度和反应性)。回复率为97%。总体而言,这四种测量工具的心理测量特性符合推荐标准。然而,进一步检查发现了一些限制其用于治疗试验的局限性。例如,BI有大量缺失数据,EQ-5D的区分能力较差,SF-36的五个量表有较高的地板效应和/或天花板效应。大多数量表分数没有涵盖整个量表范围,其均值与量表中点有显著差异,且置信区间较宽。所有四种测量工具的效应量(ES)都较小,这引发了对它们检测临床显著变化能力的质疑。结果突出了这四种量表在评估FA健康结局方面的潜在局限性,并表明需要新的基于患者的疾病特异性测量工具来评估其影响。