SF-36健康调查作为骨关节炎和类风湿关节炎患者临床试验中的通用结局指标：数据质量、量表假设和得分可靠性测试

The SF-36 Health Survey as a generic outcome measure in clinical trials of patients with osteoarthritis and rheumatoid arthritis: tests of data quality, scaling assumptions and score reliability.

作者信息

Kosinski M, Keller S D, Hatoum H T, Kong S X, Ware J E

机构信息

Health Assessment Lab, Health Institute, New England Medical Center, Boston, MA 02111, USA.

出版信息

Med Care. 1999 May;37(5 Suppl):MS10-22. doi: 10.1097/00005650-199905001-00002.

DOI:10.1097/00005650-199905001-00002

PMID:10335740

Abstract

OBJECTIVE

To evaluate the psychometric assumptions underlying the construction and scoring of SF-36 scales and summary measures among clinical trial participants with arthritis.

METHODS

Cross-sectional SF-36 data from the baseline assessment of adult patients (n = 1,016) participating in four placebo-controlled clinical trials of treatment for arthritis were analyzed with blinding as to treatment. Tests of the completeness of data, scaling assumptions, internal-consistency reliability, and factor structure of SF-36 scales were performed for the combined sample. Eligible participants had at least a 6-month history of moderate to severe osteoarthritis or rheumatoid arthritis of the knee or hip. Participants meeting inclusion criteria had undergone a washout period of 3-14 days before baseline assessment to bring about a flare state in osteoarthritis or rheumatoid arthritis symptoms. Baseline sample sizes for the three osteoarthritis trials were n = 121, n = 341, and n = 187. The baseline sample size for the rheumatoid arthritis trial was n = 367. The average age of participants was 60 years, and the majority were females (72%). Measured were functional health and well-being scales and physical and mental health summary measures from the SF-36 Health Survey acute form.

RESULTS

Missing responses ranged from 0.0% to 1.5% across SF-36 items, and scale scores could be computed for 96.8% to 100% of participants across trials. In all four trials, item internal consistency tests were passed (91.4%-97.1%) and item discriminant validity tests were passed (96.9%-100.0%). Across the four trials, internal-consistency reliability coefficients ranged from a low of 0.75 to a high of 0.91 for the eight scales (median = 0.84), exceeding the minimum standards for group comparisons. Ceiling effects were minimal for most scales, and floor effects were noteworthy for the role physical and role emotional scales. Physical and mental health factors identified in previous studies were replicated.

CONCLUSION

The SF-36 Health Survey proved to be a psychometrically sound tool for the assessment of the health status of adult participants in clinical trials of arthritis.

摘要

目的

评估关节炎临床试验参与者中SF-36量表构建和评分背后的心理测量学假设以及汇总指标。

方法

对参与四项关节炎治疗安慰剂对照临床试验的成年患者（n = 1,016）基线评估的横断面SF-36数据进行分析，分析时对治疗情况设盲。对合并样本进行数据完整性、量表假设、内部一致性信度和SF-36量表因子结构的检验。符合条件的参与者至少有6个月中度至重度膝或髋骨关节炎或类风湿关节炎病史。符合纳入标准的参与者在基线评估前经过3 - 14天的洗脱期，以使骨关节炎或类风湿关节炎症状达到发作状态。三项骨关节炎试验的基线样本量分别为n = 121、n = 341和n = 187。类风湿关节炎试验的基线样本量为n = 367。参与者的平均年龄为60岁，大多数为女性（72%）。测量了SF-36健康调查急性版中的功能健康和幸福量表以及身心健康汇总指标。

结果

SF-36各项目的缺失回答率在0.0%至1.5%之间，各试验中96.8%至100%的参与者可计算量表得分。在所有四项试验中，项目内部一致性检验通过（91.4% - 97.1%），项目区分效度检验通过（96.9% - 100.0%）。在四项试验中，八个量表的内部一致性信度系数范围从低的0.75到高的0.91（中位数 = 0.84），超过了组间比较的最低标准。大多数量表的天花板效应最小，而角色身体和角色情感量表的地板效应值得注意。重复了先前研究中确定的身体和心理健康因子。