Department of Medicine, Stanford University School of Medicine, 1000 Welch Road, Suite 203, Stanford, CA 94304, USA.
Arthritis Res Ther. 2009;11(6):R191. doi: 10.1186/ar2890. Epub 2009 Dec 16.
Physical function is a key component of patient-reported outcome (PRO) assessment in rheumatology. Modern psychometric methods, such as Item Response Theory (IRT) and Computerized Adaptive Testing, can materially improve measurement precision at the item level. We present the qualitative and quantitative item-evaluation process for developing the Patient Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank.
The process was stepwise: we searched extensively to identify extant Physical Function items and then classified and selectively reduced the item pool. We evaluated retained items for content, clarity, relevance and comprehension, reading level, and translation ease by experts and patient surveys, focus groups, and cognitive interviews. We then assessed items by using classic test theory and IRT, used confirmatory factor analyses to estimate item parameters, and graded response modeling for parameter estimation. We retained the 20 Legacy (original) Health Assessment Questionnaire Disability Index (HAQ-DI) and the 10 SF-36's PF-10 items for comparison. Subjects were from rheumatoid arthritis, osteoarthritis, and healthy aging cohorts (n = 1,100) and a national Internet sample of 21,133 subjects.
We identified 1,860 items. After qualitative and quantitative evaluation, 124 newly developed PROMIS items composed the PROMIS item bank, which included revised Legacy items with good fit that met IRT model assumptions. Results showed that the clearest and best-understood items were simple, in the present tense, and straightforward. Basic tasks (like dressing) were more relevant and important versus complex ones (like dancing). Revised HAQ-DI and PF-10 items with five response options had higher item-information content than did comparable original Legacy items with fewer response options. IRT analyses showed that the Physical Function domain satisfied general criteria for unidimensionality with one-, two-, three-, and four-factor models having comparable model fits. Correlations between factors in the test data sets were > 0.90.
Item improvement must underlie attempts to improve outcome assessment. The clear, personally important and relevant, ability-framed items in the PROMIS Physical Function item bank perform well in PRO assessment. They will benefit from further study and application in a wider variety of rheumatic diseases in diverse clinical groups, including those at the extremes of physical functioning, and in different administration modes.
身体功能是风湿病患者报告结局(PRO)评估的关键组成部分。现代心理计量学方法,如项目反应理论(IRT)和计算机自适应测试,可以在项目水平上显著提高测量精度。我们介绍了开发患者报告结局测量信息系统(PROMIS)身体功能项目库的定性和定量项目评估过程。
该过程是逐步进行的:我们广泛搜索以确定现有的身体功能项目,然后对项目进行分类和有选择地减少项目库。我们通过专家和患者调查、焦点小组和认知访谈,评估保留项目的内容、清晰度、相关性和理解、阅读水平以及翻译难易程度。然后,我们使用经典测试理论和 IRT 评估项目,使用验证性因素分析估计项目参数,并进行分级响应建模以进行参数估计。我们保留了 20 个原始 Health Assessment Questionnaire Disability Index(HAQ-DI)和 10 个 SF-36 的 PF-10 项目进行比较。研究对象来自类风湿关节炎、骨关节炎和健康老龄化队列(n=1100)以及全国互联网样本 21133 名受试者。
我们确定了 1860 个项目。经过定性和定量评估,124 个新开发的 PROMIS 项目组成了 PROMIS 项目库,其中包括符合 IRT 模型假设的经过修订的 Legacy 项目。结果表明,最清晰和最容易理解的项目是简单的、现在时态的和直接的。基本任务(如穿衣)比复杂任务(如跳舞)更相关和重要。修订后的 HAQ-DI 和 PF-10 项目具有五个反应选项,其项目信息含量高于具有较少反应选项的可比原始 Legacy 项目。IRT 分析表明,身体功能域满足单维性的一般标准,具有单因素、双因素、三因素和四因素模型,模型拟合度相当。测试数据集之间的因子相关性>0.90。
必须改进项目才能尝试改进结局评估。PROMIS 身体功能项目库中的清晰、个人重要且相关、以能力为框架的项目在 PRO 评估中表现良好。它们将受益于进一步在更广泛的风湿性疾病、不同临床群体中,包括身体功能处于极端状态的患者,以及不同的管理模式中进行研究和应用。