Arthritis Centre Twente and University of Twente, Enschede, The Netherlands.
Arthritis Rheumatol. 2014 Oct;66(10):2900-8. doi: 10.1002/art.38759.
To evaluate and compare the measurement precision and sensitivity to change of the Health Assessment Questionnaire disability index (HAQ DI), the Short Form 36 physical functioning scale (PF-10), and simulated Patient-Reported Outcomes Measurement Information System (PROMIS) physical function computer adaptive tests (CATs) with 5, 10, and 15 items, using item response theory-based simulation studies.
The measurement precision of the various physical function instruments was evaluated by calculating root mean square errors (RMSEs) between true physical function levels (latent physical function score) and estimated physical function levels. Measurement precision was evaluated at 9 levels of physical function, with 5,000 simulated response patterns per level. Sensitivity to change was evaluated by the ability of a simple statistical test to detect simulated change scores of small to moderate magnitude (standardized effect sizes 0.20, 0.35, and 0.50).
RMSEs were smaller for the PROMIS physical function 15-item CAT (CAT-15) and CAT-10 than for the HAQ DI and PF-10 across all levels of the latent physical function scale. Only marginal improvement in performance was observed for the CAT-15 compared with the CAT-10, and the CAT-5 performed quite similarly to the HAQ DI and PF-10 across most levels of the latent physical function scale. Substantially improved sensitivity to change was observed for the CAT-10 compared with the HAQ DI and PF-10, particularly in detecting moderate effect sizes.
Clearly higher measurement precision was observed for the PROMIS CAT compared with the HAQ DI and PF-10. Higher reliability also translated into lower sample size requirements for detecting changes in clinical status.
通过基于项目反应理论的模拟研究,评估和比较健康评估问卷残疾指数(HAQ DI)、简化版 36 项健康调查躯体功能量表(PF-10)以及模拟患者报告结局测量信息系统(PROMIS)躯体功能计算机自适应测验(CAT)在 5、10 和 15 个项目时的测量精度和变化敏感性。
使用均方根误差(RMSE)来评估各种躯体功能量表的测量精度,RMSE 为真实躯体功能水平(潜在躯体功能评分)与估计躯体功能水平之间的差值。在 9 个躯体功能水平上评估测量精度,每个水平有 5000 个模拟反应模式。通过简单的统计检验来检测小到中等幅度的模拟变化分数(标准化效应大小分别为 0.20、0.35 和 0.50)来评估变化敏感性。
在潜在躯体功能量表的所有水平上,PROMIS 躯体功能 15 项 CAT(CAT-15)和 CAT-10 的 RMSE 均小于 HAQ DI 和 PF-10。与 CAT-10 相比,CAT-15 的性能仅略有提高,而 CAT-5 在潜在躯体功能量表的大多数水平上与 HAQ DI 和 PF-10 的表现非常相似。与 HAQ DI 和 PF-10 相比,CAT-10 的变化敏感性有了明显提高,尤其是在检测中等效应大小方面。
与 HAQ DI 和 PF-10 相比,PROMIS CAT 表现出明显更高的测量精度。更高的可靠性也转化为检测临床状态变化所需的更小样本量。