Nikolaus Stephanie, Bode Christina, Taal Erik, Vonkeman Harald E, Glas Cees A W, van de Laar Mart A F J
Department of Psychology, Health & Technology, Faculty of Behavioral, Management and Social Sciences, University of Twente, Enschede, The Netherlands.
Expert Center for Chronic Fatigue, Radboud University Medical Center, Nijmegen, The Netherlands.
PLoS One. 2015 Dec 28;10(12):e0145008. doi: 10.1371/journal.pone.0145008. eCollection 2015.
Multidimensional computerized adaptive testing enables precise measurements of patient-reported outcomes at an individual level across different dimensions. This study examined the construct validity of a multidimensional computerized adaptive test (CAT) for fatigue in rheumatoid arthritis (RA).
The 'CAT Fatigue RA' was constructed based on a previously calibrated item bank. It contains 196 items and three dimensions: 'severity', 'impact' and 'variability' of fatigue. The CAT was administered to 166 patients with RA. They also completed a traditional, multidimensional fatigue questionnaire (BRAF-MDQ) and the SF-36 in order to examine the CAT's construct validity. A priori criterion for construct validity was that 75% of the correlations between the CAT dimensions and the subscales of the other questionnaires were as expected. Furthermore, comprehensive use of the item bank, measurement precision and score distribution were investigated.
The a priori criterion for construct validity was supported for two of the three CAT dimensions (severity and impact but not for variability). For severity and impact, 87% of the correlations with the subscales of the well-established questionnaires were as expected but for variability, 53% of the hypothesised relations were found. Eighty-nine percent of the items were selected between one and 137 times for CAT administrations. Measurement precision was excellent for the severity and impact dimensions, with more than 90% of the CAT administrations reaching a standard error below 0.32. The variability dimension showed good measurement precision with 90% of the CAT administrations reaching a standard error below 0.44. No floor- or ceiling-effects were found for the three dimensions.
The CAT Fatigue RA showed good construct validity and excellent measurement precision on the dimensions severity and impact. The dimension variability had less ideal measurement characteristics, pointing to the need to recalibrate the CAT item bank with a two-dimensional model, solely consisting of severity and impact.
多维计算机自适应测试能够在个体层面跨不同维度精确测量患者报告的结局。本研究检验了用于类风湿关节炎(RA)疲劳的多维计算机自适应测试(CAT)的结构效度。
“CAT疲劳RA”基于先前校准的题库构建。它包含196个条目和三个维度:疲劳的“严重程度”、“影响”和“变异性”。对166例RA患者进行了CAT测试。他们还完成了一份传统的多维疲劳问卷(BRAF-MDQ)和SF-36,以检验CAT的结构效度。结构效度的先验标准是CAT维度与其他问卷子量表之间75%的相关性符合预期。此外,还研究了题库的综合使用、测量精度和分数分布。
三个CAT维度中的两个(严重程度和影响,但变异性维度不满足)支持结构效度的先验标准。对于严重程度和影响,与成熟问卷子量表的相关性中87%符合预期,但对于变异性,仅发现53%的假设关系。89%的条目在CAT测试中被选择了1至137次。严重程度和影响维度的测量精度极佳,超过90%的CAT测试标准误差低于0.32。变异性维度显示出良好的测量精度,90%的CAT测试标准误差低于0.44。三个维度均未发现地板效应或天花板效应。
CAT疲劳RA在严重程度和影响维度上显示出良好的结构效度和出色的测量精度。变异性维度的测量特征不太理想,这表明需要用仅由严重程度和影响组成的二维模型对CAT题库进行重新校准。