Amsterdam UMC, Vrije Universiteit Amsterdam, Epidemiology and Data Science, Amsterdam Public Health Research Institute, de Boelelaan 1117, Amsterdam, the Netherlands.
Amsterdam Rehabilitation Research Center Reade, Amsterdam, the Netherlands.
J Clin Epidemiol. 2021 Jun;134:1-13. doi: 10.1016/j.jclinepi.2021.01.011. Epub 2021 Jan 30.
PROMIS offers computerized adaptive tests (CATs) of patient-reported outcomes, using a single set of US-based item response theory (IRT) item parameters across populations and language versions. Country-specific item parameters have local appeal, but also disadvantages. We illustrate the effects of choosing US or country-specific item parameters on PROMIS CAT T-scores.
Simulations were performed on response data from Dutch chronic pain patients (n = 1110) who completed the PROMIS Pain Behavior item bank. We compared CAT T-scores obtained with (1) US item parameters for all items; (2) Dutch item parameters for all items; (3) US item parameters for items free of differential item functioning (DIF) and Dutch item parameters (rescaled to the US metric) for DIF items; (4) Dutch item parameters for all items (rescaled to the US metric).
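The rescaling step in conditions (3) and (4) can be illustrated with a linear transformation of the item parameters. The abstract does not state which linking method was used, so the sketch below assumes the mean/sigma method applied to threshold parameters of anchor items; the function names and any numbers passed to them are hypothetical. It relies only on the standard facts that a linear change of the latent scale, theta_ref = A * theta_new + B, leaves model probabilities unchanged when discriminations are divided by A and thresholds are transformed the same way as theta, and that T-scores have mean 50 and SD 10.

```python
from statistics import mean, pstdev


def mean_sigma_constants(b_ref, b_new):
    """Return (A, B) such that theta_ref = A * theta_new + B.

    b_ref, b_new: threshold parameters of the same anchor items,
    estimated on the reference metric and on the new metric.
    Assumption: mean/sigma linking; the study may have used another method.
    """
    A = pstdev(b_ref) / pstdev(b_new)
    B = mean(b_ref) - A * mean(b_new)
    return A, B


def rescale_item(a_new, b_new, A, B):
    """Put one item's discrimination and thresholds on the reference metric."""
    a_ref = a_new / A
    b_ref = [A * b + B for b in b_new]
    return a_ref, b_ref


def t_score(theta):
    """Convert a latent trait estimate to the T-score metric (mean 50, SD 10)."""
    return 50.0 + 10.0 * theta
```

With this parameterization, a_ref * (theta_ref - b_ref) equals a_new * (theta_new - b_new), so the rescaled item behaves identically and only the metric changes.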
Without anchoring to a common metric, CAT T-scores cannot be compared. When scores were rescaled to the US metric, mean differences in CAT T-scores based on US vs. Dutch item parameters were negligible. However, 0.9%-4.3% of the T-score differences were larger than 5 points (0.5 SD).
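The two summaries reported here, the mean difference and the share of patients whose T-scores differ by more than 5 points, can be computed as in the following minimal sketch. The function name and the example values are hypothetical and are not taken from the study data.

```python
def summarize_differences(t_scores_a, t_scores_b, threshold=5.0):
    """Compare paired T-scores from two scoring conditions.

    Returns the mean signed difference and the proportion of pairs whose
    absolute difference exceeds `threshold` (5 T-score points = 0.5 SD).
    """
    diffs = [a - b for a, b in zip(t_scores_a, t_scores_b)]
    mean_diff = sum(diffs) / len(diffs)
    share_large = sum(abs(d) > threshold for d in diffs) / len(diffs)
    return mean_diff, share_large


# Hypothetical scores for four patients under two parameter sets
us_based = [50.0, 60.0, 40.0, 55.0]
dutch_rescaled = [51.0, 53.0, 41.0, 55.0]
mean_diff, share_large = summarize_differences(us_based, dutch_rescaled)
```

A near-zero mean difference can coexist with a non-trivial share of large individual differences, because positive and negative differences cancel in the mean.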
The choice of item parameters can be consequential for individual patient scores. We recommend more studies of translated CATs to examine whether strategies that allow for country-specific item parameters should be further investigated.