Department of Medical Social Sciences, Northwestern University Feinberg School of Medicine, 625 Michigan Ave, 21st Floor, Chicago, IL, 60611, USA.
Division of General Internal Medicine and Health Services Research, University of California Los Angeles, Department of Medicine, Los Angeles, CA, USA.
Qual Life Res. 2023 May;32(5):1341-1352. doi: 10.1007/s11136-022-03200-4. Epub 2022 Aug 3.
Individual change on a patient-reported outcome (PRO) measure can be assessed by statistical significance and meaningfulness to patients. We explored the relationship between these two criteria by varying the confidence levels of the coefficient of repeatability (CR) on the Patient-Reported Outcomes Measurement Information System (R) Physical Function (PF) 10a (PF10a) measure.
In a sample of 1129 adult cancer patients, we estimated individual-change thresholds on the PF10a from baseline to 6 weeks later with the CR at 50%, 68%, and 95% confidence. We also assessed agreement with group- and individual-level thresholds from anchor-based methods [mean change and receiver operating characteristic (ROC) curve] using a PF-specific patient global impression of change (PGIC).
CRs at 50%, 68%, and 95% confidence were 3, 4, and 7 raw score points, respectively. The ROC- and mean-change-based thresholds for deterioration were -4 and -6; for improvement they were both 2. Kappas for agreement between anchor-based thresholds and CRs for deterioration ranged between κ = 0.65 and 1.00, while for improvement, they ranged between 0.35 and 0.83. Agreement between the PGIC and all CRs always fell below "good" (κ < 0.40) for deterioration (0.30-0.33) and were lower for improvement (0.16-0.28).
In comparison to the CR at 95% confidence, CRs at 50% and 68% confidence (considered likely change indexes) have the advantage of maximizing the proportion of patients appropriately classified as changed according to statistical significance and meaningfulness.
可以通过统计学意义和对患者的意义来评估患者报告的结局(PRO)测量的个体变化。我们通过改变重复性系数(CR)在患者报告的结局测量信息系统(R)物理功能(PF)10a(PF10a)测量上的置信水平来探讨这两个标准之间的关系。
在 1129 名成年癌症患者的样本中,我们使用 CR 在 50%、68%和 95%的置信水平下,从基线估计 PF10a 的个体变化阈值,到 6 周后。我们还使用特定于 PF 的患者整体变化印象(PGIC)来评估基于锚定的方法(平均变化和接收者操作特征(ROC)曲线)的组和个体水平阈值的一致性。
CR 在 50%、68%和 95%的置信水平下分别为 3、4 和 7 个原始分数点。基于 ROC 和平均变化的恶化阈值分别为-4 和-6;对于改善,两者均为 2。基于锚定阈值和 CR 的恶化的一致性的 Kappa 值介于 κ=0.65 和 1.00 之间,而对于改善,它们介于 0.35 和 0.83 之间。PGIC 与所有 CR 的一致性始终低于恶化时的“良好”(κ<0.40)(0.30-0.33),而改善时的一致性较低(0.16-0.28)。
与 95%置信水平的 CR 相比,50%和 68%置信水平的 CR(被认为是可能的变化指标)具有根据统计学意义和对患者的意义最大化适当分类为变化的患者比例的优势。