Division of Spinal Surgery, Departments of Orthopaedic and Neurosurgery, NYU Medical Center, NY Spine Institute, New York, NY, USA.
Department of Orthopaedics, Lenox Hill Hospital, Northwell Health, New York, NY, USA.
Eur Spine J. 2022 May;31(5):1184-1188. doi: 10.1007/s00586-021-07106-6. Epub 2022 Jan 11.
To determine the validity and responsiveness of PROMIS metrics versus the SRS-22r questionnaire in adult spinal deformity (ASD).
Surgical ASD patients undergoing ≥ 4 levels fused with complete baseline PROMIS and SRS-22r data were included. Internal consistency (Cronbach's alpha) and test-retest reliability [intraclass correlation coefficient (ICC)] were compared. Cronbach's alpha and ICC values ≥ 0.70 were predefined as satisfactory. Convergent validity was evaluated via Spearman's correlations. Responsiveness was assessed via paired samples t tests with Cohen's d to assess measure of effect (baseline to 3 months).
One hundred and ten pts are included. Mean baseline SRS-22r score was 2.62 ± 0.67 (domains = Function: 2.6, Pain: 2.5, Self-image: 2.2, Mental Health: 3.0). Mean PROMIS domains = Physical Function (PF): 12.4, Pain Intensity (PI): 91.7, Pain Interference (Int): 55.9. Cronbach's alpha, and ICC were not satisfactory for any SRS-22 and PROMIS domains. PROMIS-Int reliability was low for all SRS-22 domains (0.037-0.225). Convergent validity demonstrated strong correlation via Spearman's rho between PROMIS-PI and overall SRS-22r (- 0.61), SRS-22 Function (- 0.781), and SRS-22 Pain (- 0.735). PROMIS-PF had strong correlation with SRS-22 Function (0.643), while PROMIS-Int had moderate correlation with SRS-22 Pain (- 0.507). Effect size via Cohen's d showed that PROMIS had superior responsiveness across all domains except for self-image.
PROMIS is a valid measure compared to SRS-22r in terms of convergent validity, and has greater measure of effect in terms of responsiveness, but failed in reliability and internal consistency. Surgeons should consider the lack of reliability and internal consistency (despite validity and responsiveness) of the PROMIS to SRS-22r before replacing the traditional questionnaire with the computer-adaptive testing.
确定 PROMIS 指标与 SRS-22r 问卷在成人脊柱畸形(ASD)中的有效性和反应性。
纳入接受≥4 个节段融合且具有完整基线 PROMIS 和 SRS-22r 数据的手术 ASD 患者。比较内部一致性(克朗巴赫α)和重测信度[组内相关系数(ICC)]。预先设定克朗巴赫α和 ICC 值≥0.70 为满意。通过 Spearman 相关评估收敛效度。通过配对样本 t 检验和 Cohen's d 评估效应量(基线至 3 个月)来评估反应性。
共纳入 110 例患者。SRS-22r 基线平均得分 2.62±0.67(领域得分:功能:2.6,疼痛:2.5,自我形象:2.2,心理健康:3.0)。PROMIS 平均领域得分=身体功能(PF):12.4,疼痛强度(PI):91.7,疼痛干扰(Int):55.9。任何 SRS-22 和 PROMIS 领域的克朗巴赫α和 ICC 均不满意。对于所有 SRS-22 领域,PROMIS-Int 的可靠性均较低(0.037-0.225)。通过 Spearman's rho 评估,PROMIS-PI 与整体 SRS-22r(-0.61)、SRS-22 功能(-0.781)和 SRS-22 疼痛(-0.735)呈强相关。PROMIS-PF 与 SRS-22 功能呈强相关(0.643),而 PROMIS-Int 与 SRS-22 疼痛呈中度相关(-0.507)。Cohen's d 表示的效应量显示,PROMIS 在所有领域均具有较高的反应性,除自我形象外。
与 SRS-22r 相比,PROMIS 在收敛效度方面是有效的,在反应性方面具有更大的效应量,但在可靠性和内部一致性方面存在不足。在使用计算机自适应测试替代传统问卷之前,外科医生应考虑到 PROMIS 相对于 SRS-22r 的可靠性和内部一致性(尽管具有有效性和反应性)不足。