Palmer Emily O C, Ker Sheryl, Rentería Miguel E, Carmody Thomas, Rush A John
Holmusk Europe Ltd, London, UK.
KKT Technologies, Pte. Ltd, Singapore.
Neuropsychiatr Dis Treat. 2024 Mar 26;20:671-687. doi: 10.2147/NDT.S444223. eCollection 2024.
Major depressive disorder (MDD) is a leading cause of disability worldwide. An accurate assessment of depressive symptomology is crucial for clinical management and research. This study assessed the convergent validity, reliability, and total scale score interconversion across the 9-item Patient Health Questionnaire (PHQ-9) self-report, the 16-item Quick Inventory of Depressive Symptomatology-clinician report (QIDS-C) (two widely used clinical ratings) and the 5-item Very Brief Quick Inventory of Depressive Symptoms-clinician report (VQIDS-C), which evaluate the core features of MDD.
This study leveraged electronic health record (EHR)-derived, de-identified data from the NeuroBlu Database (Version 23R1), a longitudinal behavioural health real-world platform. Classical Test Theory (CTT) and Item Response Theory (IRT) analyses were used to evaluate the reliability, validity of, and conversions between the scales. The Test Information Function (TIF) was calculated for each scale, with greater test information reflecting higher precision and reliability in measuring depressive symptomology. IRT was also used to generate conversion tables so that total scores on each scale could be compared to the other.
The study sample (n = 2,156) had an average age of 36.4 years (standard deviation [SD] = 13.0) and 59.7% were female. The mean depression scores for the PHQ-9, QIDS-C, and VQIDS-C were 12.9 (SD = 6.6), 12.0 (SD = 4.9), and 6.18 (SD = 3.2), respectively. The Cronbach's alpha coefficients for PHQ-9, QIDS-C, and VQIDS-C were 0.9, 0.8, and 0.7, respectively, suggesting acceptable internal consistency. PHQ-9 (TIF = 30.3) demonstrated the best assessment of depressive symptomology, followed by QIDS-C (TIF = 25.8) and VQIDS-C (TIF = 17.7).
Overall, PHQ-9, QIDS-C, and VQIDS-C appear to be reliable and convertible measures of MDD symptomology within a US-based adult population in a real-world clinical setting.
重度抑郁症(MDD)是全球致残的主要原因。准确评估抑郁症状对于临床管理和研究至关重要。本研究评估了9项患者健康问卷(PHQ - 9)自我报告、16项抑郁症状快速量表 - 临床医生报告(QIDS - C)(两种广泛使用的临床评定量表)以及5项抑郁症状超简短快速量表 - 临床医生报告(VQIDS - C)之间的收敛效度、信度和总分相互转换情况,这些量表用于评估MDD的核心特征。
本研究利用了来自NeuroBlu数据库(23R1版本)的电子健康记录(EHR)衍生的、去识别化的数据,该数据库是一个纵向行为健康真实世界平台。采用经典测试理论(CTT)和项目反应理论(IRT)分析来评估各量表的信度、效度及相互转换情况。计算每个量表的测试信息函数(TIF),测试信息越大表明在测量抑郁症状方面的精度和信度越高。IRT还用于生成转换表,以便比较各量表的总分。
研究样本(n = 2156)的平均年龄为36.4岁(标准差[SD] = 13.0),59.7%为女性。PHQ - 9、QIDS - C和VQIDS - C的平均抑郁得分分别为12.9(SD = 6.6)、12.0(SD = 4.9)和6.18(SD = 3.2)。PHQ - 9、QIDS - C和VQIDS - C的克朗巴哈α系数分别为0.9、0.8和0.7,表明内部一致性可接受。PHQ - 9(TIF = 30.3)在评估抑郁症状方面表现最佳,其次是QIDS - C(TIF = 25.8)和VQIDS - C(TIF = 17.7)。
总体而言,在真实世界临床环境中的美国成年人群体中,PHQ - 9、QIDS - C和VQIDS - C似乎是可靠的且可相互转换的MDD症状测量工具。