Mental Health and Addiction Research Group, Department of Health Sciences, University of York, York, UK.
Hull York Medical School, University of York, York, UK.
Cochrane Database Syst Rev. 2021 May 6;5(5):CD013491. doi: 10.1002/14651858.CD013491.pub2.
Relapse (the re-emergence of depressive symptoms after some level of improvement but preceding recovery) and recurrence (onset of a new depressive episode after recovery) are common in depression, lead to worse outcomes and quality of life for patients and exert a high economic cost on society. Outcomes can be predicted by using multivariable prognostic models, which use information about several predictors to produce an individualised risk estimate. The ability to accurately predict relapse or recurrence while patients are well (in remission) would allow the identification of high-risk individuals and may improve overall treatment outcomes for patients by enabling more efficient allocation of interventions to prevent relapse and recurrence.
To summarise the predictive performance of prognostic models developed to predict the risk of relapse, recurrence, sustained remission or recovery in adults with major depressive disorder who meet criteria for remission or recovery.
We searched the Cochrane Library (current issue); Ovid MEDLINE (1946 onwards); Ovid Embase (1980 onwards); Ovid PsycINFO (1806 onwards); and Web of Science (1900 onwards) up to May 2020. We also searched sources of grey literature, screened the reference lists of included studies and performed a forward citation search. There were no restrictions applied to the searches by date, language or publication status .
We included development and external validation (testing model performance in data separate from the development data) studies of any multivariable prognostic models (including two or more predictors) to predict relapse, recurrence, sustained remission, or recovery in adults (aged 18 years and over) with remitted depression, in any clinical setting. We included all study designs and accepted all definitions of relapse, recurrence and other related outcomes. We did not specify a comparator prognostic model.
Two review authors independently screened references; extracted data (using a template based on the CHecklist for critical Appraisal and data extraction for systematic Reviews of prediction Modelling Studies (CHARMS)); and assessed risks of bias of included studies (using the Prediction model Risk Of Bias ASsessment Tool (PROBAST)). We referred any disagreements to a third independent review author. Where we found sufficient (10 or more) external validation studies of an individual model, we planned to perform a meta-analysis of its predictive performance, specifically with respect to its calibration (how well the predicted probabilities match the observed proportions of individuals that experience the outcome) and discrimination (the ability of the model to differentiate between those with and without the outcome). Recommendations could not be qualified using the GRADE system, as guidance is not yet available for prognostic model reviews.
We identified 11 eligible prognostic model studies (10 unique prognostic models). Seven were model development studies; three were model development and external validation studies; and one was an external validation-only study. Multiple estimates of performance measures were not available for any of the models and, meta-analysis was therefore not possible. Ten out of the 11 included studies were assessed as being at high overall risk of bias. Common weaknesses included insufficient sample size, inappropriate handling of missing data and lack of information about discrimination and calibration. One paper (Klein 2018) was at low overall risk of bias and presented a prognostic model including the following predictors: number of previous depressive episodes, residual depressive symptoms and severity of the last depressive episode. The external predictive performance of this model was poor (C-statistic 0.59; calibration slope 0.56; confidence intervals not reported). None of the identified studies examined the clinical utility (net benefit) of the developed model.
AUTHORS' CONCLUSIONS: Of the 10 prognostic models identified (across 11 studies), only four underwent external validation. Most of the studies (n = 10) were assessed as being at high overall risk of bias, and the one study that was at low risk of bias presented a model with poor predictive performance. There is a need for improved prognostic research in this clinical area, with future studies conforming to current best practice recommendations for prognostic model development/validation and reporting findings in line with the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement.
在抑郁症中,复发(抑郁症状在一定程度改善后再次出现)和再发(缓解后新的抑郁发作)很常见,会导致患者的预后和生活质量更差,并给社会带来高昂的经济成本。预后模型可用于预测结局,该模型使用多个预测因子的信息来生成个体化风险估计。在患者病情缓解(缓解)时准确预测复发或再发的能力将有助于识别高危个体,并通过更有效地分配干预措施来预防复发和再发,从而改善患者的整体治疗结局。
总结用于预测符合缓解或缓解标准的成年重性抑郁症患者复发、再发、持续缓解或缓解风险的预后模型的预测性能。
我们检索了 Cochrane 图书馆(当前刊期);Ovid MEDLINE(1946 年以后);Ovid Embase(1980 年以后);Ovid PsycINFO(1806 年以后)和 Web of Science(1900 年以后),截至 2020 年 5 月。我们还检索了灰色文献来源,筛选了纳入研究的参考文献,并进行了前瞻性引文搜索。检索没有对日期、语言或出版状态进行限制。
我们纳入了开发和外部验证(在与开发数据不同的数据中测试模型性能)的任何多变量预后模型(包括两个或多个预测因子)的研究,以预测处于缓解期的成年抑郁症患者(年龄 18 岁及以上)的复发、再发、持续缓解或缓解。我们纳入了所有研究设计,并接受了所有关于复发、再发和其他相关结局的定义。我们没有指定比较预后模型。
两名综述作者独立筛选参考文献;提取数据(使用基于 Checklist for critical Appraisal and data extraction for systematic Reviews of prediction Modelling Studies(CHARMS)模板);并评估纳入研究的偏倚风险(使用 Prediction model Risk Of Bias ASsessment Tool(PROBAST))。我们将任何分歧提交给第三位独立的综述作者。如果我们发现了足够数量(10 个或更多)的单个模型的外部验证研究,我们计划对其预测性能进行荟萃分析,特别是关于其校准(预测概率与实际发生结局的个体比例的匹配程度)和区分度(模型区分有无结局的能力)。由于目前还没有针对预后模型审查的指南,因此无法使用 GRADE 系统对建议进行分级。
我们确定了 11 项符合条件的预后模型研究(10 个独特的预后模型)。其中 7 项为模型开发研究;3 项为模型开发和外部验证研究;1 项为仅外部验证研究。对于任何模型,都没有获得多个性能指标的估计值,因此无法进行荟萃分析。纳入的 11 项研究中有 10 项被评估为总体偏倚风险较高。常见的弱点包括样本量不足、对缺失数据的处理不当以及缺乏关于区分度和校准的信息。一篇论文(Klein 2018)总体偏倚风险较低,并提出了一个包含以下预测因子的预后模型:既往抑郁发作次数、残留抑郁症状和最后一次抑郁发作的严重程度。该模型的外部预测性能较差(C 统计量 0.59;校准斜率 0.56;置信区间未报告)。没有一项研究检验了所开发模型的临床实用性(净收益)。
在所确定的 10 个(11 项研究中的)预后模型中,只有 4 个模型进行了外部验证。大多数研究(n=10)被评估为总体偏倚风险较高,而风险较低的那项研究提出的模型预测性能较差。在这一临床领域,需要进行改进的预后研究,未来的研究应符合当前最佳的预后模型开发/验证实践建议,并按照 Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis(TRIPOD)声明报告研究结果。