Epidemiology, Biostatistics and Prevention Institute, University of Zürich, Zurich, Switzerland.
Institute for Medical Information Processing, Biometry and Epidemiology, Ludwig-Maximilians-Universität München, Munich, Germany.
Cochrane Database Syst Rev. 2023 Sep 8;9(9):CD013606. doi: 10.1002/14651858.CD013606.pub2.
Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system that affects millions of people worldwide. The disease course varies greatly across individuals and many disease-modifying treatments with different safety and efficacy profiles have been developed recently. Prognostic models evaluated and shown to be valid in different settings have the potential to support people with MS and their physicians during the decision-making process for treatment or disease/life management, allow stratified and more precise interpretation of interventional trials, and provide insights into disease mechanisms. Many researchers have turned to prognostic models to help predict clinical outcomes in people with MS; however, to our knowledge, no widely accepted prognostic model for MS is being used in clinical practice yet.
To identify and summarise multivariable prognostic models, and their validation studies for quantifying the risk of clinical disease progression, worsening, and activity in adults with MS.
We searched MEDLINE, Embase, and the Cochrane Database of Systematic Reviews from January 1996 until July 2021. We also screened the reference lists of included studies and relevant reviews, and references citing the included studies.
We included all statistically developed multivariable prognostic models aiming to predict clinical disease progression, worsening, and activity, as measured by disability, relapse, conversion to definite MS, conversion to progressive MS, or a composite of these in adult individuals with MS. We also included any studies evaluating the performance of (i.e. validating) these models. There were no restrictions based on language, data source, timing of prognostication, or timing of outcome.
Pairs of review authors independently screened titles/abstracts and full texts, extracted data using a piloted form based on the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS), assessed risk of bias using the Prediction Model Risk Of Bias Assessment Tool (PROBAST), and assessed reporting deficiencies based on the checklist items in Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD). The characteristics of the included models and their validations are described narratively. We planned to meta-analyse the discrimination and calibration of models with at least three external validations outside the model development study but no model met this criterion. We summarised between-study heterogeneity narratively but again could not perform the planned meta-regression.
We included 57 studies, from which we identified 75 model developments, 15 external validations corresponding to only 12 (16%) of the models, and six author-reported validations. Only two models were externally validated multiple times. None of the identified external validations were performed by researchers independent of those that developed the model. The outcome was related to disease progression in 39 (41%), relapses in 8 (8%), conversion to definite MS in 17 (18%), and conversion to progressive MS in 27 (28%) of the 96 models or validations. The disease and treatment-related characteristics of included participants, and definitions of considered predictors and outcome, were highly heterogeneous amongst the studies. Based on the publication year, we observed an increase in the percent of participants on treatment, diversification of the diagnostic criteria used, an increase in consideration of biomarkers or treatment as predictors, and increased use of machine learning methods over time. Usability and reproducibility All identified models contained at least one predictor requiring the skills of a medical specialist for measurement or assessment. Most of the models (44; 59%) contained predictors that require specialist equipment likely to be absent from primary care or standard hospital settings. Over half (52%) of the developed models were not accompanied by model coefficients, tools, or instructions, which hinders their application, independent validation or reproduction. The data used in model developments were made publicly available or reported to be available on request only in a few studies (two and six, respectively). Risk of bias We rated all but one of the model developments or validations as having high overall risk of bias. The main reason for this was the statistical methods used for the development or evaluation of prognostic models; we rated all but two of the included model developments or validations as having high risk of bias in the analysis domain. None of the model developments that were externally validated or these models' external validations had low risk of bias. There were concerns related to applicability of the models to our research question in over one-third (38%) of the models or their validations. Reporting deficiencies Reporting was poor overall and there was no observable increase in the quality of reporting over time. The items that were unclearly reported or not reported at all for most of the included models or validations were related to sample size justification, blinding of outcome assessors, details of the full model or how to obtain predictions from it, amount of missing data, and treatments received by the participants. Reporting of preferred model performance measures of discrimination and calibration was suboptimal.
AUTHORS' CONCLUSIONS: The current evidence is not sufficient for recommending the use of any of the published prognostic prediction models for people with MS in clinical routine today due to lack of independent external validations. The MS prognostic research community should adhere to the current reporting and methodological guidelines and conduct many more state-of-the-art external validation studies for the existing or newly developed models.
多发性硬化症(MS)是一种影响全球数百万人的中枢神经系统慢性炎症性疾病。该疾病的病程在个体之间差异很大,最近已经开发出许多具有不同安全性和疗效特征的疾病修正治疗方法。在不同环境中评估并被证明有效的预后模型有可能在决策治疗或疾病/生活管理过程中为 MS 患者及其医生提供支持,允许对干预试验进行分层和更精确的解释,并深入了解疾病机制。许多研究人员已经转向预后模型来帮助预测 MS 患者的临床结局;然而,据我们所知,目前还没有被广泛接受的 MS 预后模型用于临床实践。
确定和总结多变量预后模型及其验证研究,以量化成人 MS 患者临床疾病进展、恶化和活动的风险。
我们检索了 MEDLINE、Embase 和 Cochrane 系统评价数据库,检索时间从 1996 年 1 月到 2021 年 7 月。我们还筛选了纳入研究和相关综述的参考文献列表,以及引用纳入研究的参考文献。
我们纳入了所有旨在预测临床疾病进展、恶化和活动的统计学开发的多变量预后模型,这些模型通过残疾、复发、转化为明确的 MS、转化为进行性 MS 或这些的组合来衡量,这些模型适用于成年 MS 个体。我们还纳入了评估(即验证)这些模型性能的任何研究。基于语言、数据来源、预测的时间或结果的时间没有限制。
两名综述作者独立筛选标题/摘要和全文,使用基于关键评估和系统评价预测模型研究数据提取清单(CHARMS)的试用表格提取数据,使用预测模型风险评估工具(PROBAST)评估偏倚风险,并根据多变量预测个体预后或诊断模型透明报告清单(TRIPOD)项目评估报告缺陷。描述了纳入模型及其验证的特征。我们计划对至少有三个外部验证的模型进行荟萃分析,但没有一个模型符合这一标准。我们以叙述性方式总结了模型之间的异质性,但再次无法进行计划的元回归。
我们纳入了 57 项研究,从中确定了 75 项模型开发,15 项外部验证仅对应于 12 项(16%)模型,以及 6 项作者报告的验证。只有两个模型经过多次外部验证。没有一个外部验证是由与开发模型无关的研究人员进行的。识别出的外部验证中,有 39 项(41%)与疾病进展相关,8 项(8%)与复发相关,17 项(18%)与明确 MS 转化相关,27 项(28%)与进行性 MS 转化相关。纳入研究参与者的疾病和治疗相关特征以及考虑的预测因素和结果定义在研究之间高度异质。根据发表年份,我们观察到参与者接受治疗的比例增加,诊断标准的多样化,考虑生物标志物或治疗作为预测因素的增加,以及机器学习方法的使用增加。
所有确定的模型都至少包含一个需要医学专家进行测量或评估的预测因素。大多数模型(44 项;59%)包含预测因素,这些因素需要专门的设备,可能不存在于初级保健或标准医院环境中。超过一半(52%)的开发模型没有模型系数、工具或说明,这阻碍了它们的应用、独立验证或复制。模型开发中使用的数据仅在少数研究中公开提供或报告可根据请求提供(分别为两项和六项)。
我们将所有但一项模型开发或验证评估为具有高总体偏倚风险。主要原因是用于开发或评估预后模型的统计方法;我们将纳入的模型开发或验证中的除两项外的所有模型评估为在分析域中具有高偏倚风险。没有一个经过外部验证的模型开发或这些模型的外部验证具有低偏倚风险。在三分之一以上(38%)的模型或其验证中,存在对模型在我们研究问题中的适用性的担忧。
总体而言,报告质量较差,且随着时间的推移没有明显提高。纳入的模型或验证中,报告不清楚或根本没有报告的项目主要与样本量的合理性、结果评估者的盲法、完整模型的详细信息或如何从模型中获得预测、缺失数据量以及参与者接受的治疗有关。对区分和校准等首选模型性能测量的报告也不理想。
由于缺乏独立的外部验证,目前的证据不足以推荐在临床常规中使用任何已发表的 MS 预后预测模型。MS 预后研究界应遵守当前的报告和方法学准则,并为现有的或新开发的模型进行更多的最先进的外部验证研究。