Barreñada Lasai, Ledger Ashleigh, Dhiman Paula, Collins Gary, Wynants Laure, Verbakel Jan Y, Timmerman Dirk, Valentin Lil, Van Calster Ben
Department of Development and Regeneration, KU Leuven, Leuven, Belgium.
Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford Centre for Statistics in Medicine, Oxford, UK.
BMJ Med. 2024 Feb 17;3(1):e000817. doi: 10.1136/bmjmed-2023-000817. eCollection 2024.
To conduct a systematic review of studies externally validating the ADNEX (Assessment of Different Neoplasias in the adnexa) model for diagnosis of ovarian cancer and to present a meta-analysis of its performance.
Systematic review and meta-analysis of external validation studies.
Medline, Embase, Web of Science, Scopus, and Europe PMC, from 15 October 2014 to 15 May 2023.
All external validation studies of the performance of ADNEX, with any study design and any study population of patients with an adnexal mass. Two independent reviewers extracted the data. Disagreements were resolved by discussion. Reporting quality of the studies was scored with the TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) reporting guideline, and methodological conduct and risk of bias with PROBAST (Prediction model Risk Of Bias Assessment Tool). Random effects meta-analysis of the area under the receiver operating characteristic curve (AUC), sensitivity and specificity at the 10% risk of malignancy threshold, and net benefit and relative utility at the 10% risk of malignancy threshold were performed.
47 studies (17 007 tumours) were included, with a median study sample size of 261 (range 24-4905). On average, 61% of TRIPOD items were reported. Handling of missing data, justification of sample size, and model calibration were rarely described. 91% of validations were at high risk of bias, mainly because of the unexplained exclusion of incomplete cases, small sample size, or no assessment of calibration. The summary AUC to distinguish benign from malignant tumours in patients who underwent surgery was 0.93 (95% confidence interval 0.92 to 0.94, 95% prediction interval 0.85 to 0.98) for ADNEX with the serum biomarker, cancer antigen 125 (CA125), as a predictor (9202 tumours, 43 centres, 18 countries, and 21 studies) and 0.93 (95% confidence interval 0.91 to 0.94, 95% prediction interval 0.85 to 0.98) for ADNEX without CA125 (6309 tumours, 31 centres, 13 countries, and 12 studies). The estimated probability that the model has use clinically in a new centre was 95% (with CA125) and 91% (without CA125). When restricting analysis to studies with a low risk of bias, summary AUC values were 0.93 (with CA125) and 0.91 (without CA125), and estimated probabilities that the model has use clinically were 89% (with CA125) and 87% (without CA125).
The results of the meta-analysis indicated that ADNEX performed well in distinguishing between benign and malignant tumours in populations from different countries and settings, regardless of whether the serum biomarker, CA125, was used as a predictor. A key limitation was that calibration was rarely assessed.
PROSPERO CRD42022373182.
对外部验证用于诊断卵巢癌的ADNEX(附件区不同肿瘤评估)模型的研究进行系统评价,并对其性能进行荟萃分析。
对外部验证研究进行系统评价和荟萃分析。
2014年10月15日至2023年5月15日期间的Medline、Embase、Web of Science、Scopus和欧洲生物医学中心。
所有关于ADNEX性能的外部验证研究,研究设计和研究人群为患有附件包块的患者。两名独立的审阅者提取数据。分歧通过讨论解决。研究的报告质量根据TRIPOD(个体预后或诊断多变量预测模型的透明报告)报告指南进行评分,方法学实施和偏倚风险根据PROBAST(预测模型偏倚风险评估工具)进行评估。对接受手术患者中区分良性与恶性肿瘤的受试者工作特征曲线下面积(AUC)、恶性风险阈值为10%时的敏感性和特异性、以及恶性风险阈值为10%时的净效益和相对效用进行随机效应荟萃分析。
纳入47项研究(17007个肿瘤),研究样本量中位数为261(范围24 - 4905)。平均而言,TRIPOD项目的报告率为61%。很少描述缺失数据的处理、样本量的合理性以及模型校准情况。91%的验证存在高偏倚风险,主要原因是未解释地排除不完整病例、样本量小或未评估校准。对于以血清生物标志物癌抗原125(CA125)作为预测指标的ADNEX,在接受手术的患者中区分良性与恶性肿瘤的汇总AUC为0.93(95%置信区间0.92至0.94,95%预测区间0.85至0.98)(9202个肿瘤,43个中心,18个国家,21项研究);对于不使用CA125的ADNEX,汇总AUC为0.93(95%置信区间0.91至0.94,95%预测区间0.85至0.98)(6309个肿瘤,31个中心,13个国家,12项研究)。该模型在新中心临床应用的估计概率分别为95%(使用CA125)和91%(不使用CA125)。当将分析限制在低偏倚风险的研究时,汇总AUC值分别为0.93(使用CA125)和0.91(不使用CA125),模型临床应用的估计概率分别为89%(使用CA125)和87%(不使用CA125)。
荟萃分析结果表明,无论是否将血清生物标志物CA125用作预测指标,ADNEX在区分不同国家和环境人群中的良性与恶性肿瘤方面表现良好。一个关键限制是很少评估校准情况。
PROSPERO CRD42022373182