Vale Luke, Kunonga Patience, Coughlan Diarmuid, Kontogiannis Vasileios, Astin Margaret, Beyer Fiona, Richmond Catherine, Wilson Dor, Bajwa Dalvir, Javanbakht Mehdi, Bryant Andrew, Akor Wanwuri, Craig Dawn, Lovat Penny, Labus Marie, Nasr Batoul, Cunliffe Timothy, Hinde Helena, Shawgi Mohamed, Saleh Daniel, Royle Pam, Steward Paul, Lucas Rachel, Ellis Robert
Institute of Health & Society, Newcastle University, Newcastle upon Tyne, UK.
Institute of Cellular Medicine, Newcastle University, Newcastle upon Tyne, UK.
Health Technol Assess. 2021 Nov;25(64):1-178. doi: 10.3310/hta25640.
Malignant melanoma is the fifth most common cancer in the UK, with rates continuing to rise, resulting in considerable burden to patients and the NHS.
The objectives were to evaluate the effectiveness and cost-effectiveness of current and alternative follow-up strategies for stage IA and IB melanoma.
Three systematic reviews were conducted. (1) The effectiveness of surveillance strategies. Outcomes were detection of new primaries, recurrences, metastases and survival. Risk of bias was assessed using the Cochrane Collaboration's Risk-of-Bias 2.0 tool. (2) Prediction models to stratify by risk of recurrence, metastases and survival. Model performance was assessed by study-reported measures of discrimination (e.g. D-statistic, Harrel's -statistic), calibration (e.g. the Hosmer-Lemeshow 'goodness-of-fit' test) or overall performance (e.g. Brier score, ). Risk of bias was assessed using the Prediction model Risk Of Bias ASsessment Tool (PROBAST). (3) Diagnostic test accuracy of fine-needle biopsy and ultrasonography. Outcomes were detection of new primaries, recurrences, metastases and overall survival. Risk of bias was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool. Review data and data from elsewhere were used to model the cost-effectiveness of alternative surveillance strategies and the value of further research.
(1) The surveillance review included one randomised controlled trial. There was no evidence of a difference in new primary or recurrence detected (risk ratio 0.75, 95% confidence interval 0.43 to 1.31). Risk of bias was considered to be of some concern. Certainty of the evidence was low. (2) Eleven risk prediction models were identified. Discrimination measures were reported for six models, with the area under the operating curve ranging from 0.59 to 0.88. Three models reported calibration measures, with coefficients of ≥ 0.88. Overall performance was reported by two models. In one, the Brier score was slightly better than the American Joint Committee on Cancer scheme score. The other reported an of 0.47 (95% confidence interval 0.45 to 0.49). All studies were judged to have a high risk of bias. (3) The diagnostic test accuracy review identified two studies. One study considered fine-needle biopsy and the other considered ultrasonography. The sensitivity and specificity for fine-needle biopsy were 0.94 (95% confidence interval 0.90 to 0.97) and 0.95 (95% confidence interval 0.90 to 0.97), respectively. For ultrasonography, sensitivity and specificity were 1.00 (95% confidence interval 0.03 to 1.00) and 0.99 (95% confidence interval 0.96 to 0.99), respectively. For the reference standards and flow and timing domains, the risk of bias was rated as being high for both studies. The cost-effectiveness results suggest that, over a lifetime, less intensive surveillance than recommended by the National Institute for Health and Care Excellence might be worthwhile. There was considerable uncertainty. Improving the diagnostic performance of cancer nurse specialists and introducing a risk prediction tool could be promising. Further research on transition probabilities between different stages of melanoma and on improving diagnostic accuracy would be of most value.
Overall, few data of limited quality were available, and these related to earlier versions of the American Joint Committee on Cancer staging. Consequently, there was considerable uncertainty in the economic evaluation.
Despite adoption of rigorous methods, too few data are available to justify changes to the National Institute for Health and Care Excellence recommendations on surveillance. However, alternative strategies warrant further research, specifically on improving estimates of incidence, progression of recurrent disease; diagnostic accuracy and health-related quality of life; developing and evaluating risk stratification tools; and understanding patient preferences.
This study is registered as PROSPERO CRD42018086784.
This project was funded by the National Institute for Health Research Health Technology Assessment programme and will be published in full in ; Vol 25, No. 64. See the NIHR Journals Library website for further project information.
恶性黑色素瘤是英国第五大常见癌症,发病率持续上升,给患者和国民医疗服务体系带来了相当大的负担。
评估IA期和IB期黑色素瘤当前及替代随访策略的有效性和成本效益。
进行了三项系统综述。(1)监测策略的有效性。结局包括新原发性肿瘤、复发、转移的检测及生存率。使用Cochrane协作网的偏倚风险2.0工具评估偏倚风险。(2)按复发、转移和生存风险分层的预测模型。通过研究报告的区分度测量指标(如D统计量、Harrel's 统计量)、校准指标(如Hosmer-Lemeshow拟合优度检验)或整体性能指标(如Brier评分)评估模型性能。使用预测模型偏倚风险评估工具(PROBAST)评估偏倚风险。(3)细针穿刺活检和超声检查的诊断试验准确性。结局包括新原发性肿瘤、复发、转移的检测及总生存率。使用诊断准确性研究质量评估-2(QUADAS-2)工具评估偏倚风险。综述数据及其他来源的数据用于模拟替代监测策略的成本效益及进一步研究的价值。
(1)监测综述纳入了一项随机对照试验。未发现新原发性肿瘤或复发检测存在差异的证据(风险比0.75,95%置信区间0.43至1.31)。偏倚风险被认为存在一定问题。证据的确定性较低。(2)识别出11个风险预测模型。6个模型报告了区分度测量指标,操作曲线下面积范围为0.59至0.88。3个模型报告了校准指标,系数≥0.88。2个模型报告了整体性能。其中一个模型的Brier评分略优于美国癌症联合委员会方案评分。另一个模型报告的 值为0.47(95%置信区间0.45至0.49)。所有研究均被判定存在高偏倚风险。(3)诊断试验准确性综述识别出两项研究。一项研究考虑了细针穿刺活检,另一项研究考虑了超声检查。细针穿刺活检的敏感性和特异性分别为0.94(95%置信区间0.90至0.97)和0.95(95%置信区间0.90至0.97)。对于超声检查,敏感性和特异性分别为1.00(95%置信区间0.03至1.00)和0.99(95%置信区间0.96至0.99)。对于参考标准以及流程和时间领域,两项研究的偏倚风险均被评为高。成本效益结果表明,在一生中,采用比英国国家卫生与临床优化研究所推荐的强度更低的监测可能是值得的。存在相当大的不确定性。提高癌症专科护士的诊断性能并引入风险预测工具可能很有前景。对黑色素瘤不同阶段之间的转移概率以及提高诊断准确性进行进一步研究将最具价值。
总体而言,可用的数据质量有限且数量较少,并且这些数据与美国癌症联合委员会分期的早期版本相关。因此,经济评估存在相当大的不确定性。
尽管采用了严格的方法,但可用数据过少,无法证明改变英国国家卫生与临床优化研究所关于监测的建议是合理的。然而,替代策略值得进一步研究,特别是在改善发病率估计、复发性疾病进展;诊断准确性和健康相关生活质量;开发和评估风险分层工具;以及了解患者偏好方面。
本研究已注册为PROSPERO CRD42018086784。
本项目由英国国家卫生研究院卫生技术评估计划资助,并将在《》第25卷第64期全文发表。有关更多项目信息,请访问英国国家卫生研究院期刊图书馆网站。