Suppr超能文献

头颈部癌患者放疗后放射性并发症的预后模型

Prognostic models for radiation-induced complications after radiotherapy in head and neck cancer patients.

作者信息

Takada Toshihiko, Tambas Makbule, Clementel Enrico, Leeuwenberg Artuur, Sharabiani Marjan, Damen Johanna Aag, Dunias Zoë S, Nauta Jan F, Idema Demy L, Choi Jungyeon, Meijerink Lotta M, Langendijk Johannes A, Moons Karel Gm, Schuit Ewoud

机构信息

Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, Netherlands.

Department of General Medicine, Shirakawa Satellite for Teaching And Research (STAR), Fukushima Medical University, Shirakawa, Japan.

出版信息

Cochrane Database Syst Rev. 2025 Sep 10;9(9):CD014745. doi: 10.1002/14651858.CD014745.pub2.

Abstract

BACKGROUND

Radiotherapy is the mainstay of treatment for head and neck cancer (HNC) but may induce various side effects on surrounding normal tissues. To reach an optimal balance between tumour control and toxicity prevention, normal tissue complication probability (NTCP) models have been reported to predict the risk of radiation-induced side effects in patients with HNC. However, the quality of study design, conduct, and analysis (i.e. risk of bias (ROB)), as well as the predictive performance of these models, remains to be evaluated.

OBJECTIVES

To identify, describe and appraise NTCP models to predict the risk of radiation-induced side effects in patients with HNC.

SEARCH METHODS

We searched Ovid MEDLINE, Embase and the World Health Organization International Clinical Trials Registry Platform from conception to January 2024. In addition, we screened references cited in the retrieved articles.

SELECTION CRITERIA

Two review authors independently included articles reporting on the development and external validation of NTCP models to predict any type of radiation-induced side effects in patients with HNC.

DATA COLLECTION AND ANALYSIS

One reviewer extracted data from each article based on the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies and assessed their applicability and ROB using the Prediction model ROB Assessment Tool, while another reviewer carefully verified the results. For models externally validated at least twice for the same outcome as their original developmental study, we performed qualitative analyses of the model performance. The GRADE system was not applied, since it has not been established for reviews of prognostic model studies.

MAIN RESULTS

Amongst 592 models developed from 143 articles, including 140,767 HNC patients, only 49 (8%) models from six articles were judged to have low ROB and low concerns for applicability. No external validation was performed for 480 models (81%). For the remaining 112 models and six additional models which were not eligible for the present review, 152 external validations were performed in 34,304 patients with HNC in 41 articles. The results of models externally validated at least twice are discussed below. Models for xerostomia Amongst 275 models for xerostomia, two models were externally validated at least twice. The Beetz 2012b model for xerostomia six months after radiotherapy was validated in two studies. C-statistics ranged from 0.70 to 0.74. Calibration performance was reported in one study. One validation study was rated as having low ROB in all domains, while the other was rated as having high ROB in the analysis domain. The Cavallo 2021 model for acute xerostomia during radiotherapy for patients with nasopharyngeal cancer was externally validated in the same study, using two different types of cohorts. C-statistics ranged from 0.68 to 0.73 and calibration plots were reported in both cohorts. Both validations were rated as having unclear ROB in the participants' domain because no detailed information about recruiting was provided. Models for dysphagia Amongst 86 models for dysphagia, two models were externally validated at least twice. The Christianen 2012 model for dysphagia six months after radiotherapy was validated in five studies. C-statistics ranged from 0.66 to 0.75. Calibration performance was assessed in all of them, while four of them were rated as having high ROB in the analysis domain due to the small sample size. The Wopken 2014b model for tube feeding dependence six months after radiotherapy was validated in three external validation studies. C-statistics ranged from 0.79 to 0.95, while calibration was evaluated in all studies. Due to the small size of the validation datasets, they were judged as having high ROB in the analysis domain. Models for hypothyroidism Of 66 models for hypothyroidism, two models were externally validated at least twice. In addition, there was another model which was not originally developed for patients with HNC, but validated in this domain. The Boomsma 2012 for hypothyroidism within two years after radiotherapy was externally validated in two studies. C-statistics ranged from 0.64 to 0.74, while only one study reported its calibration performance. Both validation studies were rated as having high ROB in the analysis domain. The Ronjom 2013 model for radiation-induced hypothyroidism was validated in three studies. C-statistics ranged from 0.65 to 0.69 and calibration plots were reported in only one study. Two validation studies were judged as having high and the other was rated as having unclear ROB in the analysis domain. The Cella 2012 model was originally developed to predict radiation-induced hypothyroidism in patients with Hodgkin's lymphoma. In two validation studies in patients with HNC, c-statistics ranged from 0.65 to 0.68, but calibration performance was not reported. One validation study was rated as having a high ROB and the other was rated as being unclear in the analysis domain. Models for temporal lobe injury Amongst six models for temporal lobe injury, two were externally validated at least twice. The OuYang 2023 model, using deep learning in patients with nasopharyngeal cancer, was validated in the same paper using two different cohorts. C-statistics ranged from 0.80 to 0.82, while calibration performance was assessed in both cohorts. Both validations were judged as having low ROB in all domains. The Wen 2021 model was developed to predict temporal lobe injury in newly diagnosed nasopharyngeal cancer patients. The model was validated by OuYang 2023 using two cohorts. C-statistics ranged from 0.77 to 0.79, while calibration performance was not reported. Both validations were judged as having unclear ROB in the analysis domain. Models for outcomes related to hoarseness, fatigue, nausea-vomiting, throat pain, aspiration No models were externally validated at least twice.

AUTHORS' CONCLUSIONS: Amongst 592 developed models, a limited number had adequate quality. Only one-fifth were externally validated, of which, only nine models at least twice. These nine models showed acceptable discriminative performance at external validation. However, their calibration performance was not always reported. Furthermore, most validation studies were judged as having high ROB, mainly due to problems in the analysis domain. In conclusion, this review shows the need for more external validation studies before the implementation of developed models in clinical practice and improvement of the quality of conducting and reporting of prediction model studies.

摘要

背景

放射治疗是头颈癌(HNC)治疗的主要手段,但可能会对周围正常组织产生各种副作用。为了在肿瘤控制和毒性预防之间达到最佳平衡,已有报道称正常组织并发症概率(NTCP)模型可预测HNC患者辐射诱发副作用的风险。然而,研究设计、实施和分析的质量(即偏倚风险(ROB))以及这些模型的预测性能仍有待评估。

目的

识别、描述和评估NTCP模型,以预测HNC患者辐射诱发副作用的风险。

检索方法

我们检索了从建库至2024年1月的Ovid MEDLINE、Embase和世界卫生组织国际临床试验注册平台。此外,我们还筛选了检索文章中引用的参考文献。

选择标准

两位综述作者独立纳入报告NTCP模型开发和外部验证的文章,以预测HNC患者任何类型的辐射诱发副作用。

数据收集与分析

一位综述作者根据预测模型研究系统评价的关键评价和数据提取清单从每篇文章中提取数据,并使用预测模型ROB评估工具评估其适用性和ROB,另一位综述作者仔细核实结果。对于与原始开发研究具有相同结局且至少经过两次外部验证的模型,我们对模型性能进行了定性分析。未应用GRADE系统,因为尚未为预后模型研究的综述建立该系统。

主要结果

在从143篇文章中开发的592个模型中,包括140,767例HNC患者,只有来自6篇文章的49个(8%)模型被判定具有低ROB且适用性问题较少。480个模型(81%)未进行外部验证。对于其余112个模型以及6个不符合本综述条件的其他模型,在41篇文章中的34,304例HNC患者中进行了152次外部验证。以下讨论至少经过两次外部验证的模型结果。口干模型 在275个口干模型中,有两个模型至少经过两次外部验证。Beetz 2012b模型用于放疗后6个月的口干,在两项研究中得到验证。C统计量范围为0.70至0.74。一项研究报告了校准性能。一项验证研究在所有领域均被评为低ROB,而另一项在分析领域被评为高ROB。Cavallo 2021模型用于鼻咽癌患者放疗期间的急性口干,在同一研究中使用两种不同类型的队列进行了外部验证。C统计量范围为0.68至0.73,两个队列均报告了校准图。由于未提供有关招募的详细信息,两项验证在参与者领域均被评为ROB不明确。吞咽困难模型 在86个吞咽困难模型中,有两个模型至少经过两次外部验证。Christianen 2012模型用于放疗后6个月的吞咽困难,在五项研究中得到验证。C统计量范围为0.66至0.75。所有研究均评估了校准性能,其中四项由于样本量小在分析领域被评为高ROB。Wopken 2014b模型用于放疗后6个月的管饲依赖,在三项外部验证研究中得到验证。C统计量范围为0.79至0.95,所有研究均评估了校准情况。由于验证数据集规模较小,它们在分析领域被判定为高ROB。甲状腺功能减退模型 在66个甲状腺功能减退模型中,有两个模型至少经过两次外部验证。此外,还有一个模型最初不是为HNC患者开发的,但在该领域得到了验证。Boomsma 2012模型用于放疗后两年内的甲状腺功能减退,在两项研究中进行了外部验证。C统计量范围为0.64至0.74,只有一项研究报告了其校准性能。两项验证研究在分析领域均被评为高ROB。Ronjom 2013模型用于辐射诱发的甲状腺功能减退,在三项研究中得到验证。C统计量范围为0.65至0.69,只有一项研究报告了校准图。两项验证研究在分析领域被判定为高ROB,另一项被评为不明确。Cella 2012模型最初是为预测霍奇金淋巴瘤患者辐射诱发的甲状腺功能减退而开发的。在两项针对HNC患者的验证研究中,c统计量范围为0.65至 .68,但未报告校准性能。一项验证研究在分析领域被评为高ROB,另一项被评为不明确。颞叶损伤模型 在6个颞叶损伤模型中,有两个至少经过两次外部验证。OuYang 2023模型在鼻咽癌患者中使用深度学习,在同一篇论文中使用两个不同队列进行了验证。C统计量范围为0.80至0.82,两个队列均评估了校准性能。两项验证在所有领域均被判定为低ROB。Wen 2021模型用于预测新诊断鼻咽癌患者的颞叶损伤。该模型由OuYang 2023使用两个队列进行了验证。C统计量范围为0.77至0.79,但未报告校准性能。两项验证在分析领域均被判定为ROB不明确。与声音嘶哑、疲劳、恶心呕吐、咽痛、误吸相关结局的模型 没有模型至少经过两次外部验证。

作者结论

在592个已开发的模型中,质量合格的数量有限。只有五分之一进行了外部验证,其中只有9个模型至少经过两次验证。这9个模型在外部验证中显示出可接受的判别性能。然而,它们的校准性能并非总是被报告。此外,大多数验证研究被判定为具有高ROB,主要是由于分析领域存在问题。总之,本综述表明在将已开发的模型应用于临床实践之前,需要更多的外部验证研究,并提高预测模型研究的实施和报告质量。

相似文献

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验