Spiero I, Schuit E, Wijers O B, Hoebers F J P, Langendijk J A, Leeuwenberg A M
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands.
Radiotherapeutic Institute Friesland, Leeuwarden, the Netherlands.
Clin Transl Radiat Oncol. 2023 Sep 21;43:100677. doi: 10.1016/j.ctro.2023.100677. eCollection 2023 Nov.
Head and neck cancer (HNC) patients treated with radiotherapy often suffer from radiation-induced toxicities. Normal Tissue Complication Probability (NTCP) modeling can be used to determine the probability to develop these toxicities based on patient, tumor, treatment and dose characteristics. Since the currently used NTCP models are developed using supervised methods that discard unlabeled patient data, we assessed whether the addition of unlabeled patient data by using semi-supervised modeling would gain predictive performance.
The semi-supervised method of self-training was compared to supervised regression methods with and without prior multiple imputation by chained equation (MICE). The models were developed for the most common toxicity outcomes in HNC patients, xerostomia (dry mouth) and dysphagia (difficulty swallowing), measured at six months after treatment, in a development cohort of 750 HNC patients. The models were externally validated in a validation cohort of 395 HNC patients. Model performance was assessed by discrimination and calibration.
MICE and self-training did not improve performance in terms of discrimination or calibration at external validation compared to current regression models. In addition, the relative performance of the different models did not change upon a decrease in the amount of (labeled) data available for model development. Models using ridge regression outperformed the logistic models for the dysphagia outcome.
Since there was no apparent gain in the addition of unlabeled patient data by using the semi-supervised method of self-training or MICE, the supervised regression models would still be preferred in current NTCP modeling for HNC patients.
接受放射治疗的头颈癌(HNC)患者常遭受放射诱导的毒性反应。正常组织并发症概率(NTCP)建模可用于根据患者、肿瘤、治疗及剂量特征确定发生这些毒性反应的概率。由于目前使用的NTCP模型是采用丢弃未标记患者数据的监督方法开发的,我们评估了通过使用半监督建模添加未标记患者数据是否会提高预测性能。
将自训练的半监督方法与有或无链式方程多重填补(MICE)的监督回归方法进行比较。在一个由750名头颈癌患者组成的开发队列中,针对治疗后6个月测量的头颈癌患者最常见的毒性结局,即口干症(口干)和吞咽困难(吞咽困难),开发模型。在一个由395名头颈癌患者组成的验证队列中对模型进行外部验证。通过辨别力和校准评估模型性能。
与当前回归模型相比,在外部验证中,MICE和自训练在辨别力或校准方面均未提高性能。此外,随着可用于模型开发的(标记)数据量减少,不同模型的相对性能并未改变。对于吞咽困难结局,使用岭回归的模型优于逻辑模型。
由于使用自训练或MICE的半监督方法添加未标记患者数据没有明显益处,在当前针对头颈癌患者的NTCP建模中,监督回归模型仍是首选。