Suppr超能文献

比较监督式和半监督式机器学习方法在正常组织并发症概率(NTCP)建模中预测头颈癌患者并发症的情况。

Comparing supervised and semi-supervised machine learning approaches in NTCP modeling to predict complications in head and neck cancer patients.

作者信息

Spiero I, Schuit E, Wijers O B, Hoebers F J P, Langendijk J A, Leeuwenberg A M

机构信息

Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands.

Radiotherapeutic Institute Friesland, Leeuwarden, the Netherlands.

出版信息

Clin Transl Radiat Oncol. 2023 Sep 21;43:100677. doi: 10.1016/j.ctro.2023.100677. eCollection 2023 Nov.

Abstract

BACKGROUND AND PURPOSE

Head and neck cancer (HNC) patients treated with radiotherapy often suffer from radiation-induced toxicities. Normal Tissue Complication Probability (NTCP) modeling can be used to determine the probability to develop these toxicities based on patient, tumor, treatment and dose characteristics. Since the currently used NTCP models are developed using supervised methods that discard unlabeled patient data, we assessed whether the addition of unlabeled patient data by using semi-supervised modeling would gain predictive performance.

MATERIALS AND METHODS

The semi-supervised method of self-training was compared to supervised regression methods with and without prior multiple imputation by chained equation (MICE). The models were developed for the most common toxicity outcomes in HNC patients, xerostomia (dry mouth) and dysphagia (difficulty swallowing), measured at six months after treatment, in a development cohort of 750 HNC patients. The models were externally validated in a validation cohort of 395 HNC patients. Model performance was assessed by discrimination and calibration.

RESULTS

MICE and self-training did not improve performance in terms of discrimination or calibration at external validation compared to current regression models. In addition, the relative performance of the different models did not change upon a decrease in the amount of (labeled) data available for model development. Models using ridge regression outperformed the logistic models for the dysphagia outcome.

CONCLUSION

Since there was no apparent gain in the addition of unlabeled patient data by using the semi-supervised method of self-training or MICE, the supervised regression models would still be preferred in current NTCP modeling for HNC patients.

摘要

背景与目的

接受放射治疗的头颈癌(HNC)患者常遭受放射诱导的毒性反应。正常组织并发症概率(NTCP)建模可用于根据患者、肿瘤、治疗及剂量特征确定发生这些毒性反应的概率。由于目前使用的NTCP模型是采用丢弃未标记患者数据的监督方法开发的,我们评估了通过使用半监督建模添加未标记患者数据是否会提高预测性能。

材料与方法

将自训练的半监督方法与有或无链式方程多重填补(MICE)的监督回归方法进行比较。在一个由750名头颈癌患者组成的开发队列中,针对治疗后6个月测量的头颈癌患者最常见的毒性结局,即口干症(口干)和吞咽困难(吞咽困难),开发模型。在一个由395名头颈癌患者组成的验证队列中对模型进行外部验证。通过辨别力和校准评估模型性能。

结果

与当前回归模型相比,在外部验证中,MICE和自训练在辨别力或校准方面均未提高性能。此外,随着可用于模型开发的(标记)数据量减少,不同模型的相对性能并未改变。对于吞咽困难结局,使用岭回归的模型优于逻辑模型。

结论

由于使用自训练或MICE的半监督方法添加未标记患者数据没有明显益处,在当前针对头颈癌患者的NTCP建模中,监督回归模型仍是首选。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验