Cheon Wonjoong, Han Mira, Jeong Seonghoon, Oh Eun Sang, Lee Sung Uk, Lee Se Byeong, Shin Dongho, Lim Young Kyung, Jeong Jong Hwi, Kim Haksoo, Kim Joo Young
Proton Therapy Center, National Cancer Center, Goyang-si 10408, Republic of Korea.
Biostatistics Collaboration Team, National Cancer Center, Goyang-si 10408, Republic of Korea.
Cancers (Basel). 2023 Jul 2;15(13):3463. doi: 10.3390/cancers15133463.
(1) In this study, we developed a deep learning (DL) model that can be used to predict late bladder toxicity. (2) We collected data obtained from 281 uterine cervical cancer patients who underwent definitive radiation therapy. The DL model was trained using 16 features, including patient, tumor, treatment, and dose parameters, and its performance was compared with that of a multivariable logistic regression model using the following metrics: accuracy, prediction, recall, F1-score, and area under the receiver operating characteristic curve (AUROC). In addition, permutation feature importance was calculated to interpret the DL model for each feature, and the lightweight DL model was designed to focus on the top five important features. (3) The DL model outperformed the multivariable logistic regression model on our dataset. It achieved an F1-score of 0.76 and an AUROC of 0.81, while the corresponding values for the multivariable logistic regression were 0.14 and 0.43, respectively. The DL model identified the doses for the most exposed 2 cc volume of the bladder (BD) as the most important feature, followed by BD and the ICRU bladder point. In the case of the lightweight DL model, the F-score and AUROC were 0.90 and 0.91, respectively. (4) The DL models exhibited superior performance in predicting late bladder toxicity compared with the statistical method. Through the interpretation of the model, it further emphasized its potential for improving patient outcomes and minimizing treatment-related complications with a high level of reliability.
(1) 在本研究中,我们开发了一种深度学习(DL)模型,可用于预测晚期膀胱毒性。(2) 我们收集了281例接受根治性放射治疗的子宫颈癌患者的数据。DL模型使用16个特征进行训练,包括患者、肿瘤、治疗和剂量参数,并使用以下指标将其性能与多变量逻辑回归模型的性能进行比较:准确率、预测率、召回率、F1分数和受试者操作特征曲线下面积(AUROC)。此外,计算排列特征重要性以解释每个特征的DL模型,并设计轻量级DL模型以关注最重要的五个特征。(3) 在我们的数据集中,DL模型的表现优于多变量逻辑回归模型。它的F1分数为0.76,AUROC为0.81,而多变量逻辑回归的相应值分别为0.14和0.43。DL模型将膀胱最暴露的2 cc体积(BD)的剂量确定为最重要的特征,其次是BD和ICRU膀胱点。对于轻量级DL模型,F分数和AUROC分别为0.90和0.91。(4) 与统计方法相比,DL模型在预测晚期膀胱毒性方面表现出卓越的性能。通过对模型的解释,它进一步强调了其在以高可靠性改善患者预后和最小化治疗相关并发症方面的潜力。