Department of Radiation Oncology, Affiliated Hospital of Xiangnan University, Chenzhou Hunan 423000.
Key Laboratory of Medical Imaging and Artificial Intelligence of Hunan Province, Chenzhou Hunan 423000.
Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2022 Aug 28;47(8):1065-1074. doi: 10.11817/j.issn.1672-7347.2022.220353.
Radiation therapy is a main method for female pelvic malignancies, which can cause some adverse reactions, such as radiation proctitis (RP). The incidence of RP is highly positively correlated with radiation dose. There is an urgent need for a scientific method to accurately predict the occurrence of RP to help doctors make clinical decisions. In this study, based on the clinical data of female pelvic tumor patients and dosimetric parameters of radiotherapy, the random forest method was used to screen the hub features related to the occurrence of RP, and then a machine learning algorithm was used to construct a risk prediction model for the occurrence of RP, in order to provide technical support and theoretical basis for the prediction and prevention of RP.
A total of 100 female patients with pelvic tumors, who received static three-dimensional conformal intensity-modulated radiation therapy in the Department of Radiation Oncology of the Affiliated Hospital of Xiangnan University from January 2019 to December 2020, were retrospectively collected, and their clinically relevant data and radiotherapy planning system data were collected. During radiotherapy and 18 months after radiotherapy, 35 cases developed RP (RP group), and the remaining 65 cases had no RP (non-RP group). The clinical and dosimetric characteristics of patients were ranked by the importance of random forest algorithm, and the independent prognostic characteristics associated with the occurrence of RP were selected for machine learning modeling. A total of 6 machine learning algorithms including support vector machines, random forests, logistic regression, lightweight gradient boosting machines, Gaussian naïve Bayes, and adaptive enhancement were used to build models. The performance of the model was evaluated by the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score. Finally, the random forest model was determined as the prediction model, and the calibration curve and decision curve of the prediction model were drawn to evaluate the accuracy and clinical benefit of the model.
The parameters for random forest prediction model in the training set were as follow: AUC, 1.000, accuracy, 0.988, sensitivity, 1.000, specificity, 1.000, positive predictive value, 1.000, negative predictive value, 0.981, and F1 score, 1.000. In validation set, AUC was 0.713, accuracy was 0.640, sensitivity was 0.618, specificity was 0.822, positive predictive value was 0.500, negative predictive value was 0.656, and F1 score was 0.440. Random forest showed high predictive performance. Moreover, the Brief of the calibration curve for the prediction model was 0.178, the prediction accuracy was high, and the decision curve showed that the prediction model could benefit clinically.
Based on the clinical and dosimetric parameters for the female pelvic tumor patients, the prediction model of radiation proctitis constructed by random forest algorithm has high predictive ability and strong clinical usability.
放射治疗是女性盆腔恶性肿瘤的主要方法,可引起一些不良反应,如放射性直肠炎(RP)。RP 的发生率与放射剂量高度正相关。迫切需要一种科学的方法来准确预测 RP 的发生,以帮助医生做出临床决策。本研究基于女性盆腔肿瘤患者的临床数据和放射治疗的剂量学参数,采用随机森林法筛选与 RP 发生相关的枢纽特征,并采用机器学习算法构建 RP 发生风险预测模型,为 RP 的预测和预防提供技术支持和理论依据。
回顾性收集 2019 年 1 月至 2020 年 12 月在湘南大学附属医院放射肿瘤科接受静态三维适形调强放疗的 100 例女性盆腔肿瘤患者的临床相关数据和放射治疗计划系统数据。在放疗期间和放疗后 18 个月,35 例患者发生 RP(RP 组),其余 65 例患者未发生 RP(非 RP 组)。采用随机森林算法对患者的临床和剂量学特征进行重要性排序,并选择与 RP 发生相关的独立预后特征进行机器学习建模。采用支持向量机、随机森林、逻辑回归、轻量级梯度提升机、高斯朴素贝叶斯、自适应增强等 6 种机器学习算法构建模型。通过受试者工作特征曲线下面积(AUC)、准确率、敏感度、特异度、阳性预测值、阴性预测值和 F1 评分评估模型性能。最后,确定随机森林模型为预测模型,绘制预测模型的校准曲线和决策曲线,评估模型的准确性和临床获益。
训练集中随机森林预测模型的参数为:AUC 为 1.000,准确率为 0.988,敏感度为 1.000,特异度为 1.000,阳性预测值为 1.000,阴性预测值为 0.981,F1 得分为 1.000。在验证集中,AUC 为 0.713,准确率为 0.640,敏感度为 0.618,特异度为 0.822,阳性预测值为 0.500,阴性预测值为 0.656,F1 得分为 0.440。随机森林表现出较高的预测性能。此外,预测模型的简要校准曲线为 0.178,预测准确率较高,决策曲线显示预测模型具有临床获益。
基于女性盆腔肿瘤患者的临床和剂量学参数,采用随机森林算法构建的放射性直肠炎预测模型具有较高的预测能力和较强的临床实用性。