Zou Zhi-Min, An Tian-Zhi, Li Jun-Xiang, Zhang Zi-Shu, Xiao Yu-Dong, Liu Jun
Department of Radiology, the Second Xiangya Hospital of Central South University, Changsha, 410011, China.
Department of Radiology, Hunan Children's Hospital, Changsha, 410007, China.
J Cancer. 2021 Oct 17;12(23):7079-7087. doi: 10.7150/jca.63370. eCollection 2021.
To develop and validate a random forest (RF) based predictive model of early refractoriness to transarterial chemoembolization (TACE) in patients with unresectable hepatocellular carcinoma (HCC). A total of 227 patients with unresectable HCC who initially treated with TACE from three independent institutions were retrospectively included. Following a random split, 158 patients (70%) were assigned to a training cohort and the remaining 69 patients (30%) were assigned to a validation cohort. The process of variables selection was based on the importance variable scores generated by RF algorithm. A RF predictive model incorporating the selected variables was developed, and five-fold cross-validation was performed. The discrimination and calibration of the RF model were measured by a receiver operating characteristic (ROC) curve and the Hosmer-Lemeshow test. The potential variables selected by RF algorithm for developing predictive model of early TACE refractoriness included patients' age, number of tumors, tumor distribution, platelet count (PLT), and neutrophil-to-lymphocyte ratio (NLR). The results showed that the RF predictive model had good discrimination ability, with an area under curve (AUC) of 0.863 in the training cohort and 0.767 in the validation cohort, respectively. In Hosmer-Lemeshow test, the RF model had a satisfactory calibration with P values of 0.538 and 0.068 in training cohort and validation cohort, respectively. The RF algorithm-based model has a good predictive performance in the prediction of early TACE refractoriness, which may easily be deployed in clinical routine and help to determine the optimal patient of care.
建立并验证基于随机森林(RF)的不可切除肝细胞癌(HCC)患者经动脉化疗栓塞术(TACE)早期难治性预测模型。回顾性纳入来自三个独立机构的227例最初接受TACE治疗的不可切除HCC患者。随机划分后,158例患者(70%)被分配到训练队列,其余69例患者(30%)被分配到验证队列。变量选择过程基于RF算法生成的重要变量得分。建立了包含所选变量的RF预测模型,并进行了五折交叉验证。通过受试者工作特征(ROC)曲线和Hosmer-Lemeshow检验来衡量RF模型的辨别力和校准度。RF算法为建立早期TACE难治性预测模型而选择的潜在变量包括患者年龄、肿瘤数量、肿瘤分布、血小板计数(PLT)和中性粒细胞与淋巴细胞比值(NLR)。结果显示,RF预测模型具有良好的辨别能力,训练队列和验证队列的曲线下面积(AUC)分别为0.863和0.767。在Hosmer-Lemeshow检验中,RF模型校准度良好,训练队列和验证队列的P值分别为0.538和0.068。基于RF算法的模型在预测TACE早期难治性方面具有良好的预测性能,可轻松应用于临床常规,有助于确定最佳治疗患者。