Xing Fei, Luo Rong, Liu Ming, Zhou Zongke, Xiang Zhou, Duan Xin
Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, Chengdu, China.
Front Med (Lausanne). 2022 May 11;9:829977. doi: 10.3389/fmed.2022.829977. eCollection 2022.
Post-operative mortality risk assessment for geriatric patients with hip fractures (HF) is a challenge for clinicians. Early identification of geriatric HF patients with a high risk of post-operative death is helpful for early intervention and improving clinical prognosis. However, a single significant risk factor of post-operative death cannot accurately predict the prognosis of geriatric HF patients. Therefore, our study aims to utilize a machine learning approach, random forest algorithm, to fabricate a prediction model for post-operative death of geriatric HF patients.
This retrospective study enrolled consecutive geriatric HF patients who underwent treatment for surgery. The study cohort was divided into training and testing datasets at a 70:30 ratio. The random forest algorithm selected or excluded variables according to the feature importance. Least absolute shrinkage and selection operator (Lasso) was utilized to compare feature selection results of random forest. The confirmed variables were used to create a simplified model instead of a full model with all variables. The prediction model was then verified in the training dataset and testing dataset. Additionally, a prediction model constructed by logistic regression was used as a control to evaluate the efficiency of the new prediction model.
Feature selection by random forest algorithm and Lasso regression demonstrated that seven variables, including age, time from injury to surgery, chronic obstructive pulmonary disease (COPD), albumin, hemoglobin, history of malignancy, and perioperative blood transfusion, could be used to predict the 1-year post-operative mortality. The area under the curve (AUC) of the random forest algorithm-based prediction model in training and testing datasets were 1.000, and 0.813, respectively. While the prediction tool constructed by logistic regression in training and testing datasets were 0.895, and 0.797, respectively.
Compared with logistic regression, the random forest algorithm-based prediction model exhibits better predictive ability for geriatric HF patients with a high risk of death within post-operative 1 year.
老年髋部骨折(HF)患者术后死亡风险评估对临床医生而言是一项挑战。早期识别术后死亡风险高的老年HF患者有助于早期干预并改善临床预后。然而,单一的术后死亡显著风险因素无法准确预测老年HF患者的预后。因此,我们的研究旨在利用机器学习方法——随机森林算法,构建老年HF患者术后死亡预测模型。
这项回顾性研究纳入了连续接受手术治疗的老年HF患者。研究队列按70:30的比例分为训练集和测试集。随机森林算法根据特征重要性选择或排除变量。使用最小绝对收缩和选择算子(Lasso)比较随机森林的特征选择结果。使用确认的变量创建简化模型而非包含所有变量的完整模型。然后在训练集和测试集中验证预测模型。此外,将逻辑回归构建的预测模型作为对照,以评估新预测模型的效率。
随机森林算法和Lasso回归的特征选择表明,年龄、受伤至手术时间、慢性阻塞性肺疾病(COPD)、白蛋白、血红蛋白、恶性肿瘤病史和围手术期输血这七个变量可用于预测术后1年死亡率。基于随机森林算法的预测模型在训练集和测试集的曲线下面积(AUC)分别为1.000和0.813。而逻辑回归构建的预测工具在训练集和测试集分别为0.895和0.797。
与逻辑回归相比,基于随机森林算法的预测模型对术后1年内死亡风险高的老年HF患者具有更好的预测能力。