Zhou Ren, Li Zhaolong, Liu Jian, Qian Dewei, Meng Xiangdong, Guan Lichun, Sun Xinxin, Li Haiqing, Yu Min
State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Shanghai Institute of Hematology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Department of Cardiovascular Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Front Cardiovasc Med. 2024 Feb 29;11:1344170. doi: 10.3389/fcvm.2024.1344170. eCollection 2024.
Our study aimed to develop machine learning algorithms capable of predicting red blood cell (RBC) transfusion during valve replacement surgery based on a preoperative dataset of the non-anemic cohort.
A total of 423 patients who underwent valvular replacement surgery from January 2015 to December 2020 were enrolled. A comprehensive database that incorporated demographic characteristics, clinical conditions, and results of preoperative biochemistry tests was used for establishing the models. A range of machine learning algorithms were employed, including decision tree, random forest, extreme gradient boosting (XGBoost), categorical boosting (CatBoost), support vector classifier and logistic regression (LR). Subsequently, the area under the receiver operating characteristic curve (AUC), accuracy, recall, precision, and F1 score were used to determine the predictive capability of the algorithms. Furthermore, we utilized SHapley Additive exPlanation (SHAP) values to explain the optimal prediction model.
The enrolled patients were randomly divided into training set and testing set according to the 8:2 ratio. There were 16 important features identified by Sequential Backward Selection for model establishment. The top 5 most influential features in the RF importance matrix plot were hematocrit, hemoglobin, ALT, fibrinogen, and ferritin. The optimal prediction model was CatBoost algorithm, exhibiting the highest AUC (0.752, 95% CI: 0.662-0.780), which also got relatively high F1 score (0.695). The CatBoost algorithm also showed superior performance over the LR model with the AUC (0.666, 95% CI: 0.534-0.697). The SHAP summary plot and the SHAP dependence plot were used to visually illustrate the positive or negative effects of the selected features attributed to the CatBoost model.
This study established a series of prediction models to enhance risk assessment of intraoperative RBC transfusion during valve replacement in no-anemic patients. The identified important predictors may provide effective preoperative interventions.
我们的研究旨在基于非贫血队列的术前数据集开发能够预测瓣膜置换手术期间红细胞(RBC)输血情况的机器学习算法。
纳入了2015年1月至2020年12月期间接受瓣膜置换手术的423例患者。使用一个综合数据库,该数据库包含人口统计学特征、临床状况和术前生化测试结果,用于建立模型。采用了一系列机器学习算法,包括决策树、随机森林、极端梯度提升(XGBoost)、分类提升(CatBoost)、支持向量分类器和逻辑回归(LR)。随后,使用受试者工作特征曲线(AUC)下面积、准确率、召回率、精确率和F1分数来确定算法的预测能力。此外,我们利用SHapley值相加解释(SHAP)值来解释最优预测模型。
纳入的患者按照8:2的比例随机分为训练集和测试集。通过逐步向后选择确定了16个用于模型建立的重要特征。随机森林重要性矩阵图中最具影响力的前5个特征是血细胞比容、血红蛋白、谷丙转氨酶、纤维蛋白原和铁蛋白。最优预测模型是CatBoost算法,其AUC最高(0.752,95%CI:0.662 - 0.780),F1分数也相对较高(0.695)。与逻辑回归模型相比,CatBoost算法在AUC方面也表现更优(逻辑回归模型AUC为0.666,95%CI:0.534 - 0.697)。使用SHAP总结图和SHAP依赖图直观地说明了所选特征对CatBoost模型的正向或负向影响。
本研究建立了一系列预测模型,以加强对非贫血患者瓣膜置换术中红细胞输血风险的评估。所确定的重要预测因素可能提供有效的术前干预措施。