Kartal Elif, Balaban Mehmet Erdal
Informatics Department, İstanbul University, İstanbul, Turkey.
Turkish Community Services Foundation (TOVAK), İstanbul, Turkey.
Turk Gogus Kalp Damar Cerrahisi Derg. 2018 Jul 3;26(3):394-401. doi: 10.5606/tgkdc.dergisi.2018.15559. eCollection 2018 Jul.
The objective of this study was to predict the mortality risk of patients during or shortly after cardiac surgery by using machine learning techniques and their learning abilities from collected data.
The dataset was obtained from Acıbadem Maslak Hospital. Risk factors of the European System for Cardiac Operative Risk Evaluation (EuroSCORE) were used to predict mortality risk. First, Standard EuroSCORE scores of patients were calculated and risk groups were determined, because 30-day follow-up information of patients was not available in the dataset. Models were created with five different machine learning algorithms and two different datasets including age, serum creatinine, left ventricular dysfunction, and pulmonary hypertension were numeric in Dataset 1 and categorical in Dataset 2. Model performance evaluation was performed with 10-fold cross-validation.
Data analysis and performance evaluation were performed with R, RStudio and Shiny. C4.5 was selected as the best algorithm for risk prediction (accuracy= 0.989) in Dataset 1. This model indicated that pulmonary hypertension, recent myocardial infarct, surgery on thoracic aorta are the primary three risk factors that affect the mortality risk of patients during or shortly after cardiac surgery. Also, this model is used to develop a dynamic web application which is also accessible from mobile devices (https://elifkartal.shinyapps.io/euSCR/).
The C4.5 decision tree model was identified as having the highest performance in Dataset 1 in predicting the mortality risk of patients. Using the numerical values of the risk factors can be useful in increasing the performance of machine learning models. Development of hospital-specific local assessment systems using hospital data, such as the application in this study, would be beneficial for both patients and doctors.
本研究的目的是通过使用机器学习技术及其从收集的数据中学习的能力,预测心脏手术期间或术后不久患者的死亡风险。
数据集来自阿西巴德姆马斯拉克医院。使用欧洲心脏手术风险评估系统(EuroSCORE)的风险因素来预测死亡风险。首先,计算患者的标准EuroSCORE评分并确定风险组,因为数据集中没有患者的30天随访信息。使用五种不同的机器学习算法和两个不同的数据集创建模型,数据集1中的年龄、血清肌酐、左心室功能障碍和肺动脉高压为数值型,数据集2中的为分类型。模型性能评估采用10折交叉验证。
使用R、RStudio和Shiny进行数据分析和性能评估。在数据集1中,C4.5被选为风险预测的最佳算法(准确率=0.989)。该模型表明,肺动脉高压、近期心肌梗死、胸主动脉手术是影响心脏手术期间或术后不久患者死亡风险的主要三个风险因素。此外,该模型还用于开发一个动态网络应用程序,该应用程序也可从移动设备访问(https://elifkartal.shinyapps.io/euSCR/)。
在预测患者死亡风险方面,C4.5决策树模型在数据集1中表现出最高性能。使用风险因素的数值有助于提高机器学习模型的性能。利用医院数据开发特定于医院的局部评估系统,如本研究中的应用,对患者和医生都将是有益的。