Tran Linh, Chi Lianhua, Bonti Alessio, Abdelrazek Mohamed, Chen Yi-Ping Phoebe
School of Info Technology, Deakin University, Burwood, Australia.
Department of Computer Science and Information Technology, La Trobe University, Bundoora, Australia.
JMIR Med Inform. 2021 Apr 1;9(4):e25000. doi: 10.2196/25000.
Cardiovascular disease (CVD) is the greatest health problem in Australia, which kills more people than any other disease and incurs enormous costs for the health care system. In this study, we present a benchmark comparison of various artificial intelligence (AI) architectures for predicting the mortality rate of patients with CVD using structured medical claims data. Compared with other research in the clinical literature, our models are more efficient because we use a smaller number of features, and this study could help health professionals accurately choose AI models to predict mortality among patients with CVD using only claims data before a clinic visit.
This study aims to support health clinicians in accurately predicting mortality among patients with CVD using only claims data before a clinic visit.
The data set was obtained from the Medicare Benefits Scheme and Pharmaceutical Benefits Scheme service information in the period between 2004 and 2014, released by the Department of Health Australia in 2016. It included 346,201 records, corresponding to 346,201 patients. A total of five AI algorithms, including four classical machine learning algorithms (logistic regression [LR], random forest [RF], extra trees [ET], and gradient boosting trees [GBT]) and a deep learning algorithm, which is a densely connected neural network (DNN), were developed and compared in this study. In addition, because of the minority of deceased patients in the data set, a separate experiment using the Synthetic Minority Oversampling Technique (SMOTE) was conducted to enrich the data.
Regarding model performance, in terms of discrimination, GBT and RF were the models with the highest area under the receiver operating characteristic curve (97.8% and 97.7%, respectively), followed by ET (96.8%) and LR (96.4%), whereas DNN was the least discriminative (95.3%). In terms of reliability, LR predictions were the least calibrated compared with the other four algorithms. In this study, despite increasing the training time, SMOTE was proven to further improve the model performance of LR, whereas other algorithms, especially GBT and DNN, worked well with class imbalanced data.
Compared with other research in the clinical literature involving AI models using claims data to predict patient health outcomes, our models are more efficient because we use a smaller number of features but still achieve high performance. This study could help health professionals accurately choose AI models to predict mortality among patients with CVD using only claims data before a clinic visit.
心血管疾病(CVD)是澳大利亚最严重的健康问题,其致死人数超过其他任何疾病,并给医疗保健系统带来巨大成本。在本研究中,我们对各种人工智能(AI)架构进行了基准比较,以使用结构化医疗理赔数据预测CVD患者的死亡率。与临床文献中的其他研究相比,我们的模型效率更高,因为我们使用的特征数量更少,并且本研究可以帮助医疗专业人员仅根据就诊前的理赔数据准确选择AI模型来预测CVD患者的死亡率。
本研究旨在支持临床医生仅根据就诊前的理赔数据准确预测CVD患者的死亡率。
数据集来自澳大利亚卫生部于2016年发布的2004年至2014年期间的医疗保险福利计划和药品福利计划服务信息。它包括346,201条记录,对应346,201名患者。本研究共开发并比较了五种AI算法,包括四种经典机器学习算法(逻辑回归[LR]、随机森林[RF]、极端随机树[ET]和梯度提升树[GBT])以及一种深度学习算法,即全连接神经网络(DNN)。此外,由于数据集中死亡患者较少,因此使用合成少数过采样技术(SMOTE)进行了单独实验以丰富数据。
在模型性能方面,在区分能力上,GBT和RF是接收器操作特征曲线下面积最高的模型(分别为97.8%和97.7%),其次是ET(96.8%)和LR(96.4%),而DNN的区分能力最差(95.3%)。在可靠性方面,与其他四种算法相比,LR预测的校准程度最低。在本研究中,尽管增加了训练时间,但事实证明SMOTE可进一步提高LR的模型性能,而其他算法,尤其是GBT和DNN,在类别不平衡数据上表现良好。
与临床文献中其他使用理赔数据预测患者健康结果的AI模型研究相比,我们的模型效率更高,因为我们使用的特征数量更少,但仍能实现高性能。本研究可以帮助医疗专业人员仅根据就诊前的理赔数据准确选择AI模型来预测CVD患者的死亡率。