Dept. of Industrial and Systems Engineering, University of Oklahoma, Norman, OK 73071, USA.
Dept. of Aerospace and Mechanical Engineering, University of Oklahoma, Norman, OK 73071, USA.
Artif Intell Med. 2022 Jun;128:102289. doi: 10.1016/j.artmed.2022.102289. Epub 2022 Mar 29.
Heart disease is one of the significant challenges in today's world and one of the leading causes of many deaths worldwide. Recent advancement of machine learning (ML) application demonstrates that using electrocardiogram (ECG) and patients' data, detecting heart disease during the early stage is feasible. However, both ECG and patients' data are often imbalanced, which ultimately raises a challenge for the traditional ML to perform unbiasedly. Over the years, several data level and algorithm level solutions have been exposed by many researchers and practitioners. To provide a broader view of the existing literature, this study takes a systematic literature review (SLR) approach to uncover the challenges associated with imbalanced data in heart diseases predictions. Before that, we conducted a meta-analysis using 451 reference literature acquired from the reputed journals between 2012 and November 15, 2021. For in-depth analysis, 49 referenced literature has been considered and studied, taking into account the following factors: heart disease type, algorithms, applications, and solutions. Our SLR study revealed that the current approaches encounter various open problems/issues when dealing with imbalanced data, eventually hindering their practical applicability and functionality. In the diagnosis of heart disease, machine learning approaches help to improve data-driven decision-making. A metadata analysis of 451 articles and content analysis of 49 selected articles of heart disease diagnosis. Researchers primarily concentrated on enhancing the performance of the models while disregarding other issues such as the interpretability and explainability of Machine learning algorithms.
心脏病是当今世界面临的重大挑战之一,也是全球许多人死亡的主要原因之一。最近机器学习(ML)应用的进步表明,使用心电图(ECG)和患者数据,在早期检测心脏病是可行的。然而,心电图和患者数据通常是不平衡的,这给传统的机器学习带来了不公平的挑战。多年来,许多研究人员和从业者已经提出了一些数据级别和算法级别的解决方案。为了更全面地了解现有文献,本研究采用系统文献综述(SLR)方法,揭示与心脏病预测中不平衡数据相关的挑战。在此之前,我们使用 2012 年至 2021 年 11 月 15 日期间从知名期刊中获取的 451 篇参考文献进行了荟萃分析。为了进行深入分析,考虑并研究了 49 篇参考文献,考虑了以下因素:心脏病类型、算法、应用和解决方案。我们的 SLR 研究表明,当前方法在处理不平衡数据时遇到了各种开放性问题/问题,最终阻碍了它们的实际适用性和功能。在心脏病诊断中,机器学习方法有助于改善数据驱动的决策。对 451 篇文章进行元数据分析,并对 49 篇选定的心脏病诊断文章进行内容分析。研究人员主要集中在提高模型的性能上,而忽略了机器学习算法的可解释性和可解释性等其他问题。