Department of Computer Science, University of Ghana, Legon, Accra, Ghana.
Comput Intell Neurosci. 2021 Dec 23;2021:3152618. doi: 10.1155/2021/3152618. eCollection 2021.
Heart diseases are a leading cause of death worldwide, and they have sparked a lot of interest in the scientific community. Because of the high number of impulsive deaths associated with it, early detection is critical. This study proposes a boosting Support Vector Machine (SVM) technique as the backbone of computer-aided diagnostic tools for more accurately forecasting heart disease risk levels. The datasets which contain 13 attributes such as gender, age, blood pressure, and chest pain are taken from the Cleveland clinic. In total, there were 303 records with 6 tuples having missing values. To clean the data, we deleted the 6 missing records through the listwise technique. The size of data, and the fact that it is a purely random subset, made this approach have no significant effect for the experiment because there were no biases. Salient features are selected using the boosting technique to speed up and improve accuracies. Using the train/test split approach, the data is then partitioned into training and testing. SVM is then used to train and test the data. The C parameter is set at 0.05 and the linear kernel function is used. Logistic regression, Nave Bayes, decision trees, Multilayer Perceptron, and random forest were used to compare the results. The proposed boosting SVM performed exceptionally well, making it a better tool than the existing techniques.
心脏病是全球范围内的主要死因,引起了科学界的极大关注。由于与之相关的大量冲动性死亡,早期检测至关重要。本研究提出了一种基于提升支持向量机(SVM)技术的计算机辅助诊断工具,以更准确地预测心脏病风险水平。该数据集包含 13 个属性,如性别、年龄、血压和胸痛,取自克利夫兰诊所。总共有 303 条记录,其中 6 个元组存在缺失值。为了清理数据,我们通过列表技术删除了 6 条缺失记录。由于数据量很大,而且是一个纯粹的随机子集,这种方法对实验没有显著影响,因为没有偏差。使用提升技术选择显著特征,以提高速度和准确性。然后使用 train/test 分割方法将数据分割为训练集和测试集。然后使用 SVM 对数据进行训练和测试。C 参数设置为 0.05,使用线性核函数。逻辑回归、朴素贝叶斯、决策树、多层感知机和随机森林被用来比较结果。提出的提升 SVM 表现出色,使其成为比现有技术更好的工具。