Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh.
Comput Biol Chem. 2022 Jun;98:107672. doi: 10.1016/j.compbiolchem.2022.107672. Epub 2022 Mar 31.
In today's world, a massive amount of data is available in almost every sector. This data has become an asset as we can use this enormous amount of data to find information. Mainly health care industry contains many data consisting of patient and disease-related information. By using the machine learning technique, we can look for hidden data patterns to predict various diseases. Recently CVDs, or cardiovascular disease, have become a leading cause of death around the world. The number of death due to CVDs is frightening. That is why many researchers are trying their best to design a predictive model that can save many lives using the data mining model. In this research, some fusion models have been constructed to diagnose CVDs along with its severity. Machine learning(ML) algorithms like artificial neural network, SVM, logistic regression, decision tree, random forest, and AdaBoost have been applied to the heart disease dataset to predict disease. Randomoversampler only for multi-class classification to make the imbalanced dataset balanced. To improve the performance of classification, a weighted score fusion approach was taken. At first, the models were trained. After training, two algorithms' decision was combined using a weighted sum rule. A total of three fusion models have been developed from the six ML algorithms. The results were promising in the performance parameter. The proposed approach has been experimented with different test training ratios for binary and multiclass classification problems, and for both of them, the fusion models performed well. The highest accuracy for multiclass classification was found as 75%, and it was 95% for binary.
在当今世界,几乎每个领域都有大量的数据。我们可以利用这些海量数据来查找信息,因此这些数据已成为一种资产。主要医疗保健行业包含许多包含患者和疾病相关信息的数据。通过使用机器学习技术,我们可以寻找隐藏的数据模式来预测各种疾病。最近,心血管疾病 (CVD) 已成为世界各地的主要死亡原因。由于 CVD 而死亡的人数令人恐惧。这就是为什么许多研究人员正在尽力设计一种预测模型,通过数据挖掘模型来挽救许多生命。在这项研究中,构建了一些融合模型来诊断 CVD 及其严重程度。已经将机器学习 (ML) 算法(如人工神经网络、SVM、逻辑回归、决策树、随机森林和 AdaBoost)应用于心脏病数据集以预测疾病。仅用于多类分类的随机过采样器使不平衡数据集平衡。为了提高分类性能,采用了加权分数融合方法。首先,对模型进行训练。训练后,使用加权和规则组合两种算法的决策。从六种 ML 算法中总共开发了三种融合模型。在性能参数方面,结果很有希望。已经使用不同的测试训练比例对二进制和多类分类问题进行了该方法的实验,对于这两种情况,融合模型的性能都很好。多类分类的最高精度为 75%,二进制为 95%。