基于评分和相关系数的特征选择在使用机器学习算法预测心力衰竭诊断中的应用。

Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms.

机构信息

Department of Computer Science & Information Technology, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, India.

Information Systems Department, Prince Sultan University, Riyadh, Saudi Arabia.

出版信息

Comput Math Methods Med. 2021 Dec 20;2021:8500314. doi: 10.1155/2021/8500314. eCollection 2021.

DOI:10.1155/2021/8500314

PMID:34966445

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8712170/

Abstract

Cardiovascular disease (CVD) is one of the most common causes of death that kills approximately 17 million people annually. The main reasons behind CVD are myocardial infarction and the failure of the heart to pump blood normally. Doctors could diagnose heart failure (HF) through electronic medical records on the basis of patient's symptoms and clinical laboratory investigations. However, accurate diagnosis of HF requires medical resources and expert practitioners that are not always available, thus making the diagnosing challengeable. Therefore, predicting the patients' condition by using machine learning algorithms is a necessity to save time and efforts. This paper proposed a machine-learning-based approach that distinguishes the most important correlated features amongst patients' electronic clinical records. The SelectKBest function was applied with chi-squared statistical method to determine the most important features, and then feature engineering method has been applied to create new features correlated strongly in order to train machine learning models and obtain promising results. Optimised hyperparameter classification algorithms SVM, KNN, Decision Tree, Random Forest, and Logistic Regression were used to train two different datasets. The first dataset, called Cleveland, consisted of 303 records. The second dataset, which was used for predicting HF, consisted of 299 records. Experimental results showed that the Random Forest algorithm achieved accuracy, precision, recall, and F1 scores of 95%, 97.62%, 95.35%, and 96.47%, respectively, during the test phase for the second dataset. The same algorithm achieved accuracy scores of 100% for the first dataset and 97.68% for the second dataset, while 100% precision, recall, and F1 scores were reached for both datasets.

摘要

心血管疾病（CVD）是导致每年约 1700 万人死亡的最常见死因之一。CVD 的主要原因是心肌梗死和心脏不能正常泵血。医生可以根据患者的症状和临床实验室检查结果从电子病历中诊断心力衰竭（HF）。然而，HF 的准确诊断需要医疗资源和专家医生，这些资源并不总是可用的，因此诊断具有挑战性。因此，使用机器学习算法预测患者的病情是必要的，可以节省时间和精力。本文提出了一种基于机器学习的方法，可以区分患者电子临床记录中的最重要相关特征。应用 SelectKBest 函数和卡方统计方法来确定最重要的特征，然后应用特征工程方法创建与重要特征强相关的新特征，以便训练机器学习模型并获得有前途的结果。优化的超参数分类算法 SVM、KNN、决策树、随机森林和逻辑回归用于训练两个不同的数据集。第一个数据集称为克利夫兰，包含 303 条记录。第二个数据集用于预测 HF，包含 299 条记录。实验结果表明，在第二个数据集的测试阶段，随机森林算法的准确率、精度、召回率和 F1 分数分别为 95%、97.62%、95.35%和 96.47%。同一算法在第一个数据集的准确率为 100%，在第二个数据集的准确率为 97.68%，而两个数据集的精度、召回率和 F1 分数均达到 100%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb74/8712170/7b896587c55c/CMMM2021-8500314.001.jpg

相似文献

Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms.

Comput Math Methods Med. 2021 Dec 20;2021:8500314. doi: 10.1155/2021/8500314. eCollection 2021.

Machine learning can predict survival of patients with heart failure from serum creatinine and ejection fraction alone.

BMC Med Inform Decis Mak. 2020 Feb 3;20(1):16. doi: 10.1186/s12911-020-1023-5.

Efficient Prediction of Missed Clinical Appointment Using Machine Learning.

Comput Math Methods Med. 2021 Oct 22;2021:2376391. doi: 10.1155/2021/2376391. eCollection 2021.

A proposed technique for predicting heart disease using machine learning algorithms and an explainable AI method.

Sci Rep. 2024 Oct 7;14(1):23277. doi: 10.1038/s41598-024-74656-2.

Predicting Chronic Kidney Disease Using Hybrid Machine Learning Based on Apache Spark.

Comput Intell Neurosci. 2022 Feb 23;2022:9898831. doi: 10.1155/2022/9898831. eCollection 2022.

Feature Selection and Classification of Clinical Datasets Using Bioinspired Algorithms and Super Learner.

Comput Math Methods Med. 2021 May 17;2021:6662420. doi: 10.1155/2021/6662420. eCollection 2021.

Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease.

Comput Intell Neurosci. 2023 Mar 14;2023:9266889. doi: 10.1155/2023/9266889. eCollection 2023.

Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.

Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.

A clinical text classification paradigm using weak supervision and deep representation.

BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.

Prediction of heart failure patients with distinct left ventricular ejection fraction levels using circadian ECG features and machine learning.

PLoS One. 2024 May 13;19(5):e0302639. doi: 10.1371/journal.pone.0302639. eCollection 2024.

引用本文的文献

Deep transfer learning and attention based P2.5 forecasting in Delhi using a decade of winter season data.

Sci Rep. 2025 Aug 28;15(1):31787. doi: 10.1038/s41598-025-16664-4.

Individualized functional brain mapping machine learning prediction of symptom-change resulting from selective kappa-opioid antagonism in an anhedonic sample from a Fast-Fail trial.

J Mood Anxiety Disord. 2025 May 9;11:100126. doi: 10.1016/j.xjmad.2025.100126. eCollection 2025 Sep.

Chaotic gradient based optimization with fuzzy temporal optimized CNN for heart failure prediction.

Sci Rep. 2025 Jan 31;15(1):3867. doi: 10.1038/s41598-025-88277-w.

Predictive Analytics in Heart Failure Risk, Readmission, and Mortality Prediction: A Review.

Cureus. 2024 Nov 17;16(11):e73876. doi: 10.7759/cureus.73876. eCollection 2024 Nov.

Cardiovascular risk factors and development of nomograms in an Italian cohort of patients with suspected coronary artery disease undergoing SPECT or PET stress myocardial perfusion imaging.

Front Nucl Med. 2024 Feb 14;4:1232135. doi: 10.3389/fnume.2024.1232135. eCollection 2024.

Classifying breast cancer subtypes on multi-omics data via sparse canonical correlation analysis and deep learning.

BMC Bioinformatics. 2024 Mar 27;25(1):132. doi: 10.1186/s12859-024-05749-y.

Feature selection and association rule learning identify risk factors of malnutrition among Ethiopian schoolchildren.

Front Epidemiol. 2023 Jul 6;3:1150619. doi: 10.3389/fepid.2023.1150619. eCollection 2023.

Machine learning analyses reveal circadian clock features predictive of anxiety among UK biobank participants.

Sci Rep. 2023 Dec 15;13(1):22304. doi: 10.1038/s41598-023-49644-7.

Predicting of diabetic retinopathy development stages of fundus images using deep learning based on combined features.

PLoS One. 2023 Oct 20;18(10):e0289555. doi: 10.1371/journal.pone.0289555. eCollection 2023.

Age-Specific Cardiovascular Risk Factors for Major Adverse Cardiac Events in Patients Undergoing Myocardial Perfusion Imaging.

J Cardiovasc Dev Dis. 2023 Sep 13;10(9):395. doi: 10.3390/jcdd10090395.

本文引用的文献

Machine learning can predict survival of patients with heart failure from serum creatinine and ejection fraction alone.

BMC Med Inform Decis Mak. 2020 Feb 3;20(1):16. doi: 10.1186/s12911-020-1023-5.

Improving risk prediction in heart failure using machine learning.

Eur J Heart Fail. 2020 Jan;22(1):139-147. doi: 10.1002/ejhf.1628. Epub 2019 Nov 12.

Machine learning for prediction of sudden cardiac death in heart failure patients with low left ventricular ejection fraction: study protocol for a retroprospective multicentre registry in China.

BMJ Open. 2019 May 16;9(5):e023724. doi: 10.1136/bmjopen-2018-023724.

Clinical profiles in acute heart failure: an urgent need for a new approach.

ESC Heart Fail. 2019 Jun;6(3):464-474. doi: 10.1002/ehf2.12439. Epub 2019 Apr 25.

A systematic review of clinical prediction rules for the diagnosis of chronic heart failure.

ESC Heart Fail. 2019 Jun;6(3):499-508. doi: 10.1002/ehf2.12426. Epub 2019 Mar 10.

Application of stacked convolutional and long short-term memory network for accurate identification of CAD ECG signals.

Comput Biol Med. 2018 Mar 1;94:19-26. doi: 10.1016/j.compbiomed.2017.12.023. Epub 2018 Jan 2.

Biological Phenotypes of Heart Failure With Preserved Ejection Fraction.

J Am Coll Cardiol. 2017 Oct 24;70(17):2186-2200. doi: 10.1016/j.jacc.2017.09.006.

Global, Regional, and National Burden of Cardiovascular Diseases for 10 Causes, 1990 to 2015.

J Am Coll Cardiol. 2017 Jul 4;70(1):1-25. doi: 10.1016/j.jacc.2017.04.052. Epub 2017 May 17.

Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm.

Comput Methods Programs Biomed. 2017 Apr;141:19-26. doi: 10.1016/j.cmpb.2017.01.004. Epub 2017 Jan 18.

Characterization of coronary atherosclerosis by intravascular imaging modalities.

Cardiovasc Diagn Ther. 2016 Aug;6(4):368-81. doi: 10.21037/cdt.2015.12.05.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于评分和相关系数的特征选择在使用机器学习算法预测心力衰竭诊断中的应用。

Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms.

机构信息

Department of Computer Science & Information Technology, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, India.

Information Systems Department, Prince Sultan University, Riyadh, Saudi Arabia.

出版信息

Comput Math Methods Med. 2021 Dec 20;2021:8500314. doi: 10.1155/2021/8500314. eCollection 2021.

DOI:10.1155/2021/8500314

PMID:34966445

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8712170/

Abstract

摘要

基于评分和相关系数的特征选择在使用机器学习算法预测心力衰竭诊断中的应用。

Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于评分和相关系数的特征选择在使用机器学习算法预测心力衰竭诊断中的应用。

Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献