比较不同的监督机器学习算法在疾病预测中的应用。

Comparing different supervised machine learning algorithms for disease prediction.

机构信息

Complex Systems Research Group, Faculty of Engineering, The University of Sydney, Room 524, SIT Building (J12), Darlington, NSW, 2008, Australia.

Health Market Quality Research Stream, Capital Markets CRC, Level 3, 55 Harrington Street, Sydney, NSW, Australia.

出版信息

BMC Med Inform Decis Mak. 2019 Dec 21;19(1):281. doi: 10.1186/s12911-019-1004-8.

DOI:10.1186/s12911-019-1004-8

PMID:31864346

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6925840/

Abstract

BACKGROUND

Supervised machine learning algorithms have been a dominant method in the data mining field. Disease prediction using health data has recently shown a potential application area for these methods. This study ai7ms to identify the key trends among different types of supervised machine learning algorithms, and their performance and usage for disease risk prediction.

METHODS

In this study, extensive research efforts were made to identify those studies that applied more than one supervised machine learning algorithm on single disease prediction. Two databases (i.e., Scopus and PubMed) were searched for different types of search items. Thus, we selected 48 articles in total for the comparison among variants supervised machine learning algorithms for disease prediction.

RESULTS

We found that the Support Vector Machine (SVM) algorithm is applied most frequently (in 29 studies) followed by the Naïve Bayes algorithm (in 23 studies). However, the Random Forest (RF) algorithm showed superior accuracy comparatively. Of the 17 studies where it was applied, RF showed the highest accuracy in 9 of them, i.e., 53%. This was followed by SVM which topped in 41% of the studies it was considered.

CONCLUSION

This study provides a wide overview of the relative performance of different variants of supervised machine learning algorithms for disease prediction. This important information of relative performance can be used to aid researchers in the selection of an appropriate supervised machine learning algorithm for their studies.

摘要

背景

监督机器学习算法是数据挖掘领域的主要方法。使用健康数据进行疾病预测最近显示出这些方法的一个潜在应用领域。本研究旨在识别不同类型的监督机器学习算法之间的关键趋势，以及它们在疾病风险预测中的性能和用途。

方法

在这项研究中，我们进行了广泛的研究工作，以确定那些在单一疾病预测中应用了多种监督机器学习算法的研究。我们在两个数据库（即 Scopus 和 PubMed）中搜索了不同类型的搜索项。因此，我们总共选择了 48 篇文章，用于比较用于疾病预测的监督机器学习算法的变体。

结果

我们发现支持向量机（SVM）算法的应用最为频繁（在 29 项研究中），其次是朴素贝叶斯算法（在 23 项研究中）。然而，随机森林（RF）算法的准确性相对较高。在应用的 17 项研究中，RF 在其中 9 项研究中表现出最高的准确性，即 53%。其次是 SVM，在其被考虑的研究中，有 41%的研究排名第一。

结论

本研究提供了监督机器学习算法在疾病预测方面的相对性能的广泛概述。这些相对性能的重要信息可用于帮助研究人员在他们的研究中选择适当的监督机器学习算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d26/6925840/a6161ce2908b/12911_2019_1004_Fig1_HTML.jpg

相似文献

Comparing different supervised machine learning algorithms for disease prediction.

BMC Med Inform Decis Mak. 2019 Dec 21;19(1):281. doi: 10.1186/s12911-019-1004-8.

Application of supervised machine learning algorithms for classification and prediction of type-2 diabetes disease status in Afar regional state, Northeastern Ethiopia 2021.

Sci Rep. 2023 May 13;13(1):7779. doi: 10.1038/s41598-023-34906-1.

Comparison of supervised machine learning classification techniques in prediction of locoregional recurrences in early oral tongue cancer.

Int J Med Inform. 2020 Apr;136:104068. doi: 10.1016/j.ijmedinf.2019.104068. Epub 2019 Dec 28.

Application of supervised machine learning algorithms in the classification of sagittal gait patterns of cerebral palsy children with spastic diplegia.

Comput Biol Med. 2019 Mar;106:33-39. doi: 10.1016/j.compbiomed.2019.01.009. Epub 2019 Jan 16.

Obstructive Sleep Apnea: A Prediction Model Using Supervised Machine Learning Method.

Stud Health Technol Inform. 2020 Jun 26;272:387-390. doi: 10.3233/SHTI200576.

Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison.

Comput Biol Med. 2021 Sep;136:104672. doi: 10.1016/j.compbiomed.2021.104672. Epub 2021 Jul 21.

Machine-learning techniques for the prediction of protein-protein interactions.

J Biosci. 2019 Sep;44(4).

Developing robust arsenic awareness prediction models using machine learning algorithms.

J Environ Manage. 2018 Apr 1;211:125-137. doi: 10.1016/j.jenvman.2018.01.044. Epub 2018 Feb 4.

Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease).

J Biomed Inform. 2019 Sep;97:103257. doi: 10.1016/j.jbi.2019.103257. Epub 2019 Jul 30.

Construction accident narrative classification: An evaluation of text mining techniques.

Accid Anal Prev. 2017 Nov;108:122-130. doi: 10.1016/j.aap.2017.08.026. Epub 2017 Sep 1.

引用本文的文献

Artificial intelligence in interventional cardiology: a review of its role in diagnosis, decision-making, and procedural precision.

Ann Med Surg (Lond). 2025 Jul 18;87(9):5720-5734. doi: 10.1097/MS9.0000000000003602. eCollection 2025 Sep.

CT Radiomics-based machine learning approach for the invasiveness of pulmonary ground-glass nodules prediction.

Eur J Radiol Open. 2025 Aug 23;15:100680. doi: 10.1016/j.ejro.2025.100680. eCollection 2025 Dec.

Interpretable Machine Learning Models for Predicting Malignant Ventricular Arrhythmia in Patients with Acute ST-Segment Elevation Myocardial Infarction Based on Systemic Inflammation Index.

Clin Appl Thromb Hemost. 2025 Jan-Dec;31:10760296251375795. doi: 10.1177/10760296251375795. Epub 2025 Sep 1.

Pre-diagnostic serum metabolome and breast cancer risk: a nested case-control study.

Breast Cancer Res. 2025 Aug 27;27(1):156. doi: 10.1186/s13058-025-02102-w.

Identification of Neutrophil Extracellular Trap-Related Biomarkers in Diabetic Foot Ulcers Based on Bioinformatics.

J Inflamm Res. 2025 Aug 18;18:11355-11372. doi: 10.2147/JIR.S531204. eCollection 2025.

Revealing potential interfering genes between abdominal aortic aneurysm and periodontitis through machine learning and bioinformatics analysis.

PLoS One. 2025 Aug 26;20(8):e0329592. doi: 10.1371/journal.pone.0329592. eCollection 2025.

Machine learning applications in forecasting patient satisfaction and clinical outcomes after carpal tunnel release: a retrospective study.

BMC Musculoskelet Disord. 2025 Aug 23;26(1):813. doi: 10.1186/s12891-025-09079-9.

Machine learning based on pangenome-wide association studies reveals the impact of host source on the zoonotic potential of closely related bacterial pathogens.

Commun Biol. 2025 Aug 20;8(1):1253. doi: 10.1038/s42003-025-08650-3.

RadiomiX for Radiomics Analysis: Automated Approaches to Overcome Challenges in Replicability.

Diagnostics (Basel). 2025 Aug 5;15(15):1968. doi: 10.3390/diagnostics15151968.

From brain to education through machine learning: Predicting literacy and numeracy skills from neuroimaging data.

Imaging Neurosci (Camb). 2024 Jul 3;2. doi: 10.1162/imag_a_00219. eCollection 2024.

本文引用的文献

Detection of genetic cardiac diseases by Ca transient profiles using machine learning methods.

Sci Rep. 2018 Jun 19;8(1):9355. doi: 10.1038/s41598-018-27695-5.

Research on Improved Depth Belief Network-Based Prediction of Cardiovascular Diseases.

J Healthc Eng. 2018 May 9;2018:8954878. doi: 10.1155/2018/8954878. eCollection 2018.

Applications of Machine Learning in Fatty Live Disease Prediction.

Stud Health Technol Inform. 2018;247:166-170.

Prediction of cardiac death after adenosine myocardial perfusion SPECT based on machine learning.

J Nucl Cardiol. 2019 Oct;26(5):1746-1754. doi: 10.1007/s12350-018-1250-7. Epub 2018 Mar 14.

Prostate cancer detection using machine learning techniques by employing combination of features extracting strategies.

Cancer Biomark. 2018 Feb 6;21(2):393-413. doi: 10.3233/CBM-170643.

Prediction of lung cancer patient survival via supervised machine learning classification techniques.

Int J Med Inform. 2017 Dec;108:1-8. doi: 10.1016/j.ijmedinf.2017.09.013. Epub 2017 Sep 25.

Wrapper method for feature selection to classify cardiac arrhythmia.

Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:3656-3659. doi: 10.1109/EMBC.2017.8037650.

Comparing deep neural network and other machine learning algorithms for stroke prediction in a large-scale population-based electronic medical claims database.

Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:3110-3113. doi: 10.1109/EMBC.2017.8037515.

Risk prediction model for in-hospital mortality in women with ST-elevation myocardial infarction: A machine learning approach.

Heart Lung. 2017 Nov-Dec;46(6):405-411. doi: 10.1016/j.hrtlng.2017.09.003. Epub 2017 Oct 6.

Evaluation of Machine Learning Methods to Predict Coronary Artery Disease Using Metabolomic Data.

Stud Health Technol Inform. 2017;235:111-115.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

比较不同的监督机器学习算法在疾病预测中的应用。

Comparing different supervised machine learning algorithms for disease prediction.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献