使用健身数据比较机器学习技术预测全因死亡率：亨利福特锻炼测试（FIT）项目。

Comparison of machine learning techniques to predict all-cause mortality using fitness data: the Henry ford exercIse testing (FIT) project.

机构信息

King AbdulAziz Cardiac Center, Ministry of National Guard, Health Affairs, King Abdulaziz Medical City for National Guard - Health affairs, King Abdullah International Medical Research Center, King Saud bin Abdulaziz University for Health Sciences, Department Mail Code: 1413, P.O. Box 22490, Riyadh, 11426, Kingdom of Saudi Arabia.

Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

出版信息

BMC Med Inform Decis Mak. 2017 Dec 19;17(1):174. doi: 10.1186/s12911-017-0566-6.

DOI:10.1186/s12911-017-0566-6

PMID:29258510

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5735871/

Abstract

BACKGROUND

Prior studies have demonstrated that cardiorespiratory fitness (CRF) is a strong marker of cardiovascular health. Machine learning (ML) can enhance the prediction of outcomes through classification techniques that classify the data into predetermined categories. The aim of this study is to present an evaluation and comparison of how machine learning techniques can be applied on medical records of cardiorespiratory fitness and how the various techniques differ in terms of capabilities of predicting medical outcomes (e.g. mortality).

METHODS

We use data of 34,212 patients free of known coronary artery disease or heart failure who underwent clinician-referred exercise treadmill stress testing at Henry Ford Health Systems Between 1991 and 2009 and had a complete 10-year follow-up. Seven machine learning classification techniques were evaluated: Decision Tree (DT), Support Vector Machine (SVM), Artificial Neural Networks (ANN), Naïve Bayesian Classifier (BC), Bayesian Network (BN), K-Nearest Neighbor (KNN) and Random Forest (RF). In order to handle the imbalanced dataset used, the Synthetic Minority Over-Sampling Technique (SMOTE) is used.

RESULTS

Two set of experiments have been conducted with and without the SMOTE sampling technique. On average over different evaluation metrics, SVM Classifier has shown the lowest performance while other models like BN, BC and DT performed better. The RF classifier has shown the best performance (AUC = 0.97) among all models trained using the SMOTE sampling.

CONCLUSIONS

The results show that various ML techniques can significantly vary in terms of its performance for the different evaluation metrics. It is also not necessarily that the more complex the ML model, the more prediction accuracy can be achieved. The prediction performance of all models trained with SMOTE is much better than the performance of models trained without SMOTE. The study shows the potential of machine learning methods for predicting all-cause mortality using cardiorespiratory fitness data.

摘要

背景

先前的研究表明，心肺适能（CRF）是心血管健康的强有力指标。机器学习（ML）可以通过分类技术增强对结果的预测，这些技术将数据分类到预定的类别中。本研究旨在展示如何在心肺适能的医疗记录上应用机器学习技术，并比较各种技术在预测医疗结果（例如死亡率）方面的能力差异。

方法

我们使用了 1991 年至 2009 年间在亨利福特健康系统接受临床医生推荐的运动跑步机压力测试且在 10 年内完成完整随访的 34212 例无已知冠状动脉疾病或心力衰竭的患者的数据。评估了七种机器学习分类技术：决策树（DT）、支持向量机（SVM）、人工神经网络（ANN）、朴素贝叶斯分类器（BC）、贝叶斯网络（BN）、K-近邻（KNN）和随机森林（RF）。为了处理使用的不平衡数据集，使用了合成少数过采样技术（SMOTE）。

结果

在使用和不使用 SMOTE 采样技术的情况下进行了两组实验。在不同的评估指标上，SVM 分类器的平均性能最低，而其他模型，如 BN、BC 和 DT 的性能更好。在使用 SMOTE 采样训练的所有模型中，RF 分类器的表现最好（AUC=0.97）。

结论

结果表明，各种 ML 技术在不同的评估指标上的性能可能会有很大差异。也不一定是 ML 模型越复杂，预测准确性就越高。使用 SMOTE 训练的所有模型的预测性能都明显优于未使用 SMOTE 训练的模型的性能。该研究表明，机器学习方法在使用心肺适能数据预测全因死亡率方面具有潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c601/5735871/fba67734c6b8/12911_2017_566_Fig1_HTML.jpg

相似文献

Comparison of machine learning techniques to predict all-cause mortality using fitness data: the Henry ford exercIse testing (FIT) project.

BMC Med Inform Decis Mak. 2017 Dec 19;17(1):174. doi: 10.1186/s12911-017-0566-6.

Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: The Henry Ford ExercIse Testing (FIT) project.

PLoS One. 2017 Jul 24;12(7):e0179805. doi: 10.1371/journal.pone.0179805. eCollection 2017.

Using machine learning on cardiorespiratory fitness data for predicting hypertension: The Henry Ford ExercIse Testing (FIT) Project.

PLoS One. 2018 Apr 18;13(4):e0195344. doi: 10.1371/journal.pone.0195344. eCollection 2018.

Using Machine Learning to Define the Association between Cardiorespiratory Fitness and All-Cause Mortality (from the Henry Ford Exercise Testing Project).

Am J Cardiol. 2017 Dec 1;120(11):2078-2084. doi: 10.1016/j.amjcard.2017.08.029. Epub 2017 Aug 30.

Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.

BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.

Hospital mortality prediction in traumatic injuries patients: comparing different SMOTE-based machine learning algorithms.

BMC Med Res Methodol. 2023 Apr 22;23(1):101. doi: 10.1186/s12874-023-01920-w.

On the interpretability of machine learning-based model for predicting hypertension.

BMC Med Inform Decis Mak. 2019 Jul 29;19(1):146. doi: 10.1186/s12911-019-0874-0.

Stroke Prediction with Machine Learning Methods among Older Chinese.

Int J Environ Res Public Health. 2020 Mar 12;17(6):1828. doi: 10.3390/ijerph17061828.

Machine learning models predict triage levels, massive transfusion protocol activation, and mortality in trauma utilizing patients hemodynamics on admission.

Comput Biol Med. 2024 Sep;179:108880. doi: 10.1016/j.compbiomed.2024.108880. Epub 2024 Jul 16.

Development of an efficient novel method for coronary artery disease prediction using machine learning and deep learning techniques.

Technol Health Care. 2024;32(6):4545-4569. doi: 10.3233/THC-240740.

引用本文的文献

Integrated bioinformatics and experiment validation reveal cuproptosis-related biomarkers and therapeutic targets in sepsis-induced myocardial dysfunction.

BMC Infect Dis. 2025 Mar 31;25(1):445. doi: 10.1186/s12879-025-10822-9.

Predicting all-cause mortality and premature death using interpretable machine learning among a middle-aged and elderly Chinese population.

Heliyon. 2024 Aug 28;10(17):e36878. doi: 10.1016/j.heliyon.2024.e36878. eCollection 2024 Sep 15.

Stone decision engine accurately predicts stone removal and treatment complications for shock wave lithotripsy and laser ureterorenoscopy patients.

PLoS One. 2024 May 2;19(5):e0301812. doi: 10.1371/journal.pone.0301812. eCollection 2024.

FIT calculator: a multi-risk prediction framework for medical outcomes using cardiorespiratory fitness data.

Sci Rep. 2024 Apr 16;14(1):8745. doi: 10.1038/s41598-024-59401-z.

Integrated transcriptomic meta-analysis and comparative artificial intelligence models in maize under biotic stress.

Sci Rep. 2023 Sep 23;13(1):15899. doi: 10.1038/s41598-023-42984-4.

Integrative Interpretation of Cardiopulmonary Exercise Tests for Cardiovascular Outcome Prediction: A Machine Learning Approach.

Diagnostics (Basel). 2023 Jun 13;13(12):2051. doi: 10.3390/diagnostics13122051.

Machine-learning predicts time-series prognosis factors in metastatic prostate cancer patients treated with androgen deprivation therapy.

Sci Rep. 2023 Apr 18;13(1):6325. doi: 10.1038/s41598-023-32987-6.

Prediction of Prednisolone Dose Correction Using Machine Learning.

J Healthc Inform Res. 2023 Feb 15;7(1):84-103. doi: 10.1007/s41666-023-00128-3. eCollection 2023 Mar.

Development and validation of questionnaire-based machine learning models for predicting all-cause mortality in a representative population of China.

Front Public Health. 2023 Jan 27;11:1033070. doi: 10.3389/fpubh.2023.1033070. eCollection 2023.

Identification of clinical factors related to prediction of alcohol use disorder from electronic health records using feature selection methods.

BMC Med Inform Decis Mak. 2022 Nov 23;22(1):304. doi: 10.1186/s12911-022-02051-w.

本文引用的文献

A Comparison of a Machine Learning Model with EuroSCORE II in Predicting Mortality after Elective Cardiac Surgery: A Decision Curve Analysis.

PLoS One. 2017 Jan 6;12(1):e0169772. doi: 10.1371/journal.pone.0169772. eCollection 2017.

Meta-Analysis Comparing Established Risk Prediction Models (EuroSCORE II, STS Score, and ACEF Score) for Perioperative Mortality During Cardiac Surgery.

Am J Cardiol. 2016 Nov 15;118(10):1574-1582. doi: 10.1016/j.amjcard.2016.08.024. Epub 2016 Aug 23.

Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach.

Acad Emerg Med. 2016 Mar;23(3):269-78. doi: 10.1111/acem.12876. Epub 2016 Feb 13.

Cardiorespiratory Fitness and Risk of Incident Atrial Fibrillation: Results From the Henry Ford Exercise Testing (FIT) Project.

Circulation. 2015 May 26;131(21):1827-34. doi: 10.1161/CIRCULATIONAHA.114.014833. Epub 2015 Apr 22.

Cardiorespiratory fitness and incident diabetes: the FIT (Henry Ford ExercIse Testing) project.

Diabetes Care. 2015 Jun;38(6):1075-81. doi: 10.2337/dc14-2714. Epub 2015 Mar 12.

Physical fitness and hypertension in a population at risk for cardiovascular disease: the Henry Ford ExercIse Testing (FIT) Project.

J Am Heart Assoc. 2014 Dec;3(6):e001268. doi: 10.1161/JAHA.114.001268.

Prognostic value of exercise capacity in patients with coronary artery disease: the FIT (Henry Ford ExercIse Testing) project.

Mayo Clin Proc. 2014 Dec;89(12):1644-54. doi: 10.1016/j.mayocp.2014.07.011. Epub 2014 Oct 14.

Rationale and design of the Henry Ford Exercise Testing Project (the FIT project).

Clin Cardiol. 2014 Aug;37(8):456-61. doi: 10.1002/clc.22302.

Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes.

J Clin Epidemiol. 2013 Apr;66(4):398-407. doi: 10.1016/j.jclinepi.2012.11.008. Epub 2013 Feb 4.

Mortality risk score prediction in an elderly population using machine learning.

Am J Epidemiol. 2013 Mar 1;177(5):443-52. doi: 10.1093/aje/kws241. Epub 2013 Jan 29.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用健身数据比较机器学习技术预测全因死亡率：亨利福特锻炼测试（FIT）项目。

Comparison of machine learning techniques to predict all-cause mortality using fitness data: the Henry ford exercIse testing (FIT) project.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献