基于非侵入性指标的机器学习辅助肺炎预测。

Machine learning-assisted prediction of pneumonia based on non-invasive measures.

机构信息

College of Public Health, Zhengzhou University, Zhengzhou, China.

Department of Radiation Oncology, Zhengzhou University People's Hospital, Henan Provincial People's Hospital, Zhengzhou, China.

出版信息

Front Public Health. 2022 Jul 28;10:938801. doi: 10.3389/fpubh.2022.938801. eCollection 2022.

DOI:10.3389/fpubh.2022.938801

PMID:35968461

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9371749/

Abstract

BACKGROUND

Pneumonia is an infection of the lungs that is characterized by high morbidity and mortality. The use of machine learning systems to detect respiratory diseases non-invasive measures such as physical and laboratory parameters is gaining momentum and has been proposed to decrease diagnostic uncertainty associated with bacterial pneumonia. Herein, this study conducted several experiments using eight machine learning models to predict pneumonia based on biomarkers, laboratory parameters, and physical features.

METHODS

We perform machine-learning analysis on 535 different patients, each with 45 features. Data normalization to rescale all real-valued features was performed. Since it is a binary problem, we categorized each patient into one class at a time. We designed three experiments to evaluate the models: (1) feature selection techniques to select appropriate features for the models, (2) experiments on the imbalanced original dataset, and (3) experiments on the SMOTE data. We then compared eight machine learning models to evaluate their effectiveness in predicting pneumonia.

RESULTS

Biomarkers such as C-reactive protein and procalcitonin demonstrated the most significant discriminating power. Ensemble machine learning models such as RF (accuracy = 92.0%, precision = 91.3%, recall = 96.0%, f1-Score = 93.6%) and XGBoost (accuracy = 90.8%, precision = 92.6%, recall = 92.3%, f1-score = 92.4%) achieved the highest performance accuracy on the original dataset with AUCs of 0.96 and 0.97, respectively. On the SMOTE dataset, RF and XGBoost achieved the highest prediction results with f1-scores of 92.0 and 91.2%, respectively. Also, AUC of 0.97 was achieved for both RF and XGBoost models.

CONCLUSIONS

Our models showed that in the diagnosis of pneumonia, individual clinical history, laboratory indicators, and symptoms do not have adequate discriminatory power. We can also conclude that the ensemble ML models performed better in this study.

摘要

背景

肺炎是一种肺部感染，其发病率和死亡率都很高。使用机器学习系统来检测呼吸道疾病，如非侵入性的物理和实验室参数，正逐渐受到关注，并已被提出用于降低与细菌性肺炎相关的诊断不确定性。在此，本研究使用 8 种机器学习模型，基于生物标志物、实验室参数和物理特征，进行了多项实验，以预测肺炎。

方法

我们对 535 名不同的患者进行了机器学习分析，每位患者有 45 个特征。对所有实值特征进行了数据归一化，以重新缩放。由于这是一个二分类问题，我们每次将每个患者分类到一个类别中。我们设计了三个实验来评估模型：（1）特征选择技术，为模型选择合适的特征；（2）在原始不平衡数据集上的实验；（3）在 SMOTE 数据上的实验。然后，我们比较了 8 种机器学习模型，以评估它们在预测肺炎方面的有效性。

结果

生物标志物如 C 反应蛋白和降钙素表现出最显著的区分能力。RF（准确率=92.0%，精度=91.3%，召回率=96.0%，F1-Score=93.6%）和 XGBoost（准确率=90.8%，精度=92.6%，召回率=92.3%，F1-Score=92.4%）等集成机器学习模型在原始数据集上取得了最高的性能准确性，AUC 分别为 0.96 和 0.97。在 SMOTE 数据集上，RF 和 XGBoost 分别取得了最高的预测结果，F1-Score 分别为 92.0%和 91.2%。此外，RF 和 XGBoost 模型的 AUC 均达到 0.97。

结论

我们的模型表明，在肺炎的诊断中，个体的临床病史、实验室指标和症状没有足够的区分能力。我们还可以得出结论，在这项研究中，集成机器学习模型表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b32b/9371749/bc8f6fa0a1d0/fpubh-10-938801-g0001.jpg

相似文献

Machine learning-assisted prediction of pneumonia based on non-invasive measures.

Front Public Health. 2022 Jul 28;10:938801. doi: 10.3389/fpubh.2022.938801. eCollection 2022.

Hospital mortality prediction in traumatic injuries patients: comparing different SMOTE-based machine learning algorithms.

BMC Med Res Methodol. 2023 Apr 22;23(1):101. doi: 10.1186/s12874-023-01920-w.

A machine-learning approach for stress detection using wearable sensors in free-living environments.

Comput Biol Med. 2024 Sep;179:108918. doi: 10.1016/j.compbiomed.2024.108918. Epub 2024 Jul 18.

Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.

J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133.

Predicting post-stroke pneumonia using deep neural network approaches.

Int J Med Inform. 2019 Dec;132:103986. doi: 10.1016/j.ijmedinf.2019.103986. Epub 2019 Oct 1.

A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.

Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.

Stroke Prediction with Machine Learning Methods among Older Chinese.

Int J Environ Res Public Health. 2020 Mar 12;17(6):1828. doi: 10.3390/ijerph17061828.

Comparison of machine learning techniques to predict all-cause mortality using fitness data: the Henry ford exercIse testing (FIT) project.

BMC Med Inform Decis Mak. 2017 Dec 19;17(1):174. doi: 10.1186/s12911-017-0566-6.

Identification of clinical factors related to prediction of alcohol use disorder from electronic health records using feature selection methods.

BMC Med Inform Decis Mak. 2022 Nov 23;22(1):304. doi: 10.1186/s12911-022-02051-w.

Development and performance assessment of novel machine learning models to predict pneumonia after liver transplantation.

Respir Res. 2021 Mar 31;22(1):94. doi: 10.1186/s12931-021-01690-3.

引用本文的文献

A Novel Knowledge Fusion Ensemble for Diagnostic Differentiation of Pediatric Pneumonia and Acute Bronchitis.

Diagnostics (Basel). 2025 Sep 6;15(17):2258. doi: 10.3390/diagnostics15172258.

Determinants of community-acquired pneumonia among under-five children in Awi Zone, Northwest Ethiopia.

Front Public Health. 2025 May 1;13:1511263. doi: 10.3389/fpubh.2025.1511263. eCollection 2025.

Efficient federated learning for pediatric pneumonia on chest X-ray classification.

Sci Rep. 2024 Oct 7;14(1):23272. doi: 10.1038/s41598-024-74491-5.

Predicting omicron pneumonia severity and outcome: a single-center study in Hangzhou, China.

Front Med (Lausanne). 2023 May 26;10:1192376. doi: 10.3389/fmed.2023.1192376. eCollection 2023.

Machine Learning Approaches for the Prediction of Hepatitis B and C Seropositivity.

Int J Environ Res Public Health. 2023 Jan 29;20(3):2380. doi: 10.3390/ijerph20032380.

Prediction of HELLP Syndrome Severity Using Machine Learning Algorithms-Results from a Retrospective Study.

Diagnostics (Basel). 2023 Jan 12;13(2):287. doi: 10.3390/diagnostics13020287.

本文引用的文献

Development of machine learning model for diagnostic disease prediction based on laboratory tests.

Sci Rep. 2021 Apr 7;11(1):7567. doi: 10.1038/s41598-021-87171-5.

Machine Learning Approach to Predicting COVID-19 Disease Severity Based on Clinical Blood Test Data: Statistical Analysis and Model Development.

JMIR Med Inform. 2021 Apr 13;9(4):e25884. doi: 10.2196/25884.

Early prediction of level-of-care requirements in patients with COVID-19.

Elife. 2020 Oct 12;9:e60519. doi: 10.7554/eLife.60519.

Prediction of blood culture outcome using hybrid neural network model based on electronic health records.

BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):121. doi: 10.1186/s12911-020-1113-4.

TCGA-TCIA Impact on Radiogenomics Cancer Research: A Systematic Review.

Int J Mol Sci. 2019 Nov 29;20(23):6033. doi: 10.3390/ijms20236033.

Diagnosis and Treatment of Adults with Community-acquired Pneumonia. An Official Clinical Practice Guideline of the American Thoracic Society and Infectious Diseases Society of America.

Am J Respir Crit Care Med. 2019 Oct 1;200(7):e45-e67. doi: 10.1164/rccm.201908-1581ST.

Procalcitonin and C-Reactive Protein As Markers of Bacteremia in Patients With Febrile Neutropenia Who Receive Chemotherapy for Acute Leukemia: A Prospective Study From Nepal.

J Glob Oncol. 2019 Sep;5:1-6. doi: 10.1200/JGO.19.00147.

The "inconvenient truth" about AI in healthcare.

NPJ Digit Med. 2019 Aug 16;2:77. doi: 10.1038/s41746-019-0155-4. eCollection 2019.

Causes of severe pneumonia requiring hospital admission in children without HIV infection from Africa and Asia: the PERCH multi-country case-control study.

Lancet. 2019 Aug 31;394(10200):757-779. doi: 10.1016/S0140-6736(19)30721-4. Epub 2019 Jun 27.

Utilizing Machine Learning Methods for Preoperative Prediction of Postsurgical Mortality and Intensive Care Unit Admission.

Ann Surg. 2020 Dec;272(6):1133-1139. doi: 10.1097/SLA.0000000000003297.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于非侵入性指标的机器学习辅助肺炎预测。

Machine learning-assisted prediction of pneumonia based on non-invasive measures.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献