

Machine learning model for predicting malaria using clinical information.

Affiliations

Department of Tropical Medicine and Parasitology, Seoul National University College of Medicine and Institute of Endemic Diseases, Seoul, 03080, Republic of Korea.

Department of Pharmacology, Yonsei University College of Medicine, Seoul, 03722, Republic of Korea; Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, 03722, Republic of Korea.

Publication information

Comput Biol Med. 2021 Feb;129:104151. doi: 10.1016/j.compbiomed.2020.104151. Epub 2020 Nov 28.

DOI: 10.1016/j.compbiomed.2020.104151
PMID: 33290932

Abstract

BACKGROUND

Rapid diagnosis is crucial for controlling malaria. Various studies have aimed to develop machine learning models that diagnose malaria from blood smear images; however, this approach has many limitations. This study developed a machine learning model for malaria diagnosis using patient information.

METHODS

To construct the datasets, we extracted patient information from PubMed abstracts published from 1956 to 2019. We used two datasets: a dataset containing only parasitic diseases, and a total dataset formed by adding information about other diseases. We compared six machine learning models: support vector machine, random forest (RF), multilayer perceptron, AdaBoost, gradient boosting (GB), and CatBoost. In addition, the synthetic minority oversampling technique (SMOTE) was employed to address the data imbalance problem.
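As a rough illustration of the oversampling step, the sketch below implements the core SMOTE idea in plain Python: pick a minority-class sample, choose one of its k nearest minority neighbours, and create a synthetic point at a random position on the line segment between them. This is not the authors' code. The abstract does not say how the patient features were encoded or which SMOTE implementation was used, so the function name `smote`, its parameters, and the toy numeric data are all assumptions made for illustration.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points interpolated between minority samples.

    minority: list of equal-length numeric feature vectors (the rare class).
    Hypothetical helper for illustration; assumes purely numeric features.
    """
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # Place the new point at a random position on the segment
        # from the base sample to the chosen neighbour.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical toy data: 2 numeric features, 4 minority vs 10 majority samples.
minority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
majority_count = 10
new_points = smote(minority, n_new=majority_count - len(minority), k=3)
balanced_minority = minority + new_points
print(len(balanced_minority))  # 10: minority class now matches the majority count
```

Because each synthetic point is a convex combination of two real minority samples, it always falls inside the region spanned by the minority class. Oversampling of this kind is normally applied to the training split only, so that synthetic points do not leak into the evaluation data.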

RESULTS

On the parasitic-disease-only dataset, RF was the best model with or without SMOTE. On the total dataset, GB was the best model; however, after SMOTE was applied, RF performed best. With the imbalanced data, nationality was the most important feature for malaria prediction. With the data balanced by SMOTE, the most important feature was symptom.

CONCLUSIONS

The results demonstrated that machine learning techniques can be successfully applied to predict malaria using patient information.

