• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习在样本量有限的情况下表现优于逻辑回归分类:预测儿科 HIV 死亡率和临床进展为艾滋病的模型。

Machine learning outperformed logistic regression classification even with limit sample size: A model to predict pediatric HIV mortality and clinical progression to AIDS.

机构信息

Pediatric Infectious Diseases Unit, Fundación para la Investigación Biomédica del Hospital 12 de Octubre, Madrid, Spain.

PENTA Foundation, Padova, Italy.

出版信息

PLoS One. 2022 Oct 14;17(10):e0276116. doi: 10.1371/journal.pone.0276116. eCollection 2022.

DOI:10.1371/journal.pone.0276116
PMID:36240212
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9565414/
Abstract

Logistic regression (LR) is the most common prediction model in medicine. In recent years, supervised machine learning (ML) methods have gained popularity. However, there are many concerns about ML utility for small sample sizes. In this study, we aim to compare the performance of 7 algorithms in the prediction of 1-year mortality and clinical progression to AIDS in a small cohort of infants living with HIV from South Africa and Mozambique. The data set (n = 100) was randomly split into 70% training and 30% validation set. Seven algorithms (LR, Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Naïve Bayes (NB), Artificial Neural Network (ANN), and Elastic Net) were compared. The variables included as predictors were the same across the models including sociodemographic, virologic, immunologic, and maternal status features. For each of the models, a parameter tuning was performed to select the best-performing hyperparameters using 5 times repeated 10-fold cross-validation. A confusion-matrix was built to assess their accuracy, sensitivity, and specificity. RF ranked as the best algorithm in terms of accuracy (82,8%), sensitivity (78%), and AUC (0,73). Regarding specificity and sensitivity, RF showed better performance than the other algorithms in the external validation and the highest AUC. LR showed lower performance compared with RF, SVM, or KNN. The outcome of children living with perinatally acquired HIV can be predicted with considerable accuracy using ML algorithms. Better models would benefit less specialized staff in limited resources countries to improve prompt referral in case of high-risk clinical progression.

摘要

逻辑回归(LR)是医学中最常用的预测模型。近年来,监督机器学习(ML)方法越来越受欢迎。然而,对于 ML 在小样本量下的实用性存在许多担忧。在这项研究中,我们旨在比较 7 种算法在预测南非和莫桑比克感染 HIV 的小婴儿中 1 年死亡率和向艾滋病临床进展的表现。数据集(n = 100)随机分为 70%的训练集和 30%的验证集。比较了 7 种算法(LR、随机森林(RF)、支持向量机(SVM)、K-最近邻(KNN)、朴素贝叶斯(NB)、人工神经网络(ANN)和弹性网络)。纳入的预测变量在模型中是相同的,包括社会人口统计学、病毒学、免疫学和母体状态特征。对于每个模型,都进行了参数调整,使用 5 次重复 10 折交叉验证选择表现最佳的超参数。构建混淆矩阵以评估其准确性、敏感性和特异性。RF 在准确性(82.8%)、敏感性(78%)和 AUC(0.73)方面表现最佳。关于特异性和敏感性,RF 在外部验证中的表现优于其他算法,且 AUC 最高。LR 的性能低于 RF、SVM 或 KNN。使用 ML 算法可以相当准确地预测围产期感染 HIV 的儿童的结果。更好的模型将使资源有限国家的非专业人员受益,以便在临床进展风险高的情况下及时转诊。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75aa/9565414/574d27973032/pone.0276116.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75aa/9565414/927ee7d3b9eb/pone.0276116.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75aa/9565414/3850556f1b2c/pone.0276116.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75aa/9565414/574d27973032/pone.0276116.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75aa/9565414/927ee7d3b9eb/pone.0276116.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75aa/9565414/3850556f1b2c/pone.0276116.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75aa/9565414/574d27973032/pone.0276116.g003.jpg

相似文献

1
Machine learning outperformed logistic regression classification even with limit sample size: A model to predict pediatric HIV mortality and clinical progression to AIDS.机器学习在样本量有限的情况下表现优于逻辑回归分类:预测儿科 HIV 死亡率和临床进展为艾滋病的模型。
PLoS One. 2022 Oct 14;17(10):e0276116. doi: 10.1371/journal.pone.0276116. eCollection 2022.
2
Can machine learning predict pharmacotherapy outcomes? An application study in osteoporosis.机器学习能预测药物治疗效果吗?一项在骨质疏松症中的应用研究。
Comput Methods Programs Biomed. 2022 Oct;225:107028. doi: 10.1016/j.cmpb.2022.107028. Epub 2022 Jul 21.
3
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
4
Machine learning-based monosaccharide profiling for tissue-specific classification of Wolfiporia extensa samples.基于机器学习的单糖分析用于区分不同产地野生松口蘑样本
Carbohydr Polym. 2023 Dec 15;322:121338. doi: 10.1016/j.carbpol.2023.121338. Epub 2023 Aug 28.
5
Development and internal validation of a machine-learning-developed model for predicting 1-year mortality after fragility hip fracture.开发和内部验证一种机器学习开发的模型,用于预测脆性髋部骨折后 1 年的死亡率。
BMC Geriatr. 2022 May 24;22(1):451. doi: 10.1186/s12877-022-03152-x.
6
Performance Evaluation of Machine Learning Algorithm for Classification of Unintended Pregnancy among Married Women in Bangladesh.机器学习算法在孟加拉国已婚妇女意外怀孕分类中的性能评估。
J Healthc Eng. 2022 May 28;2022:1460908. doi: 10.1155/2022/1460908. eCollection 2022.
7
Application of machine learning algorithms to identify people with low bone density.机器学习算法在识别低骨密度人群中的应用。
Front Public Health. 2024 Apr 25;12:1347219. doi: 10.3389/fpubh.2024.1347219. eCollection 2024.
8
Machine Learning Approaches to Predict Chronic Lower Back Pain in People Aged over 50 Years.机器学习方法预测 50 岁以上人群的慢性下腰痛
Medicina (Kaunas). 2021 Nov 11;57(11):1230. doi: 10.3390/medicina57111230.
9
Development of Machine Learning Models for Prediction of Smoking Cessation Outcome.机器学习模型在预测戒烟结果中的应用开发。
Int J Environ Res Public Health. 2021 Mar 5;18(5):2584. doi: 10.3390/ijerph18052584.
10
Machine learning algorithms to predict seizure due to acute tramadol poisoning.机器学习算法预测因急性曲马多中毒导致的癫痫发作。
Hum Exp Toxicol. 2021 Aug;40(8):1225-1233. doi: 10.1177/0960327121991910. Epub 2021 Feb 4.

引用本文的文献

1
Scalable and robust machine learning framework for HIV classification using clinical and laboratory data.使用临床和实验室数据进行HIV分类的可扩展且稳健的机器学习框架。
Sci Rep. 2025 May 28;15(1):18727. doi: 10.1038/s41598-025-00085-4.
2
Leveraging small-sample machine learning for rigorous prediction of JOA recovery in cervical spondylotic myelopathy patients: insights from imaging parameters and modeling strategies.利用小样本机器学习对脊髓型颈椎病患者的日本骨科学会(JOA)恢复情况进行严格预测:来自影像学参数和建模策略的见解
Eur Spine J. 2025 Mar 7. doi: 10.1007/s00586-025-08763-7.
3
Application of machine learning in breast cancer survival prediction using a multimethod approach.

本文引用的文献

1
Automated machine learning: Review of the state-of-the-art and opportunities for healthcare.自动化机器学习:最新技术综述及医疗保健领域的机遇
Artif Intell Med. 2020 Apr;104:101822. doi: 10.1016/j.artmed.2020.101822. Epub 2020 Feb 21.
2
Machine Learning to Identify Persons at High-Risk of Human Immunodeficiency Virus Acquisition in Rural Kenya and Uganda.机器学习识别肯尼亚和乌干达农村中艾滋病毒感染高风险人群
Clin Infect Dis. 2020 Dec 3;71(9):2326-2333. doi: 10.1093/cid/ciz1096.
3
Comparing regression, naive Bayes, and random forest methods in the prediction of individual survival to second lactation in Holstein cattle.
机器学习在乳腺癌生存预测中的多方法应用。
Sci Rep. 2024 Dec 3;14(1):30147. doi: 10.1038/s41598-024-81734-y.
4
Early inflammation as a footprint of increased mortality risk in infants living with HIV from three African countries.在三个非洲国家,HIV 感染婴儿的早期炎症是其死亡率升高的一个标志。
Sci Rep. 2024 Oct 28;14(1):25792. doi: 10.1038/s41598-024-74066-4.
5
Artificial intelligence in assisting pathogenic microorganism diagnosis and treatment: a review of infectious skin diseases.人工智能辅助病原微生物诊断与治疗:感染性皮肤病综述
Front Microbiol. 2024 Oct 8;15:1467113. doi: 10.3389/fmicb.2024.1467113. eCollection 2024.
6
Machine learning-based prediction of the risk of moderate-to-severe catheter-related bladder discomfort in general anaesthesia patients: a prospective cohort study.基于机器学习的全麻患者中中重度导管相关膀胱不适风险预测:一项前瞻性队列研究。
BMC Anesthesiol. 2024 Sep 19;24(1):334. doi: 10.1186/s12871-024-02720-5.
7
The predictive accuracy of machine learning for the risk of death in HIV patients: a systematic review and meta-analysis.机器学习预测 HIV 患者死亡风险的准确性:系统评价和荟萃分析。
BMC Infect Dis. 2024 May 6;24(1):474. doi: 10.1186/s12879-024-09368-z.
8
Application of machine learning algorithms in classifying postoperative success in metabolic bariatric surgery: Acomprehensive study.机器学习算法在代谢性减肥手术术后成功分类中的应用:一项综合研究。
Digit Health. 2024 Mar 29;10:20552076241239274. doi: 10.1177/20552076241239274. eCollection 2024 Jan-Dec.
9
A Machine Learning-Based Method for Detecting Liver Fibrosis.一种基于机器学习的肝纤维化检测方法。
Diagnostics (Basel). 2023 Sep 14;13(18):2952. doi: 10.3390/diagnostics13182952.
10
Improved Regularized Multi-class Logistic Regression for Gene Classification with Optimal Kernel PCA and HC Algorithm.基于最优核主成分分析和 HC 算法的基因分类的正则化多类逻辑回归改进。
Adv Exp Med Biol. 2023;1424:273-279. doi: 10.1007/978-3-031-31982-2_31.
比较回归、朴素贝叶斯和随机森林方法在荷斯坦奶牛个体预测第二次泌乳存活中的应用。
J Dairy Sci. 2019 Oct;102(10):9409-9421. doi: 10.3168/jds.2019-16295. Epub 2019 Aug 22.
4
Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease).机器学习算法在临床事件预测(冠心病风险)中的比较。
J Biomed Inform. 2019 Sep;97:103257. doi: 10.1016/j.jbi.2019.103257. Epub 2019 Jul 30.
5
Development and validation of an automated HIV prediction algorithm to identify candidates for pre-exposure prophylaxis: a modelling study.开发和验证一种自动 HIV 预测算法以识别暴露前预防候选者:一项建模研究。
Lancet HIV. 2019 Oct;6(10):e696-e704. doi: 10.1016/S2352-3018(19)30139-0. Epub 2019 Jul 5.
6
Multiple Machine Learning Comparisons of HIV Cell-based and Reverse Transcriptase Data Sets.基于细胞的 HIV 和逆转录酶数据集的多种机器学习比较。
Mol Pharm. 2019 Apr 1;16(4):1620-1632. doi: 10.1021/acs.molpharmaceut.8b01297. Epub 2019 Feb 26.
7
Mortality among pediatric patients on HIV treatment in sub-Saharan African countries: a systematic review and meta-analysis.撒哈拉以南非洲国家接受 HIV 治疗的儿科患者死亡率:系统评价和荟萃分析。
BMC Public Health. 2019 Feb 4;19(1):149. doi: 10.1186/s12889-019-6482-1.
8
Mortality in a Cohort of HIV-Infected Children: A 12-Month Outcome of Antiretroviral Therapy in Makurdi, Nigeria.一组感染艾滋病毒儿童的死亡率:尼日利亚马库尔迪抗逆转录病毒疗法的12个月结果。
Adv Med. 2018 Jun 19;2018:6409134. doi: 10.1155/2018/6409134. eCollection 2018.
9
Random forest versus logistic regression: a large-scale benchmark experiment.随机森林与逻辑回归:大规模基准实验。
BMC Bioinformatics. 2018 Jul 17;19(1):270. doi: 10.1186/s12859-018-2264-5.
10
A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.一种利用百度趋势数据监测中国艾滋病疫情的机器学习方法。
PLoS One. 2018 Jul 11;13(7):e0199697. doi: 10.1371/journal.pone.0199697. eCollection 2018.