• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于人群的极早产儿队列中流失率的机器学习预测方法。

Machine learning methods to predict attrition in a population-based cohort of very preterm infants.

机构信息

EPIUnit - Instituto de Saúde Pública, Universidade do Porto, Rua das Taipas, nº 135, 4050-600, Porto, Portugal.

Laboratório para a Investigação Integrativa e Translacional em Saúde Populacional (ITR), Porto, Portugal.

出版信息

Sci Rep. 2022 Jun 22;12(1):10587. doi: 10.1038/s41598-022-13946-z.

DOI:10.1038/s41598-022-13946-z
PMID:35732850
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9217966/
Abstract

The timely identification of cohort participants at higher risk for attrition is important to earlier interventions and efficient use of research resources. Machine learning may have advantages over the conventional approaches to improve discrimination by analysing complex interactions among predictors. We developed predictive models of attrition applying a conventional regression model and different machine learning methods. A total of 542 very preterm (< 32 gestational weeks) infants born in Portugal as part of the European Effective Perinatal Intensive Care in Europe (EPICE) cohort were included. We tested a model with a fixed number of predictors (Baseline) and a second with a dynamic number of variables added from each follow-up (Incremental). Eight classification methods were applied: AdaBoost, Artificial Neural Networks, Functional Trees, J48, J48Consolidated, K-Nearest Neighbours, Random Forest and Logistic Regression. Performance was compared using AUC- PR (Area Under the Curve-Precision Recall), Accuracy, Sensitivity and F-measure. Attrition at the four follow-ups were, respectively: 16%, 25%, 13% and 17%. Both models demonstrated good predictive performance, AUC-PR ranging between 69 and 94.1 in Baseline and from 72.5 to 97.1 in Incremental model. Of the whole set of methods, Random Forest presented the best performance at all follow-ups [AUC-PR: 94.1 (2.0); AUC-PR: 91.2 (1.2); AUC-PR: 97.1 (1.0); AUC-PR: 96.5 (1.7)]. Logistic Regression performed well below Random Forest. The top-ranked predictors were common for both models in all follow-ups: birthweight, gestational age, maternal age, and length of hospital stay. Random Forest presented the highest capacity for prediction and provided interpretable predictors. Researchers involved in cohorts can benefit from our robust models to prepare for and prevent loss to follow-up by directing efforts toward individuals at higher risk.

摘要

及时识别有较高脱落风险的队列参与者对于早期干预和有效利用研究资源非常重要。机器学习在分析预测因子之间的复杂交互作用以提高区分能力方面可能具有优势。我们应用传统回归模型和不同的机器学习方法开发了脱落预测模型。共纳入了 542 名出生于葡萄牙的非常早产儿(<32 孕周),他们是欧洲有效围产期密集护理(EPICE)队列的一部分。我们测试了一个具有固定数量预测因子的模型(基线)和第二个具有从每次随访中添加的动态数量变量的模型(增量)。应用了八种分类方法:AdaBoost、人工神经网络、功能树、J48、J48Consolidated、K-近邻、随机森林和逻辑回归。使用 AUC-PR(曲线下面积-精度召回率)、准确性、敏感性和 F 度量来比较性能。四次随访的脱落率分别为:16%、25%、13%和 17%。两种模型均表现出良好的预测性能,基线模型的 AUC-PR 范围为 69 至 94.1,增量模型的 AUC-PR 范围为 72.5 至 97.1。在整个方法中,随机森林在所有随访中表现出最佳性能[AUC-PR:94.1(2.0);AUC-PR:91.2(1.2);AUC-PR:97.1(1.0);AUC-PR:96.5(1.7)]。逻辑回归的表现明显低于随机森林。前几个排名最高的预测因子在所有随访中均为两种模型的共同预测因子:出生体重、胎龄、母亲年龄和住院时间。随机森林具有最高的预测能力,并提供可解释的预测因子。参与队列研究的研究人员可以从我们稳健的模型中受益,通过针对高风险个体进行努力,为随访流失做好准备并加以预防。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/9217966/38ce94d41122/41598_2022_13946_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/9217966/2708bcf66ddb/41598_2022_13946_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/9217966/38ce94d41122/41598_2022_13946_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/9217966/2708bcf66ddb/41598_2022_13946_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/9217966/38ce94d41122/41598_2022_13946_Fig2_HTML.jpg

相似文献

1
Machine learning methods to predict attrition in a population-based cohort of very preterm infants.基于人群的极早产儿队列中流失率的机器学习预测方法。
Sci Rep. 2022 Jun 22;12(1):10587. doi: 10.1038/s41598-022-13946-z.
2
Prediction of preterm birth in multiparous women using logistic regression and machine learning approaches.多产妇早产的逻辑回归和机器学习预测方法。
Sci Rep. 2024 Sep 20;14(1):21967. doi: 10.1038/s41598-024-60097-4.
3
Chronic stress in practice assistants: An analytic approach comparing four machine learning classifiers with a standard logistic regression model.实践助理中的慢性压力:一种分析方法,比较了四种机器学习分类器与标准逻辑回归模型。
PLoS One. 2021 May 4;16(5):e0250842. doi: 10.1371/journal.pone.0250842. eCollection 2021.
4
Improving preterm newborn identification in low-resource settings with machine learning.利用机器学习提高资源匮乏环境中早产儿的识别率。
PLoS One. 2019 Feb 27;14(2):e0198919. doi: 10.1371/journal.pone.0198919. eCollection 2019.
5
Comparison of logistic regression with machine learning methods for the prediction of fetal growth abnormalities: a retrospective cohort study.逻辑回归与机器学习方法预测胎儿生长异常的比较:一项回顾性队列研究。
BMC Pregnancy Childbirth. 2018 Aug 15;18(1):333. doi: 10.1186/s12884-018-1971-2.
6
Machine learning versus logistic regression for prognostic modelling in individuals with non-specific neck pain.机器学习与逻辑回归在非特异性颈痛患者预后模型中的比较。
Eur Spine J. 2022 Aug;31(8):2082-2091. doi: 10.1007/s00586-022-07188-w. Epub 2022 Mar 30.
7
Machine-learning prediction of adolescent alcohol use: a cross-study, cross-cultural validation.机器学习预测青少年饮酒:跨研究、跨文化验证。
Addiction. 2019 Apr;114(4):662-671. doi: 10.1111/add.14504. Epub 2018 Dec 21.
8
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
9
Predicting mortality risk for preterm infants using random forest.使用随机森林预测早产儿的死亡率。
Sci Rep. 2021 Mar 31;11(1):7308. doi: 10.1038/s41598-021-86748-4.
10
Comparative effectiveness of explainable machine learning approaches for extrauterine growth restriction classification in preterm infants using longitudinal data.使用纵向数据的可解释机器学习方法对早产儿宫外生长受限分类的比较有效性
Front Med (Lausanne). 2023 Nov 29;10:1166743. doi: 10.3389/fmed.2023.1166743. eCollection 2023.

引用本文的文献

1
Survey Data on Attitudes Towards Foreign Aid & Development in France, Germany, Great Britain, and the U.S.关于法国、德国、英国和美国对对外援助与发展态度的调查数据
Sci Data. 2025 Jul 1;12(1):1122. doi: 10.1038/s41597-025-05135-0.
2
A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial.一种应用于口腔健康临床试验的在线强化学习算法
Proc AAAI Conf Artif Intell. 2025;39(28):28792-28800. doi: 10.1609/aaai.v39i28.35143. Epub 2025 Apr 11.
3
Modeling the determinants of attrition in a two-stage epilepsy prevalence survey in Nairobi using machine learning.

本文引用的文献

1
Strategies for assessing the impact of loss to follow-up on estimates of neurodevelopmental impairment in a very preterm cohort at 2 years of age.评估失访对极早产儿 2 岁时神经发育损伤估计值影响的策略。
BMC Med Res Methodol. 2021 Jun 6;21(1):118. doi: 10.1186/s12874-021-01264-3.
2
Completeness of Retention Data and Determinants of Attrition in Birth Cohorts of Very Preterm Infants: A Systematic Review.极早产儿出生队列中保留数据的完整性及失访的决定因素:一项系统综述
Front Pediatr. 2021 Feb 17;9:529733. doi: 10.3389/fped.2021.529733. eCollection 2021.
3
Machine-Learning vs. Expert-Opinion Driven Logistic Regression Modelling for Predicting 30-Day Unplanned Rehospitalisation in Preterm Babies: A Prospective, Population-Based Study (EPIPAGE 2).
利用机器学习对内罗毕两阶段癫痫患病率调查中失访的决定因素进行建模。
Glob Epidemiol. 2025 Jan 6;9:100183. doi: 10.1016/j.gloepi.2025.100183. eCollection 2025 Jun.
机器学习与专家意见驱动的逻辑回归模型在预测早产儿30天内非计划再入院中的应用:一项基于人群的前瞻性研究(EPIPAGE 2)
Front Pediatr. 2021 Feb 3;8:585868. doi: 10.3389/fped.2020.585868. eCollection 2020.
4
Tree-based Machine Learning Methods for Survey Research.用于调查研究的基于树的机器学习方法。
Surv Res Methods. 2019 Apr 11;13(1):73-93.
5
Cohort Profile: Effective Perinatal Intensive Care in Europe (EPICE) very preterm birth cohort.队列简介:欧洲极早产有效围产期重症监护(EPICE)极早产队列。
Int J Epidemiol. 2020 Apr 1;49(2):372-386. doi: 10.1093/ije/dyz270.
6
Priorities for collaborative research using very preterm birth cohorts.极早产儿队列的合作研究优先事项。
Arch Dis Child Fetal Neonatal Ed. 2020 Sep;105(5):538-544. doi: 10.1136/archdischild-2019-317991. Epub 2020 Feb 6.
7
EPICE cohort: two-year neurodevelopmental outcomes after very preterm birth.EPICE 队列研究:极早产儿出生后两年的神经发育结局。
Arch Dis Child Fetal Neonatal Ed. 2020 Jul;105(4):350-356. doi: 10.1136/archdischild-2019-317418. Epub 2019 Nov 5.
8
What is Machine Learning? A Primer for the Epidemiologist.什么是机器学习?流行病学人员入门指南。
Am J Epidemiol. 2019 Dec 31;188(12):2222-2239. doi: 10.1093/aje/kwz189.
9
Predicting breast cancer metastasis by using serum biomarkers and clinicopathological data with machine learning technologies.利用机器学习技术,通过血清生物标志物和临床病理数据预测乳腺癌转移。
Int J Med Inform. 2019 Aug;128:79-86. doi: 10.1016/j.ijmedinf.2019.05.003. Epub 2019 May 7.
10
A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models.系统评价显示,机器学习在临床预测模型中并未优于逻辑回归。
J Clin Epidemiol. 2019 Jun;110:12-22. doi: 10.1016/j.jclinepi.2019.02.004. Epub 2019 Feb 11.