• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习预测早产:基于中国深圳大规模学龄前儿童调查数据的综合分析

Prediction of preterm birth using machine learning: a comprehensive analysis based on large-scale preschool children survey data in Shenzhen of China.

作者信息

Ding Liwen, Yin Xiaona, Wen Guomin, Sun Dengli, Xian Danxia, Zhao Yafen, Zhang Maolin, Yang Weikang, Chen Weiqing

机构信息

Department of Epidemiology and Health Statistics, School of Public Health, Sun Yat-Sen University, Guangzhou, 510080, China.

Women's and Children's Hospital of Longhua District of Shenzhen, Shenzhen, 518109, China.

出版信息

BMC Pregnancy Childbirth. 2024 Dec 4;24(1):810. doi: 10.1186/s12884-024-06980-4.

DOI:10.1186/s12884-024-06980-4
PMID:39633287
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11616287/
Abstract

BACKGROUND

Preterm birth (PTB) is a significant cause of neonatal mortality and long-term health issues. Accurate prediction and timely prevention of PTB are essential for reducing associated child mortality and morbidity. Traditional predictive methods face challenges due to heterogeneous risk factors and their interaction effects. This study aims to develop and evaluate six machine learning (ML) models to predict PTB using large-scale children survey data from Shenzhen, China, and to identify key predictors through Shapley Additive Explanations (SHAP) analysis.

METHODS

Data from 84,050 mother-child pairs, collected in 2021 and 2022, were processed and divided into training, validation, and test sets. Six ML models were tested: L1-Regularised Logistic Regression, Light Gradient Boosting Machine (LightGBM), Naive Bayes, Random Forests, Support Vector Machine, and Extreme Gradient Boosting (XGBoost). Model performance was evaluated based on discrimination, calibration and clinical utility. SHAP analysis was used to interpret the importance and impact of individual features on PTB prediction.

RESULTS

The XGBoost model demonstrated the best overall performance, with the area under the receiver operating characteristic curve (AUC) scores of 0.752 and 0.757 in the validation and test sets, respectively, along with favorable calibration and clinical utility. Key predictors identified were multiple pregnancies, threatened abortion, and maternal age of conception. SHAP analysis highlighted the positive impacts of multiple pregnancies and threatened abortion, as well as the negative impact of micronutrient supplementation on PTB.

CONCLUSION

Our study found that ML models, particularly XGBoost, show promise in accurately predicting PTB and identifying key risk factors. These findings provide the potential of ML for enhancing clinical interventions, personalizing prenatal care, and informing public health initiatives.

摘要

背景

早产是新生儿死亡和长期健康问题的重要原因。准确预测和及时预防早产对于降低相关儿童死亡率和发病率至关重要。由于风险因素的异质性及其相互作用,传统预测方法面临挑战。本研究旨在开发和评估六种机器学习(ML)模型,利用来自中国深圳的大规模儿童调查数据预测早产,并通过夏普利值加法解释(SHAP)分析确定关键预测因素。

方法

对2021年和2022年收集的84050对母婴数据进行处理,分为训练集、验证集和测试集。测试了六种ML模型:L1正则化逻辑回归、轻梯度提升机(LightGBM)、朴素贝叶斯、随机森林、支持向量机和极端梯度提升(XGBoost)。基于区分度、校准度和临床实用性评估模型性能。使用SHAP分析来解释个体特征对早产预测的重要性和影响。

结果

XGBoost模型表现出最佳的整体性能,在验证集和测试集中,受试者工作特征曲线(AUC)下面积得分分别为0.752和0.757,同时具有良好的校准度和临床实用性。确定的关键预测因素为多胎妊娠、先兆流产和受孕时的母亲年龄。SHAP分析突出了多胎妊娠和先兆流产的积极影响,以及微量营养素补充对早产的负面影响。

结论

我们的研究发现,ML模型,尤其是XGBoost,在准确预测早产和识别关键风险因素方面显示出前景。这些发现为ML在加强临床干预、个性化产前护理和为公共卫生举措提供信息方面提供了潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/df21e14ea7ed/12884_2024_6980_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/c2e76cc09673/12884_2024_6980_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/8b47ac25513a/12884_2024_6980_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/4d25b7adfe3e/12884_2024_6980_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/2a6c120ea239/12884_2024_6980_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/8238cf87698b/12884_2024_6980_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/0e6a2b561911/12884_2024_6980_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/fba5044d88d9/12884_2024_6980_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/85d336761d32/12884_2024_6980_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/df21e14ea7ed/12884_2024_6980_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/c2e76cc09673/12884_2024_6980_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/8b47ac25513a/12884_2024_6980_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/4d25b7adfe3e/12884_2024_6980_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/2a6c120ea239/12884_2024_6980_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/8238cf87698b/12884_2024_6980_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/0e6a2b561911/12884_2024_6980_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/fba5044d88d9/12884_2024_6980_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/85d336761d32/12884_2024_6980_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d58/11616287/df21e14ea7ed/12884_2024_6980_Fig9_HTML.jpg

相似文献

1
Prediction of preterm birth using machine learning: a comprehensive analysis based on large-scale preschool children survey data in Shenzhen of China.利用机器学习预测早产:基于中国深圳大规模学龄前儿童调查数据的综合分析
BMC Pregnancy Childbirth. 2024 Dec 4;24(1):810. doi: 10.1186/s12884-024-06980-4.
2
Machine learning prediction of preterm birth in women under 35 using routine biomarkers in a retrospective cohort study.一项回顾性队列研究中利用常规生物标志物对35岁以下女性早产进行机器学习预测
Sci Rep. 2025 Mar 25;15(1):10213. doi: 10.1038/s41598-025-92814-y.
3
Prediction of preterm birth in multiparous women using logistic regression and machine learning approaches.多产妇早产的逻辑回归和机器学习预测方法。
Sci Rep. 2024 Sep 20;14(1):21967. doi: 10.1038/s41598-024-60097-4.
4
Interpretable machine learning for allergic rhinitis prediction among preschool children in Urumqi, China.中国乌鲁木齐学龄前儿童变应性鼻炎预测的可解释机器学习。
Sci Rep. 2024 Sep 27;14(1):22281. doi: 10.1038/s41598-024-73733-w.
5
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
6
Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery.基于机器学习的预测模型用于接受非心脏手术的稳定冠状动脉疾病患者围手术期主要不良心血管事件的预测
Comput Methods Programs Biomed. 2025 Mar;260:108561. doi: 10.1016/j.cmpb.2024.108561. Epub 2024 Dec 13.
7
Prediction of depressive disorder using machine learning approaches: findings from the NHANES.使用机器学习方法预测抑郁症:来自美国国家健康与营养检查调查(NHANES)的结果
BMC Med Inform Decis Mak. 2025 Feb 17;25(1):83. doi: 10.1186/s12911-025-02903-1.
8
Predicting preterm birth using explainable machine learning in a prospective cohort of nulliparous and multiparous pregnant women.利用可解释机器学习在单胎和多胎孕妇前瞻性队列中预测早产。
PLoS One. 2023 Dec 27;18(12):e0293925. doi: 10.1371/journal.pone.0293925. eCollection 2023.
9
Prediction of STAS in lung adenocarcinoma with nodules ≤ 2 cm using machine learning: a multicenter retrospective study.使用机器学习预测直径≤2 cm的肺腺癌中的STAS:一项多中心回顾性研究
BMC Cancer. 2025 Mar 7;25(1):417. doi: 10.1186/s12885-025-13783-z.
10
Machine learning model-based preterm birth prediction and clinical nomogram: A big retrospective cohort study.基于机器学习模型的早产预测与临床列线图:一项大型回顾性队列研究。
Int J Gynaecol Obstet. 2025 Apr;169(1):332-340. doi: 10.1002/ijgo.16036. Epub 2024 Nov 18.

引用本文的文献

1
Artificial Intelligence's Role in Improving Adverse Pregnancy Outcomes: A Scoping Review and Consideration of Ethical Issues.人工智能在改善不良妊娠结局中的作用:一项范围综述及伦理问题考量
J Clin Med. 2025 May 30;14(11):3860. doi: 10.3390/jcm14113860.

本文引用的文献

1
A novel higher performance nomogram based on explainable machine learning for predicting mortality risk in stroke patients within 30 days based on clinical features on the first day ICU admission.一种新的基于可解释机器学习的高性能列线图,用于根据 ICU 入院第一天的临床特征预测 30 天内脑卒中患者的死亡风险。
BMC Med Inform Decis Mak. 2024 Jun 7;24(1):161. doi: 10.1186/s12911-024-02547-7.
2
Factors and outcomes for placental anomalies: An umbrella review of systematic reviews and meta-analyses.胎盘异常的因素和结果:系统评价和荟萃分析的伞式综述。
J Glob Health. 2024 Jan 19;14:04013. doi: 10.7189/jogh.14.04013.
3
Predicting preterm birth using explainable machine learning in a prospective cohort of nulliparous and multiparous pregnant women.
利用可解释机器学习在单胎和多胎孕妇前瞻性队列中预测早产。
PLoS One. 2023 Dec 27;18(12):e0293925. doi: 10.1371/journal.pone.0293925. eCollection 2023.
4
Development and validation of a prediction model on spontaneous preterm birth in twin pregnancy: a retrospective cohort study.建立并验证一项关于双胎妊娠自发性早产的预测模型:一项回顾性队列研究。
Reprod Health. 2023 Dec 21;20(1):187. doi: 10.1186/s12978-023-01728-3.
5
A Machine Learning Algorithm using Clinical and Demographic Data for All-Cause Preterm Birth Prediction.基于临床和人口统计学数据的机器学习算法预测全因早产
Am J Perinatol. 2024 May;41(S 01):e3115-e3123. doi: 10.1055/s-0043-1776917. Epub 2023 Dec 4.
6
On effectively predicting autism spectrum disorder therapy using an ensemble of classifiers.基于分类器集成的自闭症谱系障碍疗法有效预测
Sci Rep. 2023 Nov 15;13(1):19957. doi: 10.1038/s41598-023-46379-3.
7
Establishment of a model for predicting preterm birth based on the machine learning algorithm.基于机器学习算法的早产预测模型的建立。
BMC Pregnancy Childbirth. 2023 Nov 10;23(1):779. doi: 10.1186/s12884-023-06058-7.
8
National, regional, and global estimates of preterm birth in 2020, with trends from 2010: a systematic analysis.2020 年全球、区域和国家早产估计数及其 2010 年以来的变化趋势:系统分析。
Lancet. 2023 Oct 7;402(10409):1261-1271. doi: 10.1016/S0140-6736(23)00878-4.
9
Predicting post-stroke cognitive impairment using machine learning: A prospective cohort study.使用机器学习预测中风后认知障碍:一项前瞻性队列研究。
J Stroke Cerebrovasc Dis. 2023 Nov;32(11):107354. doi: 10.1016/j.jstrokecerebrovasdis.2023.107354. Epub 2023 Sep 14.
10
Associations between the size and duration of asymptomatic subchorionic hematoma and pregnancy outcomes in women with singleton pregnancies.在单胎妊娠妇女中,无症状性绒毛膜下血肿的大小和持续时间与妊娠结局的相关性。
BMC Pregnancy Childbirth. 2023 Aug 2;23(1):555. doi: 10.1186/s12884-023-05831-y.