• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用孕中期和孕晚期早期孕妇血液基因表达进行机器学习预测自发性早产:一个警示故事。

Machine learning for the prediction of spontaneous preterm birth using early second and third trimester maternal blood gene expression: A cautionary tale.

作者信息

Hornaday Kylie K, Werbicki Ty, Tough Suzanne C, Wood Stephen L, Anderson David W, Li Constance H, Slater Donna M

机构信息

Department of Physiology and Pharmacology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.

Department of Community of Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.

出版信息

PLoS One. 2025 Jun 27;20(6):e0310937. doi: 10.1371/journal.pone.0310937. eCollection 2025.

DOI:10.1371/journal.pone.0310937
PMID:40577367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12204558/
Abstract

Spontaneous preterm birth (sPTB) remains a significant global health challenge and a leading cause of neonatal mortality and morbidity. Despite advancements in neonatal care, the prediction of sPTB remains elusive, in part due to complex etiologies and heterogeneous patient populations. This study aimed to validate and extend information on gene expression biomarkers previously described for predicting sPTB using maternal whole blood from the All Our Families pregnancy cohort study based in Calgary, Canada. The results of this study are two-fold: first, using additional replicates of maternal blood samples from the All Our Families cohort, we were unable to repeat the findings of a 2016 study which identified top maternal gene expression predictors for sPTB. Second, we conducted a secondary analysis of the original gene expression dataset from the 2016 study using five modelling approaches (random forest, elastic net regression, unregularized logistic regression, L2-regularized logistic regression, and multilayer perceptron neural network) followed by external validation using a pregnancy cohort based in Detroit, USA. The top performing model (random forest classification) suggested promising performance (area under the receiver operating curve, AUROC 0.99 in the training set), but performance was significantly degraded on the test set (AUROC 0.54) and further degraded in external validation (AUROC 0.50), suggesting poor generalizability, likely due to overfitting exacerbated by a low feature-to-noise ratio. Similar performance was observed in the other four learning models. Prediction was not improved when using higher complexity machine learning (e.g., neural network) approaches over traditional statistical learning (e.g., logistic regression). These findings underscore the challenges in translating biomarker discovery into clinically useful predictive models for sPTB. This study highlights the critical need for rigorous methodological safeguards and external validation in biomarker research. It also emphasizes the impact of data noise and overfitting on model performance, particularly in high-dimensional omics datasets. Future research should prioritize robust validation strategies and explore mechanistic insights to improve our understanding and prediction of sPTB.

摘要

自发性早产(sPTB)仍然是一项重大的全球健康挑战,也是新生儿死亡和发病的主要原因。尽管新生儿护理取得了进展,但sPTB的预测仍然难以捉摸,部分原因是病因复杂且患者群体异质性大。本研究旨在验证并扩展先前描述的用于预测sPTB的基因表达生物标志物信息,该信息使用了来自加拿大卡尔加里“我们所有的家庭”妊娠队列研究中的孕妇全血。本研究结果有两方面:第一,使用“我们所有的家庭”队列中孕妇血液样本的额外重复样本,我们无法重复2016年一项研究的结果,该研究确定了sPTB的顶级孕妇基因表达预测指标。第二,我们使用五种建模方法(随机森林、弹性网回归、非正则逻辑回归、L2正则逻辑回归和多层感知器神经网络)对2016年研究的原始基因表达数据集进行了二次分析,随后使用美国底特律的一个妊娠队列进行了外部验证。表现最佳的模型(随机森林分类)显示出有前景的性能(训练集中受试者工作特征曲线下面积,AUROC为0.99),但在测试集上性能显著下降(AUROC为0.54),在外部验证中进一步下降(AUROC为0.50),这表明泛化性较差,可能是由于低特征噪声比加剧了过拟合。在其他四个学习模型中也观察到了类似的性能。与传统统计学习方法(如逻辑回归)相比,使用更高复杂度的机器学习方法(如神经网络)时,预测并没有得到改善。这些发现凸显了将生物标志物发现转化为临床上有用的sPTB预测模型所面临的挑战。本研究强调了在生物标志物研究中严格的方法保障和外部验证的迫切需求。它还强调了数据噪声和过拟合对模型性能的影响,特别是在高维组学数据集中。未来的研究应优先考虑稳健的验证策略,并探索机制性见解,以提高我们对sPTB的理解和预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/514630a9f1e1/pone.0310937.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/40f17c18a9c3/pone.0310937.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/83fa2e30b915/pone.0310937.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/1de6ec91772c/pone.0310937.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/514630a9f1e1/pone.0310937.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/40f17c18a9c3/pone.0310937.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/83fa2e30b915/pone.0310937.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/1de6ec91772c/pone.0310937.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ffe/12204558/514630a9f1e1/pone.0310937.g004.jpg

相似文献

1
Machine learning for the prediction of spontaneous preterm birth using early second and third trimester maternal blood gene expression: A cautionary tale.利用孕中期和孕晚期早期孕妇血液基因表达进行机器学习预测自发性早产:一个警示故事。
PLoS One. 2025 Jun 27;20(6):e0310937. doi: 10.1371/journal.pone.0310937. eCollection 2025.
2
The comparative and added prognostic value of biomarkers to the Revised Cardiac Risk Index for preoperative prediction of major adverse cardiac events and all-cause mortality in patients who undergo noncardiac surgery.生物标志物对改良心脏风险指数在预测非心脏手术患者主要不良心脏事件和全因死亡率方面的比较和附加预后价值。
Cochrane Database Syst Rev. 2021 Dec 21;12(12):CD013139. doi: 10.1002/14651858.CD013139.pub2.
3
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
4
Multiple-micronutrient supplementation for women during pregnancy.孕期女性的多种微量营养素补充
Cochrane Database Syst Rev. 2017 Apr 13;4(4):CD004905. doi: 10.1002/14651858.CD004905.pub5.
5
External validation of QUiPP App in three independent European cohorts of symptomatic women.QUiPP应用程序在三个独立的有症状女性欧洲队列中的外部验证。
Ultrasound Obstet Gynecol. 2025 Aug;66(2):163-174. doi: 10.1002/uog.29263. Epub 2025 Jun 26.
6
Accuracy of placental growth factor alone or in combination with soluble fms-like tyrosine kinase-1 or maternal factors in detecting preeclampsia in asymptomatic women in the second and third trimesters: a systematic review and meta-analysis.单独或联合使用胎盘生长因子、可溶性 fms 样酪氨酸激酶-1 或母体因素在第二和第三孕期无症状妇女中检测子痫前期的准确性:系统评价和荟萃分析。
Am J Obstet Gynecol. 2023 Sep;229(3):222-247. doi: 10.1016/j.ajog.2023.03.032. Epub 2023 Mar 28.
7
Antenatal corticosteroids for accelerating fetal lung maturation for women at risk of preterm birth.用于加速早产风险女性胎儿肺成熟的产前皮质类固醇。
Cochrane Database Syst Rev. 2017 Mar 21;3(3):CD004454. doi: 10.1002/14651858.CD004454.pub3.
8
Cervical pessary for preventing preterm birth in singleton pregnancies.宫颈托预防单胎妊娠早产。
Cochrane Database Syst Rev. 2022 Dec 1;12(12):CD014508. doi: 10.1002/14651858.CD014508.
9
Maternal and neonatal outcomes of elective induction of labor.择期引产的母婴结局
Evid Rep Technol Assess (Full Rep). 2009 Mar(176):1-257.
10
Sealing procedures for preterm prelabour rupture of membranes.早产胎膜早破的封闭治疗程序
Cochrane Database Syst Rev. 2016 Jul 7;7(7):CD010218. doi: 10.1002/14651858.CD010218.pub2.

本文引用的文献

1
Application of Proteomics in Maternal and Neonatal Health: Advancements and Future Directions.蛋白质组学在母婴健康中的应用:进展与未来方向
Proteomics Clin Appl. 2025 May;19(3):e70004. doi: 10.1002/prca.70004. Epub 2025 Mar 24.
2
A systematic review of first-trimester blood biomarkers associated with preterm prelabor rupture of the fetal membranes.对与胎膜早破相关的孕早期血液生物标志物的系统评价。
Biomarkers. 2025 May;30(3):271-283. doi: 10.1080/1354750X.2025.2475474. Epub 2025 Apr 7.
3
Maternal cytokine profiles in second and early third trimester are not predictive of preterm birth.
孕中期和孕晚期早期的母体细胞因子谱不能预测早产。
PLoS One. 2024 Dec 19;19(12):e0311721. doi: 10.1371/journal.pone.0311721. eCollection 2024.
4
Prediction of risk for early or very early preterm births using high-resolution urinary metabolomic profiling.利用高分辨率尿液代谢组学分析预测早期或极早期早产风险。
BMC Pregnancy Childbirth. 2024 Nov 25;24(1):783. doi: 10.1186/s12884-024-06974-2.
5
Development and validation of a spontaneous preterm birth risk prediction algorithm based on maternal bioinformatics: A single-center retrospective study.基于母体生物信息学的自发性早产风险预测算法的开发和验证:一项单中心回顾性研究。
BMC Pregnancy Childbirth. 2024 Nov 18;24(1):763. doi: 10.1186/s12884-024-06933-x.
6
Fetal Fraction of Cell-Free DNA in the Prediction of Adverse Pregnancy Outcomes: A Nationwide Retrospective Cohort Study.游离DNA的胎儿分数在不良妊娠结局预测中的作用:一项全国性回顾性队列研究
BJOG. 2025 Feb;132(3):318-325. doi: 10.1111/1471-0528.17978. Epub 2024 Oct 2.
7
Maternal Plasma miRNAs as Early Biomarkers of Moderate-to-Late-Preterm Birth.母体血浆 miRNA 作为中晚期早产儿的早期生物标志物。
Int J Mol Sci. 2024 Sep 2;25(17):9536. doi: 10.3390/ijms25179536.
8
Predicting Spontaneous Preterm Birth Using the Immunome.利用免疫组预测自发性早产
Clin Perinatol. 2024 Jun;51(2):441-459. doi: 10.1016/j.clp.2024.02.013. Epub 2024 Apr 4.
9
Predicting Preterm Birth Using Proteomics.利用蛋白质组学预测早产。
Clin Perinatol. 2024 Jun;51(2):391-409. doi: 10.1016/j.clp.2024.02.011. Epub 2024 Mar 23.
10
Predicting Preterm Birth Using Cell-Free Ribonucleic Acid.使用游离核糖核酸预测早产。
Clin Perinatol. 2024 Jun;51(2):379-389. doi: 10.1016/j.clp.2024.02.008. Epub 2024 Mar 13.