Department of Women's and Children's Health, Centre for Women's Health Research, Institute of Life Course and Medical Sciences, University of Liverpool, Member of Liverpool Health Partners, Liverpool, L8 7SS, UK.
High Field NMR Facility, Liverpool Shared Research Facilities, University of Liverpool, Liverpool, L69 7TX, UK.
Sci Rep. 2024 May 15;14(1):11172. doi: 10.1038/s41598-024-61690-3.
A significant number of pregnancies are lost in the first trimester and 1-2% are ectopic pregnancies (EPs). Early pregnancy loss in general can cause significant morbidity with bleeding or infection, while EPs are the leading cause of maternal mortality in the first trimester. Symptoms of pregnancy loss and EP are very similar (including pain and bleeding); however, these symptoms are also common in live normally sited pregnancies (LNSP). To date, no biomarkers have been identified to differentiate LNSP from pregnancies that will not progress beyond early gestation (non-viable or EPs), defined together as combined adverse outcomes (CAO). In this study, we present a novel machine learning pipeline to create prediction models that identify a composite biomarker to differentiate LNSP from CAO in symptomatic women. This prospective cohort study included 370 participants. A single blood sample was prospectively collected from participants on first emergency presentation prior to final clinical diagnosis of pregnancy outcome: LNSP, miscarriage, pregnancy of unknown location (PUL) or tubal EP (tEP). Miscarriage, PUL and tEP were grouped together into a CAO group. Human chorionic gonadotrophin β (β-hCG) and progesterone concentrations were measured in plasma. Serum samples were subjected to untargeted metabolomic profiling. The cohort was randomly split into train and validation data sets, with the train data set subjected to variable selection. Nine metabolite signals were identified as key discriminators of LNSP versus CAO. Random forest models were constructed using stable metabolite signals alone, or in combination with plasma hormone concentrations and demographic data. When comparing LNSP with CAO, a model with stable metabolite signals only demonstrated a modest predictive accuracy (0.68), which was comparable to a model of β-hCG and progesterone (0.71). The best model for LNSP prediction comprised stable metabolite signals and hormone concentrations (accuracy = 0.79). In conclusion, serum metabolite levels and biochemical markers from a single blood sample possess modest predictive utility in differentiating LNSP from CAO pregnancies upon first presentation, which is improved by variable selection and combination using machine learning. A diagnostic test to confirm LNSP and thus exclude pregnancies affecting maternal morbidity and potentially life-threatening outcomes would be invaluable in emergency situations.
相当数量的妊娠在孕早期流产,1-2%为异位妊娠(EP)。一般来说,早期妊娠丢失会导致严重的发病率,包括出血或感染,而 EP 是孕早期产妇死亡的主要原因。妊娠丢失和 EP 的症状非常相似(包括疼痛和出血);然而,这些症状在正常位置妊娠(LNSP)中也很常见。迄今为止,尚未发现能够区分 LNSP 与不能继续妊娠(无生命或 EP)的生物标志物,两者统称为不良妊娠结局(CAO)。在这项研究中,我们提出了一种新的机器学习管道,以创建预测模型,以识别一种复合生物标志物,将 LNSP 与有症状妇女的 CAO 区分开来。这项前瞻性队列研究纳入了 370 名参与者。在首次出现紧急症状就诊之前,前瞻性地采集了参与者的单一血样,以确定妊娠结局:LNSP、流产、妊娠位置不明(PUL)或输卵管 EP(tEP)。流产、PUL 和 tEP 被归为 CAO 组。测量血浆中人绒毛膜促性腺激素β(β-hCG)和孕酮浓度。对血清样本进行非靶向代谢组学分析。该队列随机分为训练和验证数据集,使用训练数据集进行变量选择。确定了 9 种代谢物信号作为区分 LNSP 与 CAO 的关键鉴别指标。使用稳定的代谢物信号或与血浆激素浓度和人口统计学数据相结合构建随机森林模型。当比较 LNSP 与 CAO 时,仅使用稳定代谢物信号的模型显示出适度的预测准确性(0.68),与β-hCG 和孕酮的模型(0.71)相当。用于 LNSP 预测的最佳模型包括稳定的代谢物信号和激素浓度(准确率=0.79)。总之,在首次就诊时,单一血样中的血清代谢物水平和生化标志物对区分 LNSP 与 CAO 妊娠具有适度的预测能力,通过机器学习的变量选择和组合可以提高其预测能力。一种能够确诊 LNSP 并排除影响产妇发病率和潜在危及生命的妊娠结局的诊断测试,在紧急情况下将具有重要价值。