School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, 310053, China.
Department of Clinical Laboratory, Hangzhou Women's Hospital, No. 369, Kunpeng Road, Shangcheng District Hangzhou, Hangzhou, 310008, Zhejiang, China.
BMC Pregnancy Childbirth. 2024 Nov 18;24(1):763. doi: 10.1186/s12884-024-06933-x.
Spontaneous preterm birth (sPTB) is a primary cause of adverse neonatal outcomes. The objective of this study is to analyze the factors influencing the occurrence of sPTB in pregnant women and to construct and validate a predictive model for sPTB risk based on big data from clinical and laboratory assessments during pregnancy.
A retrospective analysis was conducted on the clinical data of 3,082 pregnant women, categorizing those who delivered before 37 weeks of gestation as the sPTB group and those who delivered at or after 37 weeks as the full-term group. The performance of five machine learning models was compared using metrics such as the AUC, accuracy, sensitivity, specificity, and precision to identify the optimal predictive model. The top 10 predictive variables were selected based on their significance in disease prediction. The data were then divided into a training set (70%) and a validation set (30%) for validation. External data were also utilized to validate the model's predictive performance.
A total of 24 indicators with significant differences were identified. In terms of predicting the risk of preterm birth, the XGBoost algorithm demonstrated the most outstanding performance, with an AUC of 0.89 (95% CI: 0.88-0.90). The top 10 critical indicators included ALP, AFP, ALB, HCT, TC, DBP, ALT, PLT, height, and SBP, which are essential for constructing an accurate predictive model. The model exhibited stable performance on both the training and validation sets, with AUC values of 0.93 and 0.87, respectively. Furthermore, the external testing set also showed superior performance, with an AUC of 0.79.
At the time of delivery, ALP, AFP, ALB, HCT, TC, DBP, ALT, PLT, height, and SBP are influential factors for sPTB in pregnant women. The XGBoost algorithm, constructed based on these factors, demonstrated the most outstanding performance.
自发性早产(sPTB)是不良新生儿结局的主要原因。本研究旨在分析影响孕妇发生 sPTB 的因素,并基于妊娠期间临床和实验室评估的大数据构建并验证 sPTB 风险预测模型。
对 3082 例孕妇的临床资料进行回顾性分析,将妊娠 37 周前分娩的孕妇分为 sPTB 组,将妊娠 37 周及以上分娩的孕妇分为足月组。采用 AUC、准确率、敏感度、特异度和精度等指标比较 5 种机器学习模型的性能,以识别最佳预测模型。根据疾病预测的重要性选择前 10 个预测变量。然后将数据分为训练集(70%)和验证集(30%)进行验证。还利用外部数据验证模型的预测性能。
共确定了 24 个具有显著差异的指标。在预测早产风险方面,XGBoost 算法表现最为突出,AUC 为 0.89(95%CI:0.88-0.90)。前 10 个关键指标包括 ALP、AFP、ALB、HCT、TC、DBP、ALT、PLT、身高和 SBP,对于构建准确的预测模型至关重要。该模型在训练集和验证集上均表现出稳定的性能,AUC 值分别为 0.93 和 0.87。此外,外部测试集也表现出优异的性能,AUC 为 0.79。
分娩时,ALP、AFP、ALB、HCT、TC、DBP、ALT、PLT、身高和 SBP 是孕妇 sPTB 的影响因素。基于这些因素构建的 XGBoost 算法表现最为突出。