Lv Bohan, Wang Gang, Pan Yueshuai, Yuan Guanghui, Wei Lili
Department of Critical Care Medicine, Affiliated Hospital of Qingdao University, Qingdao 266000, China.
Nursing Department, Affiliated Hospital of Qingdao University, Qingdao 266000, China.
Pregnancy Hypertens. 2025 Mar;39:101198. doi: 10.1016/j.preghy.2025.101198. Epub 2025 Jan 30.
To analyze the influencing factors of early-onset preeclampsia (EOPE). And to construct and validate the prediction model of EOPE using machine learning algorithm.
Based on Python system, the data profile of 1040 pregnant women was divided into 80% training set and 20% test set. Logistic regression algorithm, XGBoost algorithm, random forest algorithm, support vector machine algorithm and artificial neural network algorithm were used to construct the EOPE prediction model, respectively, and the resulting model was validated by resampling method. Accuracy, sensitivity, specificity, F1 score, and area under the ROC curve were used to evaluate the resulting models and screen the optimal models.
EOPE in pregnant women.
The results of binary logistic regression showed that the influencing factors of EOPE included six indicators: pre-pregnancy BMI, number of pregnancies, mean arterial pressure, smoking, alpha-fetoprotein, and methods of conception. Among them, the prediction model of EOPE constructed based on the XGBoost algorithm performed the best in the training and test sets, with an F1 score of 0.554 ± 0.068 and an AUC of 0.963 (95 % CI: 0.943 ∼ 0.983) in the training set, and an F1 score of 0.488 ± 0.082 and an AUC of 0.936 (95 % CI: 0.887 ∼ 0.983).
Our prediction model for EOPE constructed based on the XGBoost algorithm has superior disease prediction ability and can provide assistance in predicting the disease risk of EOPE.
分析早发型子痫前期(EOPE)的影响因素。并使用机器学习算法构建并验证EOPE的预测模型。
基于Python系统,将1040名孕妇的数据资料分为80%的训练集和20%的测试集。分别使用逻辑回归算法、XGBoost算法、随机森林算法、支持向量机算法和人工神经网络算法构建EOPE预测模型,并通过重采样方法对所得模型进行验证。使用准确率、灵敏度、特异度、F1分数和ROC曲线下面积来评估所得模型并筛选最优模型。
孕妇中的EOPE。
二元逻辑回归结果显示,EOPE的影响因素包括六个指标:孕前体重指数、妊娠次数、平均动脉压、吸烟、甲胎蛋白和受孕方式。其中,基于XGBoost算法构建的EOPE预测模型在训练集和测试集中表现最佳,训练集中F1分数为0.554±0.068,AUC为0.963(95%CI:0.943~0.983),测试集中F1分数为0.488±0.082,AUC为0.936(95%CI:0.887~0.983)。
我们基于XGBoost算法构建的EOPE预测模型具有卓越的疾病预测能力,可为预测EOPE的疾病风险提供辅助。