Wie Jeong Ha, Lee Se Jin, Choi Sae Kyung, Jo Yun Sung, Hwang Han Sung, Park Mi Hye, Kim Yeon Hee, Shin Jae Eun, Kil Ki Cheol, Kim Su Mi, Choi Bong Suk, Hong Hanul, Seol Hyun-Joo, Won Hye-Sung, Ko Hyun Sun, Na Sunghun
Department of Obstetrics and Gynecology, Eunpyeong St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul 03312, Korea.
Department of Obstetrics and Gynecology, Kangwon National University Hospital, Kangwon National University School of Medicine, Chuncheon 24289, Korea.
Life (Basel). 2022 Apr 18;12(4):604. doi: 10.3390/life12040604.
This study was a multicenter retrospective cohort study of term nulliparous women who underwent labor, and was conducted to develop an automated machine learning model for prediction of emergent cesarean section (CS) before onset of labor. Nine machine learning methods of logistic regression, random forest, Support Vector Machine (SVM), gradient boosting, extreme gradient boosting (XGBoost), light gradient boosting machine (LGBM), k-nearest neighbors (KNN), Voting, and Stacking were applied and compared for prediction of emergent CS during active labor. External validation was performed using a nationwide multicenter dataset for Korean fetal growth. A total of 6549 term nulliparous women was included in the analysis, and the emergent CS rate was 16.1%. The C-statistics values for KNN, Voting, XGBoost, Stacking, gradient boosting, random forest, LGBM, logistic regression, and SVM were 0.6, 0.69, 0.64, 0.59, 0.66, 0.68, 0.68, 0.7, and 0.69, respectively. The logistic regression model showed the best predictive performance with an accuracy of 0.78. The machine learning model identified nine significant variables of maternal age, height, weight at pre-pregnancy, pregnancy-associated hypertension, gestational age, and fetal sonographic findings. The C-statistic value for the logistic regression machine learning model in the external validation set (1391 term nulliparous women) was 0.69, with an overall accuracy of 0.68, a specificity of 0.83, and a sensitivity of 0.41. Machine learning algorithms with clinical and sonographic parameters at near term could be useful tools to predict individual risk of emergent CS during active labor in nulliparous women.
本研究是一项针对足月未产妇分娩情况的多中心回顾性队列研究,旨在开发一种自动化机器学习模型,用于在分娩开始前预测急诊剖宫产(CS)。应用了逻辑回归、随机森林、支持向量机(SVM)、梯度提升、极端梯度提升(XGBoost)、轻量级梯度提升机(LGBM)、k近邻(KNN)、投票法和堆叠法这九种机器学习方法,并对其在活跃分娩期间预测急诊剖宫产的能力进行了比较。使用韩国全国多中心胎儿生长数据集进行了外部验证。分析共纳入6549名足月未产妇,急诊剖宫产率为16.1%。KNN、投票法、XGBoost、堆叠法、梯度提升、随机森林、LGBM、逻辑回归和SVM的C统计量值分别为0.6、0.69、0.64、0.59、0.66、0.68、0.68、0.7和0.69。逻辑回归模型显示出最佳预测性能,准确率为0.78。该机器学习模型确定了产妇年龄、身高、孕前体重、妊娠相关高血压、孕周和胎儿超声检查结果这九个显著变量。外部验证集(1391名足月未产妇)中逻辑回归机器学习模型的C统计量值为0.69,总体准确率为0.68,特异性为0.83,敏感性为0.41。结合近期临床和超声参数的机器学习算法可能是预测未产妇活跃分娩期间急诊剖宫产个体风险的有用工具。