Chen Xinyu, Wu Siqing, Chen Xinqing, Hu Linmin, Li Wenjing, Mi Ningning, Xie Peng, Huang Yujun, Yuan Kun, Sui Yajuan, Li Renjie, Wang Kangting, Sun Nan, Yao Yuyang, Xu Zuofeng, Yuan Jinqiu, Zhu Yunxiao
Department of Medical Ultrasonics, The Seventh Affiliated Hospital of Sun Yat-sen University, No.628, Zhenyuan Road, Xinhu Street, Guangming District, Shenzhen 518107, China.
School of Medicine, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China.
Eur J Obstet Gynecol Reprod Biol. 2025 Feb;305:48-55. doi: 10.1016/j.ejogrb.2024.11.022. Epub 2024 Nov 26.
To develop machine learning prediction models for small for gestational age with baseline characteristics and biochemical tests of various pregnancy stages individually and collectively and compare predictive performance.
This retrospective study included singleton pregnancies with infants born between May 2018 and March 2023. Small for gestational age was defined as a birth weight below the 10th percentile according to the Intergrowth-21st fetal growth standards. The pregnancy data were categorized into four datasets at different gestational time points (14 and 28 weeks and admission). The LightGBM framework was utilized to assess the variable importance by employing a five-fold cross-validation. RandomizedSearchCV and sequential feature selection were applied to estimate the optimal number of features. Seven machine learning algorithms were used to develop prediction models, with an 8:2 ratio for training and testing. The model performance was evaluated using receiver operating characteristic curve analysis and sensitivity at a false positive rate of 10 %.
We included data of 4,394 women with singleton pregnancies, including 148 (3.4%) small for gestational age infants. Women delivering small for gestational age infants exhibited significantly shorter stature and lower fundal height and abdominal circumference at admission. Maternal height, age, and pre-pregnancy weight consistently ranked among the top 20 features in prediction models with any dataset. The models incorporated variables of admission stage have strong predictive performance with the area under the curves exceeding 0.8. The prediction model developed with variables of admission stage yielded the best performance, achieving an area under the curve of 0.85 and a sensitivity of 73% at the false positive rate of 10%.
By machine learning, various pregnancy stages' prediction models for small for gestational age showed good predictive performance, and the predictive value of variables at each pregnancy stage was fully explored. The prediction model with the best performance was established with variables of admission stage and emphasized the significance of prenatal physical examinations.
分别及综合利用不同妊娠阶段的基线特征和生化检测结果,开发用于预测小于胎龄儿的机器学习预测模型,并比较预测性能。
这项回顾性研究纳入了2018年5月至2023年3月期间分娩单胎婴儿的孕妇。根据Intergrowth-21st胎儿生长标准,小于胎龄儿定义为出生体重低于第10百分位数。妊娠数据在不同妊娠时间点(14周、28周和入院时)被分类为四个数据集。利用LightGBM框架,通过五折交叉验证评估变量重要性。应用随机搜索交叉验证(RandomizedSearchCV)和顺序特征选择来估计最佳特征数量。使用七种机器学习算法开发预测模型,训练集与测试集的比例为8:2。使用受试者操作特征曲线分析和10%假阳性率下的灵敏度评估模型性能。
我们纳入了4394名单胎妊娠女性的数据,其中包括148名(3.4%)小于胎龄儿。分娩小于胎龄儿的女性在入院时身高显著更矮,宫高和腹围更低。在任何数据集中,母亲身高、年龄和孕前体重始终位列预测模型中前20个特征。纳入入院阶段变量的模型具有较强的预测性能,曲线下面积超过0.8。使用入院阶段变量开发的预测模型性能最佳,在10%假阳性率下曲线下面积达到0.85,灵敏度为73%。
通过机器学习,针对小于胎龄儿的不同妊娠阶段预测模型显示出良好的预测性能,且充分探索了各妊娠阶段变量的预测价值。利用入院阶段变量建立了性能最佳的预测模型,强调了产前体格检查的重要性。