Liu Zheng, Han Na, Su Tao, Ji Yuelong, Bao Heling, Zhou Shuang, Luo Shusheng, Wang Hui, Liu Jue, Wang Hai-Jun
Department of Maternal and Child Health, School of Public Health, Peking University, National Health Commission Key Laboratory of Reproductive Health, Beijing, China.
Tongzhou Maternal and Child Health Care Hospital of Beijing, Beijing, China.
Front Pediatr. 2022 Nov 11;10:899954. doi: 10.3389/fped.2022.899954. eCollection 2022.
Predicting birth weight and identifying its risk factors are clinically important. This study aims to use interpretable machine learning to predict birth weight and identity important predictors.
This prospective cohort study was conducted in Tongzhou Maternal and Child Health Care Hospital of Beijing, China, recruiting pregnant women between June 2018 and February 2019. We used 24 features to predict infant birth weight, including gestational age, mother's age, parity, history of macrosomia delivery, pre-pregnancy body mass index (BMI), height, father's BMI, lifestyle (diet, physical activity, smoking), and biomarker (fasting glucose and lipids) features. Study outcome was birth weight of infant. We used 8 supervised learning models including 4 individual [linear regression, ridge regression, lasso regression, support vector machines regression (SVR)], and 4 ensemble estimators (random forest, AdaBoost, gradient boosted trees, and voting ensemble for regression) to predict birth weight. Model accuracy was measured by root mean squared error (RMSE) of 10-fold cross validation on the training set and RMSE of prediction on the test set. We used permutation importance algorithm to understand the prediction from the models and what affected them.
This study included 4,754 mother-child dyads. RMSEs were lower in voting ensemble for regression, linear regression, and SVR than random forest, AdaBoost, and gradient boosted tree. The 5 most important predictors for infant birth weight were gestational age, fetal sex, preterm birth, mother's height, and pre-pregnancy BMI. After adding ultrasound-measured indicators of fetal growth into predictors, mother's height and pre-pregnancy BMI remained the most important predictors in predicting the outcome.
Mother's height and pre-pregnancy BMI were identified as important predictors for infant birth weight. Interpretable machine learning is a promising tool in the prediction of birth weight.
预测出生体重并识别其风险因素在临床上具有重要意义。本研究旨在使用可解释的机器学习来预测出生体重并确定重要的预测因素。
本前瞻性队列研究在中国北京通州区妇幼保健院进行,招募了2018年6月至2019年2月期间的孕妇。我们使用24个特征来预测婴儿出生体重,包括孕周、母亲年龄、产次、巨大儿分娩史、孕前体重指数(BMI)、身高、父亲BMI、生活方式(饮食、体育活动、吸烟)以及生物标志物(空腹血糖和血脂)特征。研究结局为婴儿出生体重。我们使用8种监督学习模型,包括4种单一模型[线性回归、岭回归、套索回归、支持向量机回归(SVR)]以及4种集成估计器(随机森林、AdaBoost、梯度提升树和回归投票集成)来预测出生体重。通过训练集上10折交叉验证的均方根误差(RMSE)以及测试集上预测的RMSE来衡量模型准确性。我们使用排列重要性算法来理解模型的预测以及影响预测的因素。
本研究纳入了4754对母婴。回归投票集成、线性回归和SVR的RMSE低于随机森林、AdaBoost和梯度提升树。婴儿出生体重的5个最重要预测因素为孕周、胎儿性别、早产、母亲身高和孕前BMI。在将超声测量的胎儿生长指标添加到预测因素中后,母亲身高和孕前BMI仍然是预测结局的最重要预测因素。
母亲身高和孕前BMI被确定为婴儿出生体重的重要预测因素。可解释的机器学习是预测出生体重的一种有前景的工具。