Nejsum Freja Marie, Wiingreen Rikke, Jensen Andreas Kryger, Løkkegaard Ellen Christine Leth, Mølholm Hansen Bo
Department of Pediatrics, Copenhagen University Hospital-North Zealand, Hillerød, Denmark.
Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
PLoS One. 2025 Jan 9;20(1):e0312238. doi: 10.1371/journal.pone.0312238. eCollection 2025.
Identification of mother-infant pairs predisposed to early cessation of exclusive breastfeeding is important for delivering targeted support. Machine learning techniques enable development of transparent prediction models that enhance clinical applicability. We aimed to develop and validate two models to predict cessation of exclusive breastfeeding within one month among infants born after 35 weeks gestation using machine learning techniques.
Utilizing a nationwide dataset from Statistics Denmark, including infants born between the 1st of January 2014 and the 31st of December 2015, we employed random forest machine learning to develop two predictive models. The first model included 11 well-established factors associated with cessation of exclusive breastfeeding within one month. The second model was expanded to include 21 additional factors associated with complications during pregnancy and delivery that potentially impede breastfeeding. Feature importance was applied to elucidate the factors driving model predictions.
The dataset comprised 110,206 infants and 106,835 mothers. The first model predicted cessation of exclusive breastfeeding within one month with an area under the receiver operating curve of 62.0% (95% confidence interval 61.3% - 62.7%) and an accuracy of 60.4% (95% confidence interval 59.8% - 61.0%). The second model predicted cessation of exclusive breastfeeding within one month with an area under the receiver operating curve of 62.2% (95% confidence interval 61.5% - 62.9%) and an accuracy of 60.0% (95% confidence interval 59.3% - 60.6%). In both models, birthplace, maternal education, delivery mode, and maternal body mass index were the most important factors influencing the overall model performance.
The two models could not accurately predict cessation of exclusive breastfeeding within one month among infants born after 35 weeks gestation. Contrary to our expectations, including additional factors in the model did not increase model performance.
识别易出现纯母乳喂养早期中断的母婴对对于提供有针对性的支持很重要。机器学习技术有助于开发具有临床适用性的透明预测模型。我们旨在使用机器学习技术开发并验证两个模型,以预测妊娠35周后出生的婴儿在出生后一个月内纯母乳喂养的中断情况。
利用丹麦统计局的全国性数据集,该数据集包括2014年1月1日至2015年12月31日期间出生的婴儿,我们采用随机森林机器学习方法开发了两个预测模型。第一个模型纳入了11个与出生后一个月内纯母乳喂养中断相关的公认因素。第二个模型进行了扩展,纳入了另外21个与妊娠和分娩期间可能妨碍母乳喂养的并发症相关的因素。应用特征重要性来阐明驱动模型预测的因素。
该数据集包含110,206名婴儿和106,835名母亲。第一个模型预测出生后一个月内纯母乳喂养中断情况的受试者工作特征曲线下面积为62.0%(95%置信区间61.3% - 62.7%),准确率为60.4%(95%置信区间59.8% - 61.0%)。第二个模型预测出生后一个月内纯母乳喂养中断情况的受试者工作特征曲线下面积为62.2%(95%置信区间61.5% - 62.9%),准确率为60.0%(95%置信区间59.3% - 60.6%)。在两个模型中,出生地、母亲教育程度、分娩方式和母亲体重指数都是影响整体模型性能的最重要因素。
这两个模型无法准确预测妊娠35周后出生的婴儿在出生后一个月内纯母乳喂养的中断情况。与我们的预期相反,在模型中纳入额外因素并未提高模型性能。