Suppr超能文献

多产妇早产的逻辑回归和机器学习预测方法。

Prediction of preterm birth in multiparous women using logistic regression and machine learning approaches.

机构信息

Clinical Research Unit (CRU), CHEO Research Institute, University of Ottawa, Ottawa, Canada.

Department of Obstetrics and Gynecology, McMaster University, 1280 Main Street W, Hamilton, ON, L8S 4L8, Canada.

出版信息

Sci Rep. 2024 Sep 20;14(1):21967. doi: 10.1038/s41598-024-60097-4.

Abstract

To predict preterm birth (PTB) in multiparous women, comparing machine learning approaches with traditional logistic regression. A population-based cohort study was conducted using data from the Ontario Better Outcomes Registry and Network (BORN). The cohort included all multiparous women who delivered a singleton birth at 20-42 weeks' gestation in an Ontario hospital between April 1, 2012 and March 31, 2014. The primary outcome was PTB < 37 weeks, with spontaneous PTB analyzed as a secondary outcome. Stepwise logistic regression and the Boruta machine learning were used to select the important variables during the first and second trimester. For building prediction models, the whole data set were divided for the two independent parts: two-third for training the classifiers (Logistic regression, random forests, decision trees, and artificial neural networks) and one-third for model validation. Then, the training data set were balanced by random over sampling technique. The best hyper parameters were obtained by the tenfold cross validation. The performance of all models was evaluated by sensitivity, specificity, positive predictive value, negative predictive value, and the area under the receiver operating characteristics (AUC). The cohort included 145,846 births, of which 8125 (5.57%) were preterm. In first-trimester models, the strongest predictors of PTB were previous PTB, preexisting diabetes, and abnormal pregnancy-associated plasma protein-A. In the testing data set, the highest predictive ability was seen for artificial neural networks, with an area under the receiver operating characteristic curve (AUC) of 68.8% (95% CI 67.6-70.1%). In second-trimester models, addition of infant sex, attendance at first-trimester appointment, medication exposure, and abnormal alpha-fetoprotein concentrations increased the AUC to 72.1% (95% CI 71.1-73.1%) with logistic regression. With the inclusion of the variable complications during pregnancy, the AUC increased to 80.5% (95% CI 79.6-81.5%) using logistic regression. For both overall and spontaneous PTB, during both the first and second trimesters, models yielded negative predictive values of 97%. Overall, machine learning and logistic regression produced similar performance for prediction of PTB. For overall and spontaneous PTB, both first- and second-trimester models provided negative predictive values of ~ 97%, higher than that of fetal fibronectin.

摘要

为了预测多产妇的早产(PTB),将机器学习方法与传统的逻辑回归进行比较。这项基于人群的队列研究使用了安大略省出生登记和网络(BORN)的数据。该队列包括所有在 2012 年 4 月 1 日至 2014 年 3 月 31 日期间在安大略省医院妊娠 20-42 周分娩的单胎多产妇。主要结局为妊娠 37 周前早产,自发性早产为次要结局。逐步逻辑回归和 Boruta 机器学习用于在第一和第二孕期选择重要变量。为了构建预测模型,整个数据集被分为两个独立的部分:三分之二用于训练分类器(逻辑回归、随机森林、决策树和人工神经网络),三分之一用于模型验证。然后,通过随机过采样技术平衡训练数据集。通过十折交叉验证获得最佳超参数。通过灵敏度、特异性、阳性预测值、阴性预测值和接收器工作特征曲线下的面积(AUC)来评估所有模型的性能。该队列包括 145846 例分娩,其中 8125 例(5.57%)为早产。在第一孕期模型中,早产的最强预测因素是既往早产、原有糖尿病和异常妊娠相关血浆蛋白 A。在测试数据集,人工神经网络的预测能力最高,接收器工作特征曲线下的面积(AUC)为 68.8%(95%CI 67.6-70.1%)。在第二孕期模型中,加入婴儿性别、第一次孕期预约、药物暴露和异常甲胎蛋白浓度后,逻辑回归的 AUC 增加到 72.1%(95%CI 71.1-73.1%)。在纳入妊娠期间并发症的变量后,逻辑回归的 AUC 增加到 80.5%(95%CI 79.6-81.5%)。对于整体和自发性早产,在第一和第二孕期,模型的阴性预测值均为 97%。总的来说,机器学习和逻辑回归在预测早产方面表现相似。对于整体和自发性早产,第一和第二孕期模型的阴性预测值均在 97%左右,高于胎儿纤连蛋白。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed7c/11415355/c497204b9d6e/41598_2024_60097_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验