Suppr超能文献

基于电子健康记录的机器学习预测早产模型

Machine Learning-Based Prediction Model of Preterm Birth Using Electronic Health Record.

机构信息

National Human Genetics Resource Center, National Research Institute for Family Planning, Beijing 100081, China.

Graduate School of Peking Union Medical College, Beijing 100730, China.

出版信息

J Healthc Eng. 2022 Apr 13;2022:9635526. doi: 10.1155/2022/9635526. eCollection 2022.

Abstract

OBJECTIVE

Preterm birth (PTB) was one of the leading causes of neonatal death. Predicting PTB in the first trimester and second trimester will help improve pregnancy outcomes. The aim of this study is to propose a prediction model based on machine learning algorithms for PTB.

METHOD

Data for this study were reviewed from 2008 to 2018, and all the participants included were selected from a hospital in China. Six algorisms, including Naive Bayesian (NBM), support vector machine (SVM), random forest tree (RF), artificial neural networks (ANN), K-means, and logistic regression, were used to predict PTB. The receiver operating characteristic curve (ROC), accuracy, sensitivity, and specificity were used to assess the performance of the model.

RESULTS

A total of 9550 pregnant women were included in the study, of which 4775 women had PTB. A total of 4775 people were randomly selected as controls. Based on 27 weeks of gestation, the area under the curve (AUC) and the accuracy of the RF model were the highest compared with other algorithms (accuracy: 0.816; AUC = 0.885, 95% confidence interval (CI): 0.873-0.897). Meanwhile, there was positive association between the accuracy and AUC of the RF model and gestational age. Age, magnesium, fundal height, serum inorganic phosphorus, mean platelet volume, waist size, total cholesterol, triglycerides, globulins, and total bilirubin were the main influence factors of PTB.

CONCLUSION

The results indicated that the prediction model based on the RF algorithm had a potential value to predict preterm birth in the early stage of pregnancy. The important analysis of the RF model suggested that intervention for main factors of PTB in the early stages of pregnancy would reduce the risk of PTB.

摘要

目的

早产(PTB)是新生儿死亡的主要原因之一。预测早孕和中孕期的 PTB 将有助于改善妊娠结局。本研究旨在提出一种基于机器学习算法的 PTB 预测模型。

方法

本研究的数据回顾时间为 2008 年至 2018 年,所有参与者均选自中国的一家医院。使用 6 种算法,包括朴素贝叶斯(NBM)、支持向量机(SVM)、随机森林树(RF)、人工神经网络(ANN)、K-均值和逻辑回归,来预测 PTB。使用接收者操作特征曲线(ROC)、准确率、敏感度和特异度来评估模型的性能。

结果

共有 9550 名孕妇纳入研究,其中 4775 名孕妇发生 PTB。随机选择 4775 名孕妇作为对照组。基于 27 孕周,RF 模型的曲线下面积(AUC)和准确率最高,优于其他算法(准确率:0.816;AUC=0.885,95%置信区间(CI):0.873-0.897)。同时,RF 模型的准确率和 AUC 与孕龄呈正相关。年龄、镁、宫底高度、血清无机磷、平均血小板体积、腰围、总胆固醇、甘油三酯、球蛋白和总胆红素是 PTB 的主要影响因素。

结论

结果表明,基于 RF 算法的预测模型具有预测妊娠早期早产的潜在价值。RF 模型的重要分析表明,对妊娠早期 PTB 的主要因素进行干预,将降低 PTB 的风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e633/9020923/2ada642cb51a/JHE2022-9635526.001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验