Department of Pediatrics, University of Texas Health San Antonio, San Antonio, Texas, USA.
Department of Pediatrics, University of Texas Health San Antonio, San Antonio, Texas, USA,
Neonatology. 2021;118(4):394-405. doi: 10.1159/000516891. Epub 2021 Jul 14.
Approximately 7,000 newborns die every day, accounting for almost half of child deaths under 5 years of age. Deciphering which neonates are at increased risk for mortality can have an important global impact. As such, integrating high computational technology (e.g., artificial intelligence [AI]) may help identify the early and potentially modifiable predictors of neonatal mortality. Therefore, the objective of this study was to collate, critically appraise, and analyze neonatal prediction studies that included AI.
A literature search was performed in PubMed, Cochrane, OVID, and Google Scholar. We included studies that used AI (e.g., machine learning (ML) and deep learning) to formulate prediction models for neonatal death. We excluded small studies (n < 500 individuals) and studies using only antenatal factors to predict mortality. Two independent investigators screened all articles for inclusion. The data collection consisted of study design, number of models, features used per model, feature importance, internal and/or external validation, and calibration analysis. Our primary outcome was the average area under the receiving characteristic curve (AUC) or sensitivity and specificity for all models included in each study.
Of 434 articles, 11 studies were included. The total number of participants was 1.26 M with gestational ages ranging from 22 weeks to term. Number of features ranged from 3 to 66 with timing of prediction as early as 5 min of life to a maximum of 7 days of age. The average number of models per study was 4, with neural network, random forest, and logistic regression comprising the most used models (58.3%). Five studies (45.5%) reported calibration plots and 2 (18.2%) conducted external validation. Eight studies reported results by AUC and 5 studies reported the sensitivity and specificity. The AUC varied from 58.3% to 97.0%. The mean sensitivities ranged from 63% to 80% and specificities from 78% to 99%. The best overall model was linear discriminant analysis, but it also had a high number of features (n = 17).
DISCUSSION/CONCLUSION: ML models can accurately predict death in neonates. This analysis demonstrates the most commonly used predictors and metrics for AI prediction models for neonatal mortality. Future studies should focus on external validation, calibration, as well as deployment of applications that can be readily accessible to health-care providers.
每天约有 7000 名新生儿死亡,占 5 岁以下儿童死亡人数的近一半。了解哪些新生儿死亡风险较高可能会对全球产生重要影响。因此,整合高计算技术(例如人工智能[AI])可能有助于识别新生儿死亡的早期和潜在可改变的预测因素。因此,本研究的目的是整理、批判性评价和分析包含 AI 的新生儿预测研究。
在 PubMed、Cochrane、OVID 和 Google Scholar 中进行文献检索。我们纳入了使用 AI(例如机器学习(ML)和深度学习)制定新生儿死亡预测模型的研究。我们排除了小样本量(n<500 人)和仅使用产前因素预测死亡率的研究。两名独立的研究者筛选所有纳入的文章。数据收集包括研究设计、模型数量、每个模型使用的特征、特征重要性、内部和/或外部验证以及校准分析。我们的主要结果是每个研究中所有模型的平均接收特征曲线(AUC)或灵敏度和特异性。
在 434 篇文章中,有 11 篇研究被纳入。参与者总数为 126 万人,胎龄从 22 周至足月不等。特征数量从 3 到 66 不等,预测时间最早为出生后 5 分钟,最长为 7 天。每个研究的平均模型数量为 4 个,最常用的模型是神经网络、随机森林和逻辑回归(58.3%)。5 项研究(45.5%)报告了校准图,2 项研究(18.2%)进行了外部验证。8 项研究报告了 AUC 结果,5 项研究报告了灵敏度和特异性。AUC 从 58.3%到 97.0%不等。平均敏感度范围为 63%至 80%,特异性为 78%至 99%。整体最佳模型是线性判别分析,但它也有大量特征(n=17)。
讨论/结论:ML 模型可以准确预测新生儿死亡。本分析展示了用于新生儿死亡率 AI 预测模型的最常用预测因素和指标。未来的研究应侧重于外部验证、校准以及开发可方便地供医疗保健提供者使用的应用程序。