Department of Pure and Applied Sciences (DiSPeA), University of Urbino Carlo Bo, Urbino, Italy.
Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy.
Pediatr Pulmonol. 2024 Dec;59(12):3400-3409. doi: 10.1002/ppul.27216. Epub 2024 Aug 16.
Prematurity is the strongest predictor of bronchopulmonary dysplasia (BPD). Most previous studies investigated additional risk factors by conventional statistics, while the few studies applying artificial intelligence, and specifically machine learning (ML), for this purpose were mainly targeted to the predictive ability of specific interventions. This study aimed to apply ML to identify, among routinely collected data, variables predictive of BPD, and to compare these variables with those identified through conventional statistics.
Very preterm infants were recruited; antenatal, perinatal, and postnatal clinical data were collected. A BPD prediction model was built using conventional statistics, and nine supervised ML algorithms were applied for the same purpose: the results of the best-performing model were described and compared with those of conventional statistics.
Both conventional statistics and ML identified the degree of immaturity (low gestational age and/or birth weight), need for mechanical ventilation, and absent or reversed end diastolic flow (AREDF) in the umbilical arteries as risk factors for BPD. Each of the two approaches also identified additional potentially predictive clinical variables.
ML algorithms might be useful to integrate conventional statistics in identifying novel risk factors, in addition to prematurity, for the development of BPD in very preterm infants. Specifically, the identification of AREDF status as an independent risk factor for BPD by both conventional statistics and ML highlights the opportunity to include detailed antenatal information in clinical predictive models for neonatal diseases.
早产儿是支气管肺发育不良(BPD)的最强预测因子。大多数先前的研究通过常规统计学方法研究了其他危险因素,而少数应用人工智能,特别是机器学习(ML)进行此项研究的主要目的是预测特定干预措施的能力。本研究旨在应用 ML 从常规收集的数据中识别出预测 BPD 的变量,并将这些变量与通过常规统计学方法识别出的变量进行比较。
招募极早产儿;收集产前、围产期和产后的临床数据。使用常规统计学方法建立 BPD 预测模型,并应用九种监督 ML 算法来达到相同的目的:描述表现最佳的模型的结果,并与常规统计学方法的结果进行比较。
常规统计学和 ML 均确定了不成熟程度(低胎龄和/或出生体重)、需要机械通气以及脐动脉中无反向舒张末期血流(AREDF)是 BPD 的危险因素。两种方法都各自确定了其他潜在的预测性临床变量。
ML 算法可能有助于将常规统计学方法整合到识别除早产以外的新型危险因素中,以用于极早产儿 BPD 的发生。具体来说,常规统计学和 ML 均将 AREDF 状态确定为 BPD 的独立危险因素,这突出了在新生儿疾病的临床预测模型中纳入详细的产前信息的机会。