From the Division of Pediatric Neurology (R.S., L.C.), Department of Pediatrics, University of Alberta; Alberta Children's Hospital Research Institute and Department of Clinical Neurosciences (K.A.); Department of Clinical Neurosciences (N.D.F.); Department of Pediatrics and Clinical Neurosciences (M.D.), University of Calgary, Alberta; Departments of Pediatrics and Neurology/Neurosurgery (M.I.S., M.O.), McGill University, Montreal, Quebec, Canada; Newcastle upon Tyne Hospitals (A.P.B.), NHS Foundation Trust, Newcastle upon Tyne, United Kingdom; Department of Neurology (M.J.R.), Boston Children's Hospital and Department of Neurology, Harvard Medical School, Boston, MA; Department of Neonatology (E.S.), Soroka University Medical Center and Faculty of Health sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel; Department of Neonatology (L.S.V.), University Medical Center Utrecht, The Netherlands; Departments of Pediatrics and Community Health Sciences (D.D.), Owerko Centre at the Alberta Children's Hospital Research Institute, Hotchkiss Brain Institute, Cummings School of Medicine; Faculty of Nursing and Cumming School of Medicine (N.L.), Departments of Pediatrics, Psychiatry and Community Health Sciences; Alberta Children's Hospital Research Institute and Department of Clinical Neurosciences (P.M.); Departments of Clinical Neurosciences (M.D.H.), Community Health Sciences, Medicine and Radiology, Hotchkiss Brain Institute and Department of Pediatrics (A.K.), Cumming School of Medicine, University of Calgary, Alberta, Canada.
Neurology. 2024 Jun;102(11):e209393. doi: 10.1212/WNL.0000000000209393. Epub 2024 May 15.
Perinatal arterial ischemic stroke (PAIS) is a focal vascular brain injury presumed to occur between the fetal period and the first 28 days of life. It is the leading cause of hemiparetic cerebral palsy. Multiple maternal, intrapartum, delivery, and fetal factors have been associated with PAIS, but studies are limited by modest sample sizes and complex interactions between factors. Machine learning approaches use large and complex data sets to enable unbiased identification of clinical predictors but have not yet been applied to PAIS. We combined large PAIS data sets and used machine learning methods to identify clinical PAIS factors and compare this data-driven approach with previously described literature-driven clinical prediction models.
Common data elements from 3 registries with patients with PAIS, the Alberta Perinatal Stroke Project, Canadian Cerebral Palsy Registry, International Pediatric Stroke Study, and a longitudinal cohort of healthy controls (Alberta Pregnancy Outcomes and Nutrition Study), were used to identify potential predictors of PAIS. Inclusion criteria were term birth and idiopathic PAIS (absence of primary causative medical condition). Data including maternal/pregnancy, intrapartum, and neonatal factors were collected between January 2003 and March 2020. Common data elements were entered into a validated random forest machine learning pipeline to identify the highest predictive features and develop a predictive model. Univariable analyses were completed post hoc to assess the relationship between each predictor and outcome.
A machine learning model was developed using data from 2,571 neonates, including 527 cases (20%) and 2,044 controls (80%). With a mean of 21 features selected, the random forest machine learning approach predicted the outcome with approximately 86.5% balanced accuracy. Factors that were selected a priori through literature-driven variable selection that were also identified as most important by the machine learning model were maternal age, recreational substance exposure, tobacco exposure, intrapartum maternal fever, and low Apgar score at 5 minutes. Additional variables identified through machine learning included in utero alcohol exposure, infertility, miscarriage, primigravida, meconium, spontaneous vaginal delivery, neonatal head circumference, and 1-minute Apgar score. Overall, the machine learning model performed better (area under the curve [AUC] 0.93) than the literature-driven model (AUC 0.73).
Machine learning may be an alternative, unbiased method to identify clinical predictors associated with PAIS. Identification of previously suggested and novel clinical factors requires cautious interpretation but supports the multifactorial nature of PAIS pathophysiology. Our results suggest that identification of neonates at risk of PAIS is possible.
围产期动脉缺血性卒中(PAIS)是一种局灶性血管性脑损伤,据推测发生在胎儿期和生命的头 28 天之间。它是导致偏瘫性脑瘫的主要原因。多种母体、分娩期、分娩和胎儿因素与 PAIS 相关,但研究受到样本量小和因素之间复杂相互作用的限制。机器学习方法使用大型和复杂的数据集来实现临床预测因子的无偏识别,但尚未应用于 PAIS。我们结合了大型 PAIS 数据集,并使用机器学习方法来识别临床 PAIS 因素,并将这种数据驱动的方法与先前描述的文献驱动的临床预测模型进行比较。
从 Alberta Perinatal Stroke Project、Canadian Cerebral Palsy Registry、International Pediatric Stroke Study 三个注册中心的患者和一个健康对照组(Alberta Pregnancy Outcomes and Nutrition Study)的大型数据集,使用常见的数据元素来识别 PAIS 的潜在预测因子。纳入标准为足月出生和特发性 PAIS(无原发性病因)。数据包括母体/妊娠、分娩期和新生儿因素,收集时间为 2003 年 1 月至 2020 年 3 月。常见数据元素被输入到经过验证的随机森林机器学习管道中,以识别最高预测特征并开发预测模型。事后进行单变量分析,以评估每个预测因子与结局之间的关系。
使用 2571 名新生儿的数据开发了一个机器学习模型,其中包括 527 例病例(20%)和 2044 名对照(80%)。随机森林机器学习方法选择了约 21 个特征,对结果的预测准确率约为 86.5%。通过文献驱动的变量选择预先确定的因素,以及通过机器学习模型确定的最重要因素,包括母亲年龄、娱乐性物质暴露、烟草暴露、分娩期母亲发热和 5 分钟时 Apgar 评分低。通过机器学习确定的其他变量包括宫内酒精暴露、不孕、流产、初产妇、胎粪、自然阴道分娩、新生儿头围和 1 分钟 Apgar 评分。总的来说,机器学习模型的表现优于文献驱动的模型(曲线下面积[AUC] 0.93 比 0.73)。
机器学习可能是一种替代的、无偏的方法,可以识别与 PAIS 相关的临床预测因子。对先前提出的和新的临床因素的识别需要谨慎解释,但支持 PAIS 病理生理学的多因素性质。我们的结果表明,识别有发生 PAIS 风险的新生儿是可能的。