Department of Pediatrics and Adolescent Medicine, Division of Neonatology, Pediatric Intensive Care and Neuropediatrics, Comprehensive Center for Pediatrics, Medical University of Vienna, Waehringer Guertel 18-20, 1090, Vienna, Austria.
Department of Neonatology and Pediatric Intensive Care Medicine, University Children's Hospital, University Medical Center Hamburg Eppendorf, Hamburg, Germany.
Sci Rep. 2024 Sep 2;14(1):20374. doi: 10.1038/s41598-024-71278-6.
Assessing pain in newborns in the NICU is crucial due to their frequent exposure to painful stimuli, yet it's challenging due to the subjective nature of current methods. This study aimed to evaluate the effectiveness of an AI system designed for automatic facial recognition by comparing its performance with the expert opinion of health care provider. This is a secondary analysis from an eye-tracking study, assessing neonatal pain evaluations by healthcare professionals. The performance of AI software, FaceReader 9, was compared to experts' evaluations using a visual-analog scale, focusing on identifying specific facial action units associated with different pain levels. The study found significant differences in AI-generated metrics-arousal and valence-across three stimulus types: non-noxious thermal, short-noxious, and prolonged-noxious, with p-values below 0.001. A strong correlation (r = 0.84, p ≤ .001) was observed between AI metrics and expert ratings. Eleven facial action units were identified as relevant to describe neonatal pain. The findings highlight the AI system's potential in accurately detecting and analyzing newborn facial expressions in response to varying pain intensities, demonstrating a significant correlation with healthcare professionals' assessments. This suggests that AI technology could enhance objective pain assessment in neonates.
由于新生儿在 NICU 中经常接触到疼痛刺激,因此评估他们的疼痛至关重要,但由于目前方法的主观性,这具有挑战性。本研究旨在通过比较人工智能系统与医疗保健提供者的专家意见,评估专为自动面部识别设计的人工智能系统的有效性。这是一项眼动研究的二次分析,评估医疗保健专业人员对新生儿疼痛的评估。使用视觉模拟量表比较人工智能软件 FaceReader 9 的表现与专家评估,重点是识别与不同疼痛水平相关的特定面部动作单元。研究发现,在三种刺激类型(非伤害性热、短伤害性和长伤害性)中,AI 生成的唤醒度和效价等指标存在显著差异,p 值均低于 0.001。AI 指标与专家评分之间存在很强的相关性(r=0.84,p≤0.001)。确定了 11 个面部动作单元与描述新生儿疼痛相关。研究结果突出了人工智能系统在准确检测和分析新生儿对不同疼痛强度的面部表情反应方面的潜力,与医疗保健专业人员的评估具有显著相关性。这表明人工智能技术可以增强对新生儿的客观疼痛评估。