Ou Zijun, Wang Haitao, Zhang Bin, Liang Haobang, Hu Bei, Ren Longlong, Liu Yanjuan, Zhang Yuhu, Dai Chengbo, Wu Hejun, Li Weifeng, Li Xin
School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong Province, China.
Department of Neurology, Guangdong Neuroscience Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, Guangdong Province, China.
Neural Regen Res. 2025 Jan 1;20(1):234-241. doi: 10.4103/1673-5374.393103. Epub 2024 Jan 8.
JOURNAL/nrgr/04.03/01300535-202501000-00031/figure1/v/2024-05-14T021156Z/r/image-tiff Early identification and treatment of stroke can greatly improve patient outcomes and quality of life. Although clinical tests such as the Cincinnati Pre-hospital Stroke Scale (CPSS) and the Face Arm Speech Test (FAST) are commonly used for stroke screening, accurate administration is dependent on specialized training. In this study, we proposed a novel multimodal deep learning approach, based on the FAST, for assessing suspected stroke patients exhibiting symptoms such as limb weakness, facial paresis, and speech disorders in acute settings. We collected a dataset comprising videos and audio recordings of emergency room patients performing designated limb movements, facial expressions, and speech tests based on the FAST. We compared the constructed deep learning model, which was designed to process multi-modal datasets, with six prior models that achieved good action classification performance, including the I3D, SlowFast, X3D, TPN, TimeSformer, and MViT. We found that the findings of our deep learning model had a higher clinical value compared with the other approaches. Moreover, the multi-modal model outperformed its single-module variants, highlighting the benefit of utilizing multiple types of patient data, such as action videos and speech audio. These results indicate that a multi-modal deep learning model combined with the FAST could greatly improve the accuracy and sensitivity of early stroke identification of stroke, thus providing a practical and powerful tool for assessing stroke patients in an emergency clinical setting.
《期刊》/nrgr/04.03/01300535 - 202501000 - 00031/图1/v/2024 - 05 - 14T021156Z/图像 - tiff格式 早期识别和治疗中风可显著改善患者预后和生活质量。尽管诸如辛辛那提院前中风量表(CPSS)和面部 - 手臂 - 言语测试(FAST)等临床测试常用于中风筛查,但准确实施依赖于专业培训。在本研究中,我们基于FAST提出了一种新颖的多模态深度学习方法,用于评估在急性情况下出现肢体无力、面部麻痹和言语障碍等症状的疑似中风患者。我们收集了一个数据集,该数据集包含急诊室患者基于FAST进行指定肢体运动、面部表情和言语测试的视频和音频记录。我们将设计用于处理多模态数据集的构建深度学习模型与六个在动作分类方面取得良好性能的先前模型进行了比较,这六个模型包括I3D、SlowFast、X3D、TPN、TimeSformer和MViT。我们发现,与其他方法相比,我们的深度学习模型的结果具有更高的临床价值。此外,多模态模型优于其单模块变体,突出了利用多种类型患者数据(如动作视频和语音音频)的益处。这些结果表明,结合FAST的多模态深度学习模型可大大提高中风早期识别的准确性和敏感性,从而为急诊临床环境中评估中风患者提供一个实用且强大的工具。