Skaramagkas Vasileios, Boura Iro, Karamanis Georgios, Kyprakis Ioannis, Fotiadis Dimitrios I, Kefalopoulou Zinovia, Spanaki Cleanthe, Tsiknakis Manolis
Electrical and Computer Engineering, Hellenic Mediterranean University, Estavromenos, Heraklion, GR-710 04, Greece.
Institute of Computer Science, Foundation for Research and Technology Hellas (FORTH), 100 Nikolaou Plastira, Heraklion, GR-700 03, Greece.
NPJ Digit Med. 2025 Apr 26;8(1):226. doi: 10.1038/s41746-025-01630-1.
Hypomimia is a prominent, levodopa-responsive symptom in Parkinson's disease (PD). In our study, we aimed to distinguish ON and OFF dopaminergic medication state in a cohort of PD patients, analyzing their facial videos with a unique, interpretable Dual Stream Transformer model. Our approach integrated two streams of data: facial frame features and optical flow, processed through a transformer-based architecture. Various configurations of embedding dimensions, dense layer sizes, and attention heads were examined to enhance model performance. The final model, trained on 183 PD patients, attained an accuracy of 86% in differentiating between ON- and OFF-medication state. Moreover, uniform classification performance (up to 88%) was obtained across various stages of PD severity, as expressed by the Hoehn and Yahr (H&Y) scale. These values highlight the potential of our model as a non-invasive, cost-effective instrument for clinicians to remotely and accurately detect patients' response to treatment from early to more advanced PD stages.
面部表情减少是帕金森病(PD)中一种突出的、对左旋多巴有反应的症状。在我们的研究中,我们旨在区分一组PD患者的多巴胺能药物开启和关闭状态,使用一种独特的、可解释的双流Transformer模型分析他们的面部视频。我们的方法整合了两种数据流:面部帧特征和光流,通过基于Transformer的架构进行处理。研究了嵌入维度、密集层大小和注意力头的各种配置以提高模型性能。最终模型在183名PD患者上进行训练,在区分药物开启和关闭状态时达到了86%的准确率。此外,正如Hoehn和Yahr(H&Y)量表所表示的,在PD严重程度的各个阶段都获得了一致的分类性能(高达88%)。这些数值凸显了我们的模型作为一种非侵入性、经济高效的工具的潜力,可供临床医生从早期到更晚期的PD阶段远程且准确地检测患者对治疗的反应。