通过合成在不受控制环境中生成的视频的能量骨架图像对帕金森病进行严重程度分类。

Severity Classification of Parkinson's Disease via Synthesis of Energy Skeleton Images from Videos Produced in Uncontrolled Environments.

作者信息

Ben Hadj-Alouane Nejib, Dhoot Arav, Turki-Hadj Alouane Monia, Pangracious Vinod

机构信息

Electrical and Computer Engineering Department, American University in Dubai, Dubai P.O. Box 28282, United Arab Emirates.

Columbia College, Columbia University, New York, NY 10027, USA.

出版信息

Diagnostics (Basel). 2024 Nov 28;14(23):2685. doi: 10.3390/diagnostics14232685.

DOI:10.3390/diagnostics14232685

PMID:39682593

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11640083/

Abstract

Parkinson's Disease is a prevalent neurodegenerative disorder affecting millions worldwide, primarily marked by motor and non-motor symptoms due to the degeneration of dopamine-producing neurons. Despite the absence of a cure, current treatments focus on symptom management, often relying on pharmacotherapy and surgical interventions. Early diagnosis remains a critical challenge, particularly in underserved areas, as existing diagnostic protocols lack standardization and accessibility. This paper proposes a novel framework for the diagnosis and severity classification of PD using video data captured in uncontrolled environments. Leveraging deep learning techniques, our approach synthesizes Skeleton Energy Images (SEIs) from gait sequences and employs three advanced models-a Convolutional Neural Network (CNN), a Residual Network (ResNet), and a Vision Transformer (ViT)-to analyze these images. Our methodology allows for the accurate detection of PD and differentiation of its severity without requiring specialized equipment or professional oversight. The dataset used consists of labeled videos capturing the early stages of the disease, facilitating the potential for timely intervention. The four models performed very accurately during the training phase. In fact, an accuracy higher than 99% was achieved by the ViT and ResNet models. Moreover, a lesser accuracy of 90% was achieved by the CNN five-layer model. During the test phase, only the best-performing models from the training experiments were tested. The ResNet-18 model has achieved a 100% accuracy. However, the ViT and the CNN five-layer models have achieved, respectively, 99.96% and 96.40% test accuracy. The results demonstrate high accuracy, highlighting the framework's capabilities, and in particular the effectiveness of the workflow used for generating the SEI images. Given the nature of the dataset used, the proposed framework stands to function as a cost-effective and accessible tool for early PD detection in various healthcare settings. This study contributes to the advancement of mobile health technologies, aiming to enhance early diagnosis and monitoring of Parkinson's Disease.

摘要

帕金森病是一种普遍的神经退行性疾病，影响着全球数百万人，主要表现为由于产生多巴胺的神经元退化而导致的运动和非运动症状。尽管无法治愈，但目前的治疗重点是症状管理，通常依赖药物治疗和手术干预。早期诊断仍然是一项严峻挑战，尤其是在医疗服务不足的地区，因为现有的诊断方案缺乏标准化且难以获得。本文提出了一种新颖的框架，用于使用在不受控制的环境中捕获的视频数据对帕金森病进行诊断和严重程度分类。利用深度学习技术，我们的方法从步态序列中合成骨骼能量图像（SEIs），并采用三种先进模型——卷积神经网络（CNN）、残差网络（ResNet）和视觉Transformer（ViT）——来分析这些图像。我们的方法无需专门设备或专业监督，就能准确检测帕金森病并区分其严重程度。所使用的数据集由捕捉疾病早期阶段的标记视频组成，便于及时干预。这四种模型在训练阶段表现得非常准确。事实上，ViT和ResNet模型的准确率高于99%。此外，CNN五层模型的准确率较低，为90%。在测试阶段，仅对训练实验中表现最佳的模型进行了测试。ResNet-18模型的准确率达到了100%。然而，ViT和CNN五层模型的测试准确率分别为99.96%和96.40%。结果显示出高准确率，突出了该框架的能力，特别是用于生成SEI图像的工作流程的有效性。鉴于所使用数据集的性质，所提出的框架有望成为各种医疗环境中早期帕金森病检测的经济高效且易于使用的工具。这项研究有助于推动移动健康技术的发展，旨在加强帕金森病的早期诊断和监测。