用于神经退行性疾病评估和活动识别的多速变压器网络。

Multi-speed transformer network for neurodegenerative disease assessment and activity recognition.

机构信息

École de Technologie Supérieure, ÉTS, 1100 Notre-Dame St W, Montreal, Quebec H3C 1K3, Canada.

Department of Computer Science, Università Degli Studi di Bari "Aldo Moro", Via Orabona 4, Bari 70125, Italy.

出版信息

Comput Methods Programs Biomed. 2023 Mar;230:107344. doi: 10.1016/j.cmpb.2023.107344. Epub 2023 Jan 9.

Abstract

BACKGROUND AND OBJECTIVE

Neurodegenerative diseases are the most frequent age-related diseases. This type of disease, if not discovered in the initial stage, will compromise the quality of life of the affected subject. Thus, a timely diagnosis is of paramount importance. One of the most used tasks from neurologists to detect and determine the severity of the disease is analysing human gait. This work presents the dataset named "Beside Gait" containing timeseries of coordinates of extracted body joints of people with neurodegenerative diseases in various stages of the disease as well as control subjects. In addition, the novel Multi-Speed transformer technique will be presented and benchmarked against several other techniques making use of deep learning and Shallow Learning. The objective is to recognize subjects affected by some form of neurodegenerative disease in early stage using a computer vision technique making use of deep learning that can be integrated into a smartphone app for offline inference with the aim of promptly initiate investigations and treatment to improve the patient's quality of life.

METHODS

The recorded videos were processed, and the skeleton of the person in the video was extracted using pose estimation. The raw time-series coordinates of the joints extracted by the pose estimation algorithm were tested against novel deep neural network architectures and Shallow Learning techniques. In this work, the proposed Multi-Speed Transformer is benchmarked against other deep neural networks such as Temporal Convolutional Neural Networks, Transformers, as well as Shallow Learning techniques making use of feature extraction and different classifiers such as Random Forests, K Nearest Neighbours, Ada Boost, Linear and RBF SVM. The proposed Multi-Speed Transformer architecture has been developed to learn short and long-term patterns to model the various pathological gaits.

RESULTS

The Multi-Speed Transformer outperformed all other existing models reaching an accuracy of 96.9%, a sensitivity of 96.9%, a precision of 97.7%, and a specificity of 97.1% in binary classification. The accuracy in multi-class classification for detecting the presence of the disease in various stages is 71.6%, the sensitivity is 67.7%, and the specificity is 71.8%. In addition, tests have also been conducted against two other different activity recognition datasets, namely SHREC and JHMDB, in the exact same conditions. Multi-Speed Transformer has demonstrated to beat always all other tested techniques as well as the techniques reviewed in the state-of-the-art with respectively of accuracy 91.8% and 74%. Having those datasets more than two classes, specificity was not computed.

CONCLUSIONS

The Multi-Speed Transformer is a valuable technique for neurodegenerative disease assessment through computer vision. In addition, the novel dataset "Beside Gait" here presented is an important starting point for future research work on automatic recognition of neurodegenerative diseases using gait analysis.

摘要

背景与目的

神经退行性疾病是最常见的与年龄相关的疾病。如果在早期阶段未发现这种疾病,将影响受影响个体的生活质量。因此,及时诊断至关重要。神经科医生用于检测和确定疾病严重程度的最常用任务之一是分析人类步态。这项工作提出了一个名为“Beside Gait”的数据集,其中包含处于疾病不同阶段的神经退行性疾病患者和对照受试者的身体关节提取坐标的时间序列。此外,还将介绍新型多速转换器技术,并与使用深度学习和浅层学习的其他几种技术进行基准测试。目的是使用计算机视觉技术识别早期受某种形式的神经退行性疾病影响的受试者,该技术可以集成到智能手机应用程序中进行离线推理,旨在及时启动调查和治疗,以提高患者的生活质量。

方法

记录的视频经过处理,使用姿势估计从视频中提取人的骨骼。原始关节时间序列坐标由姿势估计算法提取,并与新型深度神经网络架构和浅层学习技术进行了测试。在这项工作中,所提出的多速转换器与其他深度神经网络(如时间卷积神经网络、Transformer)以及利用特征提取和不同分类器(如随机森林、K 最近邻、Ada Boost、线性和 RBF SVM)的浅层学习技术进行了基准测试。所提出的多速转换器架构旨在学习短期和长期模式,以对各种病理步态进行建模。

结果

多速转换器在二进制分类中表现优于所有其他现有模型,达到了 96.9%的准确率、96.9%的灵敏度、97.7%的精度和 97.1%的特异性。在检测各种阶段疾病存在的多类分类中,准确率为 71.6%,灵敏度为 67.7%,特异性为 71.8%。此外,还在完全相同的条件下对另外两个不同的活动识别数据集 SHREC 和 JHMDB 进行了测试。多速转换器始终击败了所有其他测试技术以及技术评论中的最先进技术,准确率分别为 91.8%和 74%。由于这些数据集有两个以上的类别,因此未计算特异性。

结论

多速转换器是通过计算机视觉评估神经退行性疾病的有价值技术。此外,本文提出的新型数据集“Beside Gait”是使用步态分析自动识别神经退行性疾病的未来研究工作的重要起点。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索