Yang Jiacheng, Williams Stefan, Hogg David C, Alty Jane E, Relton Samuel D
School of Computing, University of Leeds, UK.
Leeds Institute of Health Sciences, University of Leeds, UK; Leeds Teaching Hospitals NHS Trust, UK.
J Neurol Sci. 2024 Aug 15;463:123089. doi: 10.1016/j.jns.2024.123089. Epub 2024 Jun 10.
The core clinical sign of Parkinson's disease (PD) is bradykinesia, for which a standard test is finger tapping: the clinician observes a person repetitively tap finger and thumb together. That requires an expert eye, a scarce resource, and even experts show variability and inaccuracy. Existing applications of technology to finger tapping reduce the tapping signal to one-dimensional measures, with researcher-defined features derived from those measures.
(1) To apply a deep learning neural network directly to video of finger tapping, without human-defined measures/features, and determine classification accuracy for idiopathic PD versus controls. (2) To visualise the features learned by the model.
152 smartphone videos of 10s finger tapping were collected from 40 people with PD and 37 controls. We down-sampled pixel dimensions and videos were split into 1 s clips. A 3D convolutional neural network was trained on these clips.
For discriminating PD from controls, our model showed training accuracy 0.91, and test accuracy 0.69, with test precision 0.73, test recall 0.76 and test AUROC 0.76. We also report class activation maps for the five most predictive features. These show the spatial and temporal sections of video upon which the network focuses attention to make a prediction, including an apparent dropping thumb movement distinct for the PD group.
A deep learning neural network can be applied directly to standard video of finger tapping, to distinguish PD from controls, without a requirement to extract a one-dimensional signal from the video, or pre-define tapping features.
帕金森病(PD)的核心临床症状是运动迟缓,对此的标准测试是手指敲击:临床医生观察一个人反复将手指和拇指一起敲击。这需要专业的眼光,而这是一种稀缺资源,甚至专家也会出现差异和不准确的情况。现有的手指敲击技术应用将敲击信号简化为一维测量,并从这些测量中得出研究人员定义的特征。
(1)将深度学习神经网络直接应用于手指敲击视频,无需人工定义的测量/特征,并确定特发性帕金森病与对照组的分类准确率。(2)可视化模型学习到的特征。
从40名帕金森病患者和37名对照组中收集了152个10秒手指敲击的智能手机视频。我们对像素维度进行了下采样,并将视频分割成1秒的片段。在这些片段上训练了一个三维卷积神经网络。
为了区分帕金森病患者和对照组,我们的模型显示训练准确率为0.91,测试准确率为0.69,测试精确率为0.73,测试召回率为0.76,测试曲线下面积为0.76。我们还报告了五个最具预测性特征的类别激活图。这些图显示了网络在进行预测时关注的视频的空间和时间部分,包括帕金森病组明显的拇指下降运动。
深度学习神经网络可以直接应用于手指敲击的标准视频,以区分帕金森病患者和对照组,而无需从视频中提取一维信号或预先定义敲击特征。