School of Computer Science, Electrical and Electronic Engineering, and Engineering Maths, University of Bristol, Bristol BS8 1UB, UK.
Translational Health Sciences, University of Bristol Medical School, Bristol BS8 1UD, UK.
Sensors (Basel). 2021 Jun 16;21(12):4133. doi: 10.3390/s21124133.
Parkinson's disease (PD) is a chronic neurodegenerative condition that affects a patient's everyday life. Authors have proposed that a machine learning and sensor-based approach that continuously monitors patients in naturalistic settings can provide constant evaluation of PD and objectively analyse its progression. In this paper, we make progress toward such PD evaluation by presenting a multimodal deep learning approach for discriminating between people with PD and without PD. Specifically, our proposed architecture, named MCPD-Net, uses two data modalities, acquired from vision and accelerometer sensors in a home environment to train variational autoencoder (VAE) models. These are modality-specific VAEs that predict effective representations of human movements to be fused and given to a classification module. During our end-to-end training, we minimise the difference between the latent spaces corresponding to the two data modalities. This makes our method capable of dealing with missing modalities during inference. We show that our proposed multimodal method outperforms unimodal and other multimodal approaches by an average increase in F1-score of 0.25 and 0.09, respectively, on a data set with real patients. We also show that our method still outperforms other approaches by an average increase in F1-score of 0.17 when a modality is missing during inference, demonstrating the benefit of training on multiple modalities.
帕金森病(PD)是一种慢性神经退行性疾病,影响患者的日常生活。作者提出,一种基于机器学习和传感器的方法,可以持续监测自然环境中的患者,从而对 PD 进行持续评估,并客观分析其进展。在本文中,我们通过提出一种用于区分 PD 患者和非 PD 患者的多模态深度学习方法,在 PD 评估方面取得了进展。具体来说,我们提出的名为 MCPD-Net 的架构使用来自视觉和加速度计传感器的两种数据模态,在家庭环境中训练变分自编码器(VAE)模型。这些是特定于模态的 VAE,可以预测人类运动的有效表示形式,然后将其融合并提供给分类模块。在我们的端到端训练过程中,我们最小化了对应于两种数据模态的潜在空间之间的差异。这使得我们的方法能够在推理过程中处理缺失的模态。我们表明,在包含真实患者数据的数据集上,我们提出的多模态方法的平均 F1 分数分别比单模态和其他多模态方法提高了 0.25 和 0.09,这表明多模态训练的好处。当推理过程中缺失一个模态时,我们的方法的平均 F1 分数仍比其他方法提高了 0.17,这进一步证明了多模态训练的优势。