Department of Computer Science, Stanford University, Stanford CA 94305, USA.
Department of Psychiatry & Behavioral Sciences, Stanford University, Stanford CA 94305, USA.
Med Image Anal. 2021 Oct;73:102179. doi: 10.1016/j.media.2021.102179. Epub 2021 Jul 21.
Parkinson's disease (PD) is a brain disorder that primarily affects motor function, leading to slow movement, tremor, and stiffness, as well as postural instability and difficulty with walking/balance. The severity of PD motor impairments is clinically assessed by part III of the Movement Disorder Society Unified Parkinson's Disease Rating Scale (MDS-UPDRS), a universally-accepted rating scale. However, experts often disagree on the exact scoring of individuals. In the presence of label noise, training a machine learning model using only scores from a single rater may introduce bias, while training models with multiple noisy ratings is a challenging task due to the inter-rater variabilities. In this paper, we introduce an ordinal focal neural network to estimate the MDS-UPDRS scores from input videos, to leverage the ordinal nature of MDS-UPDRS scores and combat class imbalance. To handle multiple noisy labels per exam, the training of the network is regularized via rater confusion estimation (RCE), which encodes the rating habits and skills of raters via a confusion matrix. We apply our pipeline to estimate MDS-UPDRS test scores from their video recordings including gait (with multiple Raters, R=3) and finger tapping scores (single rater). On a sizable clinical dataset for the gait test (N=55), we obtained a classification accuracy of 72% with majority vote as ground-truth, and an accuracy of ∼84% of our model predicting at least one of the raters' scores. Our work demonstrates how computer-assisted technologies can be used to track patients and their motor impairments, even when there is uncertainty in the clinical ratings. The latest version of the code will be available at https://github.com/mlu355/PD-Motor-Severity-Estimation.
帕金森病(PD)是一种主要影响运动功能的脑部疾病,导致运动缓慢、震颤和僵硬,以及姿势不稳和行走/平衡困难。PD 运动障碍的严重程度由运动障碍协会统一帕金森病评定量表(MDS-UPDRS)第三部分进行临床评估,这是一种普遍接受的评定量表。然而,专家们对个体的评分往往存在分歧。在存在标签噪声的情况下,仅使用单个评分者的评分训练机器学习模型可能会引入偏差,而使用多个有噪声的评分训练模型由于评分者之间的变异性而具有挑战性。在本文中,我们引入了一个有序焦点神经网络,从输入视频中估计 MDS-UPDRS 评分,以利用 MDS-UPDRS 评分的有序性质并对抗类别不平衡。为了处理每次检查的多个有噪声的标签,通过评分者混淆估计(RCE)对网络的训练进行正则化,通过混淆矩阵对评分者的评分习惯和技能进行编码。我们将我们的流水线应用于从他们的视频记录中估计 MDS-UPDRS 测试分数,包括步态(多个评分者,R=3)和手指敲击分数(单个评分者)。在步态测试的大型临床数据集(N=55)上,我们以多数票作为真实值获得了 72%的分类准确性,并且我们的模型预测至少一个评分者分数的准确性约为 84%。我们的工作表明,即使在临床评分存在不确定性的情况下,计算机辅助技术也可以用于跟踪患者及其运动障碍。最新版本的代码将在 https://github.com/mlu355/PD-Motor-Severity-Estimation 上提供。