Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Suite 514, Boston, MA, 02115, USA.
Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, 02138, USA.
BMC Neurol. 2021 May 18;21(1):201. doi: 10.1186/s12883-021-02226-4.
Characterization of prediagnostic Parkinson's Disease (PD) and early prediction of subsequent development are critical for preventive interventions, risk stratification and understanding of disease pathology. This study aims to characterize the role of the prediagnostic period in PD and, using selected features from this period as novel interception points, construct a prediction model to accelerate the diagnosis in a real-world setting.
We constructed two sets of machine learning models: a retrospective approach highlighting exposures up to 5 years prior to PD diagnosis, and an alternative model that prospectively predicted future PD diagnosis from all individuals at their first diagnosis of a gait or tremor disorder, these being features that appeared to represent the initiation of a differential diagnostic window.
We found many novel features captured by the retrospective models; however, the high accuracy was primarily driven from surrogate diagnoses for PD, such as gait and tremor disorders, suggesting the presence of a distinctive differential diagnostic period when the clinician already suspected PD. The model utilizing a gait/tremor diagnosis as the interception point, achieved a validation AUC of 0.874 with potential time compression to a future PD diagnosis of more than 300 days. Comparisons of predictive diagnoses between the prospective and prediagnostic cohorts suggest the presence of distinctive trajectories of PD progression based on comorbidity profiles.
Overall, our machine learning approach allows for both guiding clinical decisions such as the initiation of neuroprotective interventions and importantly, the possibility of earlier diagnosis for clinical trials for disease modifying therapies.
帕金森病(PD)的临床前期特征及后续发展的早期预测对预防干预、风险分层和疾病病理机制的理解至关重要。本研究旨在探讨临床前期在 PD 中的作用,并利用该时期的选定特征作为新的切入点,构建一个预测模型,以便在真实环境中加速诊断。
我们构建了两套机器学习模型:一种是回顾性方法,突出显示 PD 诊断前 5 年的暴露情况;另一种是前瞻性模型,从首次出现步态或震颤障碍的所有个体中预测未来 PD 诊断,这些特征似乎代表了鉴别诊断窗口的开始。
我们发现了许多由回顾性模型捕获的新特征;然而,高准确性主要来自 PD 的替代诊断,如步态和震颤障碍,这表明当临床医生已经怀疑 PD 时,存在一个独特的鉴别诊断期。利用步态/震颤诊断作为切入点的模型,在验证 AUC 为 0.874,具有超过 300 天的未来 PD 诊断时间压缩潜力。前瞻性和临床前期队列之间的预测诊断比较表明,基于合并症特征,PD 进展存在独特的轨迹。
总的来说,我们的机器学习方法不仅可以指导临床决策,如神经保护干预的启动,而且重要的是,为疾病修饰疗法的临床试验提供更早诊断的可能性。