Crawford Jonathan L
Department of Electrical and Computer Engineering, Boston University, Boston, Massachusetts, United States of America.
PLOS Digit Health. 2025 Feb 10;4(2):e0000757. doi: 10.1371/journal.pdig.0000757. eCollection 2025 Feb.
Parkinson's disease is the second most prevalent neurodegenerative disorder with over ten million active cases worldwide and one million new diagnoses per year. Detecting and subsequently diagnosing the disease is challenging because of symptom heterogeneity with respect to complexity, as well as the type and timing of phenotypic manifestations. Typically, language impairment can present in the prodromal phase and precede motor symptoms suggesting that a linguistic-based approach could serve as a diagnostic method for incipient Parkinson's disease. Additionally, improved linguistic models may enhance other approaches through fusion techniques. The field of large language models is advancing rapidly, presenting the opportunity to explore the use of these new models for detecting Parkinson's disease and to improve on current linguistic approaches with high-dimensional representations of linguistics. We evaluate the application of state-of-the-art large language models to detect Parkinson's disease automatically from spontaneous speech with up to 78% accuracy. We also demonstrate that large language models can be used to predict the severity of PD in a regression task. We further demonstrate that the better performance of large language models is due to their ability to extract more relevant linguistic features and not due to increased dimensionality of the feature space.
帕金森病是全球第二常见的神经退行性疾病,全球有超过1000万活跃病例,每年有100万新诊断病例。由于症状在复杂性、表型表现的类型和时间方面存在异质性,检测并随后诊断该疾病具有挑战性。通常,语言障碍可能出现在前驱期,并先于运动症状出现,这表明基于语言的方法可以作为早期帕金森病的诊断方法。此外,改进的语言模型可以通过融合技术增强其他方法。大语言模型领域正在迅速发展,这为探索使用这些新模型来检测帕金森病以及利用语言学的高维表示改进当前的语言方法提供了机会。我们评估了最先进的大语言模型从自发语音中自动检测帕金森病的应用,准确率高达78%。我们还证明,大语言模型可用于在回归任务中预测帕金森病的严重程度。我们进一步证明,大语言模型表现更好是因为它们能够提取更多相关的语言特征,而不是由于特征空间维度的增加。