一种使用半监督语音嵌入检测帕金森病的新型融合架构。

A novel fusion architecture for detecting Parkinson's Disease using semi-supervised speech embeddings.

作者信息

Adnan Tariq, Abdelkader Abdelrahman, Liu Zipei, Hossain Ekram, Park Sooyong, Islam Md Saiful, Hoque Ehsan

机构信息

Department of Computer Science, University of Rochester, Rochester, NY, USA.

Ministry of Defense Health Services, Riyadh, Saudi Arabia.

出版信息

NPJ Parkinsons Dis. 2025 Jun 20;11(1):176. doi: 10.1038/s41531-025-00956-7.

DOI:10.1038/s41531-025-00956-7

PMID:40541966

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12181232/

Abstract

We introduce a framework for screening Parkinson’s disease (PD) using English pangram utterances. Our dataset includes 1306 participants (392 with PD) from both home and clinical settings, covering diverse demographics (53.2% female). We used deep learning embeddings from Wav2Vec 2.0, WavLM, and ImageBind to capture speech dynamics indicative of PD. Our novel fusion model for PD classification aligns different speech embeddings into a cohesive feature space, outperforming baseline alternatives. In a stratified randomized split, the model achieved an AUROC of 88.9% and an accuracy of 85.7%. Statistical bias analysis showed equitable performance across sex, ethnicity, and age subgroups, with robustness across various disease durations and PD stages. Detailed error analysis revealed higher misclassification rates in specific age ranges for males and females, aligning with clinical insights. External testing yielded AUROCs of 82.1% and 78.4% on two clinical datasets, and an AUROC of 77.4% on an unseen general spontaneous English speech dataset, demonstrating versatility in natural speech analysis and potential for global accessibility and health equity.

摘要

我们介绍了一种使用英语全字母句话语来筛查帕金森病（PD）的框架。我们的数据集包括来自家庭和临床环境的1306名参与者（392名患有PD），涵盖了不同的人口统计学特征（53.2%为女性）。我们使用了来自Wav2Vec 2.0、WavLM和ImageBind的深度学习嵌入来捕捉指示PD的语音动态。我们用于PD分类的新型融合模型将不同的语音嵌入对齐到一个连贯的特征空间中，优于基线替代模型。在分层随机分割中，该模型的曲线下面积（AUROC）达到88.9%，准确率达到85.7%。统计偏差分析表明，该模型在性别、种族和年龄亚组中的表现公平，在不同疾病持续时间和PD阶段具有稳健性。详细的错误分析显示，男性和女性在特定年龄范围内的误分类率较高，这与临床见解一致。在两个临床数据集上进行外部测试时，曲线下面积分别为82.1%和78.4%，在一个未见过的一般自然英语语音数据集上的曲线下面积为77.4%，这表明该模型在自然语音分析中具有通用性，具有全球可及性和健康公平性的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6309/12181232/72e9c970e353/41531_2025_956_Fig1_HTML.jpg

相似文献

A novel fusion architecture for detecting Parkinson's Disease using semi-supervised speech embeddings.

NPJ Parkinsons Dis. 2025 Jun 20;11(1):176. doi: 10.1038/s41531-025-00956-7.

Evaluating language model embeddings for Parkinson's disease cohort harmonization using a novel manually curated variable mapping schema.

Sci Rep. 2025 Jun 20;15(1):20210. doi: 10.1038/s41598-025-06447-2.

A comparison of speech and language therapy techniques for dysarthria in Parkinson's disease.

Cochrane Database Syst Rev. 2001(2):CD002814. doi: 10.1002/14651858.CD002814.

Use of β-adrenoreceptor drugs and Parkinson's disease incidence in women from the French E3N cohort study.

J Parkinsons Dis. 2025 Apr 29:1877718X251330993. doi: 10.1177/1877718X251330993.

Speech and language therapy for dysarthria in Parkinson's disease.

Cochrane Database Syst Rev. 2001(2):CD002812. doi: 10.1002/14651858.CD002812.

Semi-Supervised Learning Allows for Improved Segmentation With Reduced Annotations of Brain Metastases Using Multicenter MRI Data.

J Magn Reson Imaging. 2025 Jun;61(6):2469-2479. doi: 10.1002/jmri.29686. Epub 2025 Jan 10.

Referral criteria to palliative care for patients with Parkinson's disease: a systematic review.

Curr Med Res Opin. 2023 Feb;39(2):267-279. doi: 10.1080/03007995.2022.2146405. Epub 2022 Dec 28.

Analyzing Wav2Vec 1.0 Embeddings for Cross-Database Parkinson's Disease Detection and Speech Features Extraction.

Sensors (Basel). 2024 Aug 26;24(17):5520. doi: 10.3390/s24175520.

Physical exercise for people with Parkinson's disease: a systematic review and network meta-analysis.

Cochrane Database Syst Rev. 2024 Apr 8;4(4):CD013856. doi: 10.1002/14651858.CD013856.pub3.

Enhancing the diagnostic potential of electroretinography in Parkinson's disease: A review of protocol and cohort criteria.

J Parkinsons Dis. 2025 Jun;15(4):694-709. doi: 10.1177/1877718X251331863. Epub 2025 Apr 29.

引用本文的文献

Remote AI Screening for Parkinson's Disease: A Multimodal, Cross-Setting Validation Study.

Res Sq. 2025 Jun 26:rs.3.rs-6844936. doi: 10.21203/rs.3.rs-6844936/v1.

本文引用的文献

Using AI to measure Parkinson's disease severity at home.

NPJ Digit Med. 2023 Aug 23;6(1):156. doi: 10.1038/s41746-023-00905-9.

Wearable movement-tracking data identify Parkinson's disease years before clinical diagnosis.

Nat Med. 2023 Aug;29(8):2048-2056. doi: 10.1038/s41591-023-02440-2. Epub 2023 Jul 3.

Artificial intelligence-enabled detection and assessment of Parkinson's disease using nocturnal breathing signals.

Nat Med. 2022 Oct;28(10):2207-2215. doi: 10.1038/s41591-022-01932-x. Epub 2022 Aug 22.

Patient Experience in Early-Stage Parkinson's Disease: Using a Mixed Methods Analysis to Identify Which Concepts Are Cardinal for Clinical Trial Outcome Assessment.

Neurol Ther. 2022 Sep;11(3):1319-1340. doi: 10.1007/s40120-022-00375-3. Epub 2022 Jul 1.

Patient contrastive learning: A performant, expressive, and practical approach to electrocardiogram modeling.

PLoS Comput Biol. 2022 Feb 14;18(2):e1009862. doi: 10.1371/journal.pcbi.1009862. eCollection 2022 Feb.

Bias Investigation in Artificial Intelligence Systems for Early Detection of Parkinson's Disease: A Narrative Review.

Diagnostics (Basel). 2022 Jan 11;12(1):166. doi: 10.3390/diagnostics12010166.

Why does Africa have the lowest number of Neurologists and how to cover the Gap?

J Neurol Sci. 2022 Mar 15;434:120119. doi: 10.1016/j.jns.2021.120119. Epub 2021 Dec 29.

Effects of physician visit frequency for Parkinson's disease treatment on mortality, hospitalization, and costs: a retrospective cohort study.

BMC Geriatr. 2021 Dec 15;21(1):707. doi: 10.1186/s12877-021-02685-x.

Detecting Parkinson Disease Using a Web-Based Speech Task: Observational Study.

J Med Internet Res. 2021 Oct 19;23(10):e26305. doi: 10.2196/26305.

Hypomimia in Parkinson's Disease: What Is It Telling Us?

Front Neurol. 2021 Jan 25;11:603582. doi: 10.3389/fneur.2020.603582. eCollection 2020.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种使用半监督语音嵌入检测帕金森病的新型融合架构。

A novel fusion architecture for detecting Parkinson's Disease using semi-supervised speech embeddings.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献