• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于语音分析和可解释机器学习的帕金森病无创检测

Non-invasive detection of Parkinson's disease based on speech analysis and interpretable machine learning.

作者信息

Xu Huanqing, Xie Wei, Pang Mingzhen, Li Ya, Jin Luhua, Huang Fangliang, Shao Xian

机构信息

The School of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, China.

Jiangxi Medical College, Nanchang University, Nanchang, Jiangxi, China.

出版信息

Front Aging Neurosci. 2025 Apr 30;17:1586273. doi: 10.3389/fnagi.2025.1586273. eCollection 2025.

DOI:10.3389/fnagi.2025.1586273
PMID:40370753
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12075230/
Abstract

OBJECTIVE

Parkinson's disease (PD) is a progressive neurodegenerative disorder that significantly impacts motor function and speech patterns. Early detection of PD through non-invasive methods, such as speech analysis, can improve treatment outcomes and quality of life for patients. This study aims to develop an interpretable machine learning model that uses speech recordings and acoustic features to predict PD.

METHODS

A dataset of speech recordings from individuals with and without PD was analyzed. The dataset includes features such as fundamental frequency (Fo), jitter, shimmer, noise-to-harmonics ratio (NHR), and non-linear dynamic complexity measures. Exploratory data analysis (EDA) was conducted to identify patterns and relationships in the data. The dataset was split into 70% training and 30% testing sets. To address class imbalance, synthetic minority oversampling technique (SMOTE) was applied. Several machine learning algorithms, including K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Trees, Random Forests, and Neural Networks, were implemented and evaluated. Model performance was assessed using accuracy, recall, F1 score, and area under the receiver operating characteristic curve (AUC-ROC) metrics. SHapley Additive exPlanations (SHAP) were used to explain the models and evaluate feature contributions.

RESULTS

The analysis revealed that features related to speech instability, such as jitter, shimmer, and NHR, were highly predictive of PD. Non-linear metrics, including Recurrence Plot Dimension Entropy (RPDE) and Pitch Period Entropy (PPE), also made significant contributions to the model's predictive power. Random Forest and Gradient Boosting models achieved the highest performance, with an AUC-ROC of 0.98, recall of 0.95, ensuring minimal false negatives. SHAp values highlighted the importance of fundamental frequency variation and harmonic-to-noise ratio in distinguishing PD patients from healthy individuals.

CONCLUSION

The developed machine learning model accurately predicts Parkinson's disease using speech recordings, with Random Forest and Gradient Boosting algorithms demonstrating superior performance. Key predictive features include jitter, shimmer, and non-linear dynamic complexity measures. This study provides a reliable, non-invasive tool for early PD detection and underscores the potential of speech analysis in diagnosing neurodegenerative diseases.

摘要

目的

帕金森病(PD)是一种进行性神经退行性疾病,对运动功能和言语模式有显著影响。通过语音分析等非侵入性方法早期检测帕金森病,可以改善患者的治疗效果和生活质量。本研究旨在开发一种可解释的机器学习模型,该模型利用语音记录和声学特征来预测帕金森病。

方法

分析了有和没有帕金森病个体的语音记录数据集。该数据集包括基频(Fo)、抖动、闪烁、噪声与谐波比(NHR)以及非线性动态复杂性度量等特征。进行探索性数据分析(EDA)以识别数据中的模式和关系。数据集被分为70%的训练集和30%的测试集。为了解决类别不平衡问题,应用了合成少数过采样技术(SMOTE)。实施并评估了几种机器学习算法,包括K近邻(KNN)、支持向量机(SVM)、决策树、随机森林和神经网络。使用准确率、召回率、F1分数和接收器操作特征曲线下面积(AUC-ROC)指标评估模型性能。使用SHapley加性解释(SHAP)来解释模型并评估特征贡献。

结果

分析表明,与语音不稳定性相关的特征,如抖动、闪烁和NHR,对帕金森病具有高度预测性。包括递归图维度熵(RPDE)和基音周期熵(PPE)在内的非线性度量也对模型的预测能力做出了重大贡献。随机森林和梯度提升模型表现最佳AUC-ROC为0.98,召回率为0.95,确保了最小的假阴性。SHAP值突出了基频变化和谐波与噪声比在区分帕金森病患者和健康个体方面的重要性。

结论

所开发的机器学习模型使用语音记录准确预测帕金森病,随机森林和梯度提升算法表现出卓越性能。关键预测特征包括抖动、闪烁和非线性动态复杂性度量。本研究为帕金森病的早期检测提供了一种可靠的非侵入性工具,并强调了语音分析在诊断神经退行性疾病方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/37beefeef521/fnagi-17-1586273-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/ed3035285a79/fnagi-17-1586273-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/8ef8d63bf096/fnagi-17-1586273-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/be75f6df8c57/fnagi-17-1586273-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/4ea919899ef8/fnagi-17-1586273-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/2ba189b30ac8/fnagi-17-1586273-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/6c6ad3b351e7/fnagi-17-1586273-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/054ce083dfa1/fnagi-17-1586273-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/37beefeef521/fnagi-17-1586273-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/ed3035285a79/fnagi-17-1586273-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/8ef8d63bf096/fnagi-17-1586273-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/be75f6df8c57/fnagi-17-1586273-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/4ea919899ef8/fnagi-17-1586273-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/2ba189b30ac8/fnagi-17-1586273-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/6c6ad3b351e7/fnagi-17-1586273-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/054ce083dfa1/fnagi-17-1586273-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a153/12075230/37beefeef521/fnagi-17-1586273-g008.jpg

相似文献

1
Non-invasive detection of Parkinson's disease based on speech analysis and interpretable machine learning.基于语音分析和可解释机器学习的帕金森病无创检测
Front Aging Neurosci. 2025 Apr 30;17:1586273. doi: 10.3389/fnagi.2025.1586273. eCollection 2025.
2
Gradient boosting for Parkinson's disease diagnosis from voice recordings.基于语音记录的梯度提升算法用于帕金森病诊断
BMC Med Inform Decis Mak. 2020 Sep 15;20(1):228. doi: 10.1186/s12911-020-01250-7.
3
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
4
Prediction of lumbar disc degeneration based on interpretable machine learning models: retrospective cohort study.基于可解释机器学习模型的腰椎间盘退变预测:回顾性队列研究
Spine J. 2025 Apr 9. doi: 10.1016/j.spinee.2025.04.004.
5
Construction of a risk prediction model for lung infection after chemotherapy in lung cancer patients based on the machine learning algorithm.基于机器学习算法构建肺癌患者化疗后肺部感染风险预测模型。
Front Oncol. 2024 Aug 9;14:1403392. doi: 10.3389/fonc.2024.1403392. eCollection 2024.
6
Development and validation of an interpretable machine learning model for predicting in-hospital mortality for ischemic stroke patients in ICU.用于预测ICU中缺血性中风患者院内死亡率的可解释机器学习模型的开发与验证
Int J Med Inform. 2025 Jun;198:105874. doi: 10.1016/j.ijmedinf.2025.105874. Epub 2025 Mar 9.
7
Development and validation of a prediction model for coronary heart disease risk in depressed patients aged 20 years and older using machine learning algorithms.使用机器学习算法开发并验证针对20岁及以上抑郁症患者冠心病风险的预测模型。
Front Cardiovasc Med. 2025 Jan 9;11:1504957. doi: 10.3389/fcvm.2024.1504957. eCollection 2024.
8
Machine learning models predict triage levels, massive transfusion protocol activation, and mortality in trauma utilizing patients hemodynamics on admission.机器学习模型利用创伤患者入院时的血流动力学来预测分诊级别、大量输血方案的激活和死亡率。
Comput Biol Med. 2024 Sep;179:108880. doi: 10.1016/j.compbiomed.2024.108880. Epub 2024 Jul 16.
9
Development and Validation of an Interpretable Machine Learning Prediction Model for Total Pathological Complete Response after Neoadjuvant Chemotherapy in Locally Advanced Breast Cancer: Multicenter Retrospective Analysis.局部晚期乳腺癌新辅助化疗后总病理完全缓解的可解释机器学习预测模型的开发与验证:多中心回顾性分析
J Cancer. 2024 Aug 1;15(15):5058-5071. doi: 10.7150/jca.97190. eCollection 2024.
10
Predicting isolated impaired glucose tolerance without oral glucose tolerance test using machine learning in Chinese Han men.在中国汉族男性中使用机器学习在不进行口服葡萄糖耐量试验的情况下预测单纯性糖耐量受损
Front Endocrinol (Lausanne). 2025 Apr 24;16:1514397. doi: 10.3389/fendo.2025.1514397. eCollection 2025.

引用本文的文献

1
Speech-Based Parkinson's Detection Using Pre-Trained Self-Supervised Automatic Speech Recognition (ASR) Models and Supervised Contrastive Learning.基于语音的帕金森病检测:使用预训练的自监督自动语音识别(ASR)模型和监督对比学习
Bioengineering (Basel). 2025 Jul 1;12(7):728. doi: 10.3390/bioengineering12070728.

本文引用的文献

1
Multi-label speech feature selection for Parkinson's Disease subtype recognition using graph model.基于图模型的帕金森病亚型识别多标签语音特征选择
Comput Biol Med. 2025 Feb;185:109566. doi: 10.1016/j.compbiomed.2024.109566. Epub 2024 Dec 24.
2
Parkinson's disease detection based on features refinement through L1 regularized SVM and deep neural network.基于 L1 正则化 SVM 和深度神经网络的特征细化的帕金森病检测。
Sci Rep. 2024 Jan 16;14(1):1333. doi: 10.1038/s41598-024-51600-y.
3
Machine Learning-Based Classification of Parkinson's Disease Patients Using Speech Biomarkers.
基于机器学习的帕金森病患者语音生物标志物分类
J Parkinsons Dis. 2024;14(1):95-109. doi: 10.3233/JPD-230002.
4
Machine learning for the prediction of postoperative nosocomial pulmonary infection in patients with spinal cord injury.机器学习预测脊髓损伤患者术后医院获得性肺部感染。
Eur Spine J. 2023 Nov;32(11):3825-3835. doi: 10.1007/s00586-023-07772-8. Epub 2023 May 17.
5
Machine learning approaches to identify Parkinson's disease using voice signal features.利用语音信号特征识别帕金森病的机器学习方法。
Front Artif Intell. 2023 Mar 28;6:1084001. doi: 10.3389/frai.2023.1084001. eCollection 2023.
6
Machine-Learning-Based Disease Diagnosis: A Comprehensive Review.基于机器学习的疾病诊断:全面综述
Healthcare (Basel). 2022 Mar 15;10(3):541. doi: 10.3390/healthcare10030541.
7
Voice in Parkinson's Disease: A Machine Learning Study.帕金森病中的语音:一项机器学习研究。
Front Neurol. 2022 Feb 15;13:831428. doi: 10.3389/fneur.2022.831428. eCollection 2022.
8
The combined effect of REM sleep behavior disorder and hyposmia on cognition and motor phenotype in Parkinson's disease.快速眼动睡眠行为障碍与嗅觉减退对帕金森病认知和运动表型的联合影响。
J Neurol Sci. 2016 Sep 15;368:374-8. doi: 10.1016/j.jns.2016.07.057. Epub 2016 Jul 26.
9
T test as a parametric statistic.T检验作为一种参数统计方法。
Korean J Anesthesiol. 2015 Dec;68(6):540-6. doi: 10.4097/kjae.2015.68.6.540. Epub 2015 Nov 25.
10
Early identification and treatment of communication and swallowing deficits in Parkinson disease.帕金森病中沟通和吞咽障碍的早期识别与治疗
Semin Speech Lang. 2013 Aug;34(3):185-202. doi: 10.1055/s-0033-1358367. Epub 2013 Oct 28.