用于帕金森病语音数据的分层提升双阶段特征约简集成模型

Hierarchical Boosting Dual-Stage Feature Reduction Ensemble Model for Parkinson's Disease Speech Data.

作者信息

Yang Mingyao, Ma Jie, Wang Pin, Huang Zhiyong, Li Yongming, Liu He, Hameed Zeeshan

机构信息

College of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400000, China.

Chongqing Academy of Educational Sciences, Chongqing 400000, China.

出版信息

Diagnostics (Basel). 2021 Dec 9;11(12):2312. doi: 10.3390/diagnostics11122312.

DOI:10.3390/diagnostics11122312

PMID:34943549

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8700329/

Abstract

As a neurodegenerative disease, Parkinson's disease (PD) is hard to identify at the early stage, while using speech data to build a machine learning diagnosis model has proved effective in its early diagnosis. However, speech data show high degrees of redundancy, repetition, and unnecessary noise, which influence the accuracy of diagnosis results. Although feature reduction (FR) could alleviate this issue, the traditional FR is one-sided (traditional feature extraction could construct high-quality features without feature preference, while traditional feature selection could achieve feature preference but could not construct high-quality features). To address this issue, the Hierarchical Boosting Dual-Stage Feature Reduction Ensemble Model (HBD-SFREM) is proposed in this paper. The major contributions of HBD-SFREM are as follows: (1) The instance space of the deep hierarchy is built by an iterative deep extraction mechanism. (2) The manifold features extraction method embeds the nearest neighbor feature preference method to form the dual-stage feature reduction pair. (3) The dual-stage feature reduction pair is iteratively performed by the AdaBoost mechanism to obtain instances features with higher quality, thus achieving a substantial improvement in model recognition accuracy. (4) The deep hierarchy instance space is integrated into the original instance space to improve the generalization of the algorithm. Three PD speech datasets and a self-collected dataset are used to test HBD-SFREM in this paper. Compared with other FR algorithms and deep learning algorithms, the accuracy of HBD-SFREM in PD speech recognition is improved significantly and would not be affected by a small sample dataset. Thus, HBD-SFREM could give a reference for other related studies.

摘要

作为一种神经退行性疾病，帕金森病（PD）在早期很难识别，而利用语音数据构建机器学习诊断模型已被证明在其早期诊断中是有效的。然而，语音数据表现出高度的冗余、重复和不必要的噪声，这影响了诊断结果的准确性。尽管特征约简（FR）可以缓解这个问题，但传统的FR是片面的（传统特征提取可以构建高质量特征而无特征偏好，而传统特征选择可以实现特征偏好但不能构建高质量特征）。为了解决这个问题，本文提出了层次增强双阶段特征约简集成模型（HBD-SFREM）。HBD-SFREM的主要贡献如下：（1）通过迭代深度提取机制构建深度层次的实例空间。（2）流形特征提取方法嵌入最近邻特征偏好方法，形成双阶段特征约简对。（3）通过AdaBoost机制对双阶段特征约简对进行迭代，以获得更高质量的实例特征，从而在模型识别准确率上有显著提高。（4）将深度层次实例空间集成到原始实例空间中，以提高算法的泛化能力。本文使用三个帕金森病语音数据集和一个自行收集的数据集对HBD-SFREM进行测试。与其他FR算法和深度学习算法相比，HBD-SFREM在帕金森病语音识别中的准确率显著提高，且不受小样本数据集的影响。因此，HBD-SFREM可为其他相关研究提供参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1af0/8700329/d22be5c6520d/diagnostics-11-02312-g001.jpg

相似文献

Hierarchical Boosting Dual-Stage Feature Reduction Ensemble Model for Parkinson's Disease Speech Data.用于帕金森病语音数据的分层提升双阶段特征约简集成模型

Diagnostics (Basel). 2021 Dec 9;11(12):2312. doi: 10.3390/diagnostics11122312.

[Psychosis speech recognition algorithm based on deep embedded sparse stacked autoencoder and manifold ensemble].基于深度嵌入式稀疏堆叠自动编码器和流形集成的精神病语音识别算法

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2021 Aug 25;38(4):655-662. doi: 10.7507/1001-5515.202010050.

Classification of Parkinson's disease utilizing multi-edit nearest-neighbor and ensemble learning algorithms with speech samples.利用语音样本的多编辑最近邻和集成学习算法对帕金森病进行分类。

Biomed Eng Online. 2016 Nov 16;15(1):122. doi: 10.1186/s12938-016-0242-6.

[Combining speech sample and feature bilateral selection algorithm for classification of Parkinson's disease].[结合语音样本与特征双边选择算法用于帕金森病分类]

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2018 Feb 1;34(6):942-948. doi: 10.7507/1001-5515.201704061.

[A partition bagging ensemble learning algorithm for Parkinson's speech data mining].[一种用于帕金森语音数据挖掘的分区装袋集成学习算法]

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2019 Aug 25;36(4):548-556. doi: 10.7507/1001-5515.201803061.

Estimation of Parkinson's disease severity using speech features and extreme gradient boosting.基于语音特征和极端梯度提升算法的帕金森病严重程度评估。

Med Biol Eng Comput. 2020 Nov;58(11):2757-2773. doi: 10.1007/s11517-020-02250-5. Epub 2020 Sep 10.

Intra-subject enveloped multilayer fuzzy sample compression for speech diagnosis of Parkinson's disease.针对帕金森病语音诊断的单主体包络多层模糊样本压缩。

Med Biol Eng Comput. 2024 Feb;62(2):371-388. doi: 10.1007/s11517-023-02944-6. Epub 2023 Oct 24.

Gradient boosting for Parkinson's disease diagnosis from voice recordings.基于语音记录的梯度提升算法用于帕金森病诊断

BMC Med Inform Decis Mak. 2020 Sep 15;20(1):228. doi: 10.1186/s12911-020-01250-7.

[Research on Parkinson's disease recognition algorithm based on sample enhancement].基于样本增强的帕金森病识别算法研究

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Feb 25;41(1):17-25. doi: 10.7507/1001-5515.202304011.

FLP: Factor lattice pattern-based automated detection of Parkinson's disease and specific language impairment using recorded speech.FLP：基于因子格子模式的帕金森病和特定语言障碍的自动检测，使用记录的语音。

Comput Biol Med. 2024 May;173:108280. doi: 10.1016/j.compbiomed.2024.108280. Epub 2024 Mar 20.

本文引用的文献

Automated Detection of Parkinson's Disease Based on Multiple Types of Sustained Phonations Using Linear Discriminant Analysis and Genetically Optimized Neural Network.基于线性判别分析和遗传优化神经网络的多种持续发声类型对帕金森病的自动检测

IEEE J Transl Eng Health Med. 2019 Oct 7;7:2000410. doi: 10.1109/JTEHM.2019.2940900. eCollection 2019.

Performance analysis of different classification algorithms using different feature selection methods on Parkinson's disease detection.不同分类算法在帕金森病检测中使用不同特征选择方法的性能分析。

J Neurosci Methods. 2018 Nov 1;309:81-90. doi: 10.1016/j.jneumeth.2018.08.017. Epub 2018 Sep 1.

Multimodal Assessment of Parkinson's Disease: A Deep Learning Approach.帕金森病的多模态评估：深度学习方法。

IEEE J Biomed Health Inform. 2019 Jul;23(4):1618-1630. doi: 10.1109/JBHI.2018.2866873. Epub 2018 Aug 23.

Comparative Motor Pre-clinical Assessment in Parkinson's Disease Using Supervised Machine Learning Approaches.使用监督机器学习方法进行帕金森病的比较运动临床前评估。

Ann Biomed Eng. 2018 Dec;46(12):2057-2068. doi: 10.1007/s10439-018-2104-9. Epub 2018 Jul 20.

Taste Recognition in E-Tongue Using Local Discriminant Preservation Projection.利用局部判别保持投影的电子舌味觉识别。

IEEE Trans Cybern. 2019 Mar;49(3):947-960. doi: 10.1109/TCYB.2018.2789889. Epub 2018 Jan 17.

Parkin function in Parkinson's disease.帕金森病中的帕金蛋白功能。

Science. 2018 Apr 20;360(6386):267-268. doi: 10.1126/science.aar6606.

Kernel K-Means Sampling for Nyström Approximation.核 K-Means 抽样的 Nyström 逼近。

IEEE Trans Image Process. 2018 May;27(5):2108-2120. doi: 10.1109/TIP.2018.2796860.

Feature selection and extraction for class prediction in dysphonia measures analysis:A case study on Parkinson's disease speech rehabilitation.用于发声障碍测量分析中类别预测的特征选择与提取：帕金森病言语康复的案例研究

Technol Health Care. 2017 Aug 9;25(4):693-708. doi: 10.3233/THC-170824.

Kernel-based Joint Feature Selection and Max-Margin Classification for Early Diagnosis of Parkinson's Disease.基于核的联合特征选择和最大间隔分类用于帕金森病的早期诊断。

Sci Rep. 2017 Jan 25;7:41069. doi: 10.1038/srep41069.

An Expert Diagnosis System for Parkinson Disease Based on Genetic Algorithm-Wavelet Kernel-Extreme Learning Machine.基于遗传算法-小波核-极限学习机的帕金森病专家诊断系统

Parkinsons Dis. 2016;2016:5264743. doi: 10.1155/2016/5264743. Epub 2016 May 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于帕金森病语音数据的分层提升双阶段特征约简集成模型

Hierarchical Boosting Dual-Stage Feature Reduction Ensemble Model for Parkinson's Disease Speech Data.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献