Suppr超能文献

用于帕金森病语音数据的分层提升双阶段特征约简集成模型

Hierarchical Boosting Dual-Stage Feature Reduction Ensemble Model for Parkinson's Disease Speech Data.

作者信息

Yang Mingyao, Ma Jie, Wang Pin, Huang Zhiyong, Li Yongming, Liu He, Hameed Zeeshan

机构信息

College of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400000, China.

Chongqing Academy of Educational Sciences, Chongqing 400000, China.

出版信息

Diagnostics (Basel). 2021 Dec 9;11(12):2312. doi: 10.3390/diagnostics11122312.

Abstract

As a neurodegenerative disease, Parkinson's disease (PD) is hard to identify at the early stage, while using speech data to build a machine learning diagnosis model has proved effective in its early diagnosis. However, speech data show high degrees of redundancy, repetition, and unnecessary noise, which influence the accuracy of diagnosis results. Although feature reduction (FR) could alleviate this issue, the traditional FR is one-sided (traditional feature extraction could construct high-quality features without feature preference, while traditional feature selection could achieve feature preference but could not construct high-quality features). To address this issue, the Hierarchical Boosting Dual-Stage Feature Reduction Ensemble Model (HBD-SFREM) is proposed in this paper. The major contributions of HBD-SFREM are as follows: (1) The instance space of the deep hierarchy is built by an iterative deep extraction mechanism. (2) The manifold features extraction method embeds the nearest neighbor feature preference method to form the dual-stage feature reduction pair. (3) The dual-stage feature reduction pair is iteratively performed by the AdaBoost mechanism to obtain instances features with higher quality, thus achieving a substantial improvement in model recognition accuracy. (4) The deep hierarchy instance space is integrated into the original instance space to improve the generalization of the algorithm. Three PD speech datasets and a self-collected dataset are used to test HBD-SFREM in this paper. Compared with other FR algorithms and deep learning algorithms, the accuracy of HBD-SFREM in PD speech recognition is improved significantly and would not be affected by a small sample dataset. Thus, HBD-SFREM could give a reference for other related studies.

摘要

作为一种神经退行性疾病,帕金森病(PD)在早期很难识别,而利用语音数据构建机器学习诊断模型已被证明在其早期诊断中是有效的。然而,语音数据表现出高度的冗余、重复和不必要的噪声,这影响了诊断结果的准确性。尽管特征约简(FR)可以缓解这个问题,但传统的FR是片面的(传统特征提取可以构建高质量特征而无特征偏好,而传统特征选择可以实现特征偏好但不能构建高质量特征)。为了解决这个问题,本文提出了层次增强双阶段特征约简集成模型(HBD-SFREM)。HBD-SFREM的主要贡献如下:(1)通过迭代深度提取机制构建深度层次的实例空间。(2)流形特征提取方法嵌入最近邻特征偏好方法,形成双阶段特征约简对。(3)通过AdaBoost机制对双阶段特征约简对进行迭代,以获得更高质量的实例特征,从而在模型识别准确率上有显著提高。(4)将深度层次实例空间集成到原始实例空间中,以提高算法的泛化能力。本文使用三个帕金森病语音数据集和一个自行收集的数据集对HBD-SFREM进行测试。与其他FR算法和深度学习算法相比,HBD-SFREM在帕金森病语音识别中的准确率显著提高,且不受小样本数据集的影响。因此,HBD-SFREM可为其他相关研究提供参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1af0/8700329/d22be5c6520d/diagnostics-11-02312-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验