一种通过语音和手写记录实现帕金森病的通用优化与学习框架。

A generic optimization and learning framework for Parkinson disease via speech and handwritten records.

作者信息

Yousif Nada R, Balaha Hossam Magdy, Haikal Amira Y, El-Gendy Eman M

机构信息

Computer and Control Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt.

出版信息

J Ambient Intell Humaniz Comput. 2022 Aug 26:1-21. doi: 10.1007/s12652-022-04342-6.

DOI:10.1007/s12652-022-04342-6

PMID:36042792

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9411848/

Abstract

Parkinson's disease (PD) is a neurodegenerative disorder with slow progression whose symptoms can be identified at late stages. Early diagnosis and treatment of PD can help to relieve the symptoms and delay progression. However, this is very challenging due to the similarities between the symptoms of PD and other diseases. The current study proposes a generic framework for the diagnosis of PD using handwritten images and (or) speech signals. For the handwriting images, 8 pre-trained convolutional neural networks (CNN) via transfer learning tuned by Aquila Optimizer were trained on the NewHandPD dataset to diagnose PD. For the speech signals, features from the MDVR-KCL dataset are extracted numerically using 16 feature extraction algorithms and fed to 4 different machine learning algorithms tuned by Grid Search algorithm, and graphically using 5 different techniques and fed to the 8 pretrained CNN structures. The authors propose a new technique in extracting the features from the voice dataset based on the segmentation of variable speech-signal-segment-durations, i.e., the use of different durations in the segmentation phase. Using the proposed technique, 5 datasets with 281 numerical features are generated. Results from different experiments are collected and recorded. For the NewHandPD dataset, the best-reported metric is 99.75% using the VGG19 structure. For the MDVR-KCL dataset, the best-reported metrics are 99.94% using the KNN and SVM ML algorithms and the combined numerical features; and 100% using the combined the mel-specgram graphical features and VGG19 structure. These results are better than other state-of-the-art researches.

摘要

帕金森病（PD）是一种进展缓慢的神经退行性疾病，其症状在晚期才能被识别。帕金森病的早期诊断和治疗有助于缓解症状并延缓病情进展。然而，由于帕金森病与其他疾病症状相似，这一过程极具挑战性。当前研究提出了一个使用手写图像和（或）语音信号诊断帕金森病的通用框架。对于手写图像，通过Aquila优化器进行迁移学习调优的8个预训练卷积神经网络（CNN）在NewHandPD数据集上进行训练以诊断帕金森病。对于语音信号，使用16种特征提取算法从MDVR-KCL数据集中数值提取特征，并将其输入到由网格搜索算法调优的4种不同机器学习算法中；同时，使用5种不同技术从图形上提取特征，并将其输入到8个预训练的CNN结构中。作者提出了一种基于可变语音信号段持续时间分割从语音数据集中提取特征的新技术，即在分割阶段使用不同的持续时间。使用该技术生成了5个包含281个数值特征的数据集。收集并记录了不同实验的结果。对于NewHandPD数据集，使用VGG19结构报告的最佳指标为99.75%。对于MDVR-KCL数据集，使用KNN和SVM机器学习算法以及组合数值特征报告的最佳指标为99.94%；使用组合的梅尔频谱图图形特征和VGG19结构时最佳指标为100%。这些结果优于其他现有研究。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

一种通过语音和手写记录实现帕金森病的通用优化与学习框架。

A generic optimization and learning framework for Parkinson disease via speech and handwritten records.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

一种通过语音和手写记录实现帕金森病的通用优化与学习框架。

A generic optimization and learning framework for Parkinson disease via speech and handwritten records.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献