基于预训练深度学习模型的语音转文本分析用于帕金森语音障碍评估的可行性研究

Feasibility Study of Parkinson's Speech Disorder Evaluation With Pre-Trained Deep Learning Model for Speech-to-Text Analysis.

作者信息

Kim Kwang Hyeon, Lee Byung-Jou, Koo Hae-Won

机构信息

Clinical Research Support Center, Inje University Ilsan Paik Hospital, Inje University College of Medicine, Goyang, Korea.

Department of Neurosurgery, Inje University Ilsan Paik Hospital, Inje University College of Medicine, Goyang, Korea.

出版信息

Korean J Neurotrauma. 2024 Sep 23;20(3):168-179. doi: 10.13004/kjnt.2024.20.e30. eCollection 2024 Sep.

DOI:10.13004/kjnt.2024.20.e30

PMID:39372118

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11450341/

Abstract

OBJECTIVE

This study investigates the feasibility of employing a pre-trained deep learning wave-to-vec model for speech-to-text analysis in individuals with speech disorders arising from Parkinson's disease (PD).

METHODS

A publicly available dataset containing speech recordings including the Hoehn and Yahr (H&Y) staging, Movement Disorder Society Unified Parkinson's Disease Rating Scale (UPDRS) Part I, UPDRS Part II scores, and gender information from both healthy controls (HC) and those diagnosed with PD was utilized. Employing the Wav2Vec model, a speech-to-text analysis method was implemented on PD patient data. Tasks conducted included word letter classification, word match probability assessment, and analysis of speech waveform characteristics as provided by the model's output.

RESULTS

For the dataset comprising 20 cases, among individuals with PD, the H&Y score averaged 2.50±0.67, the UPDRS II-part 5 score averaged 0.70±1.00, and the UPDRS III-part 18 score averaged 0.80±0.98. Additionally, the number of words derived from decoded text subsequent to speech recognition was evaluated, resulting in mean values of 299.10±16.79 and 259.80±93.39 for the HC and PD groups, respectively. Furthermore, the calculated degree of agreement for all syllables was based on the speech process. The accuracy for the reading sentences was observed to be 0.31 and 0.10, respectively.

CONCLUSION

This study aimed to demonstrate the effectiveness of wave-to-vec in enhancing speech-to-text analysis for patients with speech disorders. The findings could pave the way for the development of clinical tools for improved diagnosis, evaluation, and communication support for this population.

摘要

目的

本研究探讨使用预训练的深度学习Wave2Vec模型对帕金森病（PD）引起的言语障碍患者进行语音转文本分析的可行性。

方法

利用一个公开可用的数据集，其中包含语音记录，包括霍恩和亚尔（H&Y）分期、运动障碍协会统一帕金森病评定量表（UPDRS）第一部分、UPDRS第二部分得分以及健康对照（HC）和被诊断为PD患者的性别信息。采用Wave2Vec模型，对PD患者数据实施语音转文本分析方法。进行的任务包括单词字母分类、单词匹配概率评估以及对模型输出提供的语音波形特征进行分析。

结果

对于包含20个病例的数据集，在PD患者中，H&Y评分平均为2.50±0.67，UPDRS第二部分第5项得分平均为0.70±1.00，UPDRS第三部分第18项得分平均为0.80±0.98。此外，评估了语音识别后解码文本中的单词数量，HC组和PD组的平均值分别为299.10±16.79和259.80±93.39。此外，基于语音过程计算了所有音节的一致程度。观察到阅读句子的准确率分别为0.31和0.10。

结论

本研究旨在证明Wave2Vec在增强言语障碍患者语音转文本分析方面的有效性。这些发现可为开发临床工具以改善对该人群的诊断、评估和沟通支持铺平道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3252/11450341/0cd804108ea6/kjn-20-168-g001.jpg

相似文献

Feasibility Study of Parkinson's Speech Disorder Evaluation With Pre-Trained Deep Learning Model for Speech-to-Text Analysis.基于预训练深度学习模型的语音转文本分析用于帕金森语音障碍评估的可行性研究

Korean J Neurotrauma. 2024 Sep 23;20(3):168-179. doi: 10.13004/kjnt.2024.20.e30. eCollection 2024 Sep.

Gait video-based prediction of unified Parkinson's disease rating scale score: a retrospective study.基于步态视频的统一帕金森病评定量表评分预测：一项回顾性研究。

BMC Neurol. 2023 Oct 5;23(1):358. doi: 10.1186/s12883-023-03385-2.

Dual-Task Performance and Brain Morphologic Characteristics in Parkinson's Disease.帕金森病的双重任务表现与脑形态学特征

Neurodegener Dis. 2024;24(3-4):106-116. doi: 10.1159/000540393. Epub 2024 Jul 31.

Correlation Analysis Between 3D and Plane DAT Binding Parameters of C-CFT PET/CT and the Clinical Characteristics of Patients with Parkinson's Disease.C-CFT PET/CT的三维与平面多巴胺转运体（DAT）结合参数之间的相关性分析及帕金森病患者的临床特征

J Integr Neurosci. 2025 Apr 25;24(4):24440. doi: 10.31083/JIN24440.

Treatment Detection and Movement Disorder Society-Unified Parkinson's Disease Rating Scale, Part III Estimation Using Finger Tapping Tasks.使用手指叩击任务评估治疗检测与运动障碍学会-统一帕金森病评定量表第三部分。

Mov Disord. 2023 Oct;38(10):1795-1805. doi: 10.1002/mds.29520. Epub 2023 Jul 4.

Predicting the Progression of Parkinson's Disease MDS-UPDRS-III Motor Severity Score from Gait Data using Deep Learning.利用深度学习从步态数据预测帕金森病MDS-UPDRS-III运动严重程度评分的进展情况

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:249-252. doi: 10.1109/EMBC46164.2021.9630769.

Neurologic Dysfunction Assessment in Parkinson Disease Based on Fundus Photographs Using Deep Learning.基于深度学习的帕金森病眼底照片的神经功能障碍评估。

JAMA Ophthalmol. 2023 Mar 1;141(3):234-240. doi: 10.1001/jamaophthalmol.2022.5928.

Analyzing Wav2Vec 1.0 Embeddings for Cross-Database Parkinson's Disease Detection and Speech Features Extraction.分析 Wav2Vec 1.0 嵌入以进行跨数据库帕金森病检测和语音特征提取。

Sensors (Basel). 2024 Aug 26;24(17):5520. doi: 10.3390/s24175520.

Vowel articulation in Parkinson's disease.帕金森病患者的元音发音。

J Voice. 2011 Jul;25(4):467-72. doi: 10.1016/j.jvoice.2010.01.009. Epub 2010 May 1.

Long-term outcomes of deep brain stimulation in severe Parkinson's disease utilizing UPDRS III and modified Hoehn and Yahr as a severity scale.以统一帕金森病评定量表第三部分（UPDRS III）和改良Hoehn-Yahr分级作为严重程度量表，评估深部脑刺激术治疗重度帕金森病的长期疗效。

Clin Neurol Neurosurg. 2019 Apr;179:67-73. doi: 10.1016/j.clineuro.2019.02.018. Epub 2019 Feb 21.

引用本文的文献

Letter to the Editor: Commentary on Feasibility Study of Parkinson's Speech Disorder Evaluation With Pre-Trained Deep Learning Model for Speech-to-Text Analysis ( 2024;20:168-179).

Korean J Neurotrauma. 2024 Dec 16;20(4):303-304. doi: 10.13004/kjnt.2024.20.e40. eCollection 2024 Dec.

本文引用的文献

Pathophysiology and Neuroimmune Interactions Underlying Parkinson's Disease and Traumatic Brain Injury.帕金森病和创伤性脑损伤的病理生理学和神经免疫相互作用。

Int J Mol Sci. 2023 Apr 13;24(8):7186. doi: 10.3390/ijms24087186.

The effects of intensive speech treatment on intelligibility in Parkinson's disease: A randomised controlled trial.强化言语治疗对帕金森病患者言语清晰度的影响：一项随机对照试验。

EClinicalMedicine. 2020 Jun 28;24:100429. doi: 10.1016/j.eclinm.2020.100429. eCollection 2020 Jul.

Speech intelligibility of Parkinson's disease patients evaluated by different groups of healthcare professionals and naïve listeners.不同医疗专业人员和普通听众对帕金森病患者言语可懂度的评估

Logoped Phoniatr Vocol. 2021 Oct;46(3):141-147. doi: 10.1080/14015439.2020.1785546. Epub 2020 Jul 7.

Predicting Intelligibility Deficits in Parkinson's Disease With Perceptual Speech Ratings.使用感知语音评分预测帕金森病的可懂度缺陷。

J Speech Lang Hear Res. 2020 Feb 26;63(2):433-443. doi: 10.1044/2019_JSLHR-19-00134.

Artificial intelligence for assisting diagnostics and assessment of Parkinson's disease-A review.用于辅助帕金森病诊断和评估的人工智能——综述

Clin Neurol Neurosurg. 2019 Sep;184:105442. doi: 10.1016/j.clineuro.2019.105442. Epub 2019 Jul 16.

Speech, language and swallowing impairments in functional neurological disorder: a scoping review.功能性神经障碍中的言语、语言和吞咽障碍：范围综述。

Int J Lang Commun Disord. 2019 May;54(3):309-320. doi: 10.1111/1460-6984.12448. Epub 2018 Dec 27.

A Speech Recognition-based Solution for the Automatic Detection of Mild Cognitive Impairment from Spontaneous Speech.一种基于语音识别的解决方案，用于从自发语音中自动检测轻度认知障碍。

Curr Alzheimer Res. 2018;15(2):130-138. doi: 10.2174/1567205014666171121114930.

Functional speech disorders: clinical manifestations, diagnosis, and management.功能性言语障碍：临床表现、诊断与管理

Handb Clin Neurol. 2016;139:379-388. doi: 10.1016/B978-0-12-801772-2.00033-3.

Progression of voice and speech impairment in the course of Parkinson's disease: a longitudinal study.帕金森病病程中语音和言语障碍的进展：一项纵向研究。

Parkinsons Dis. 2013;2013:389195. doi: 10.1155/2013/389195. Epub 2013 Dec 10.

A computer vision framework for finger-tapping evaluation in Parkinson's disease.一种用于帕金森病手指敲击评估的计算机视觉框架。

Artif Intell Med. 2014 Jan;60(1):27-40. doi: 10.1016/j.artmed.2013.11.004. Epub 2013 Nov 25.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于预训练深度学习模型的语音转文本分析用于帕金森语音障碍评估的可行性研究

Feasibility Study of Parkinson's Speech Disorder Evaluation With Pre-Trained Deep Learning Model for Speech-to-Text Analysis.

作者信息

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献