失语症评估工具的自动语音识别平台比较研究。

A Comparative Investigation of Automatic Speech Recognition Platforms for Aphasia Assessment Batteries.

机构信息

Department of Biomedical Engineering, Shantou University, Shantou 515063, China.

Computer and Information Technology Department, IT Institute @ Phoenix College, Phoenix, AZ 85013, USA.

出版信息

Sensors (Basel). 2023 Jan 11;23(2):857. doi: 10.3390/s23020857.

DOI:10.3390/s23020857

PMID:36679654

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9863375/

Abstract

The rehabilitation of aphasics is fundamentally based on the assessment of speech impairment. Developing methods for assessing speech impairment automatically is important due to the growing number of stroke cases each year. Traditionally, aphasia is assessed manually using one of the well-known assessment batteries, such as the Western Aphasia Battery (WAB), the Chinese Rehabilitation Research Center Aphasia Examination (CRRCAE), and the Boston Diagnostic Aphasia Examination (BDAE). In aphasia testing, a speech-language pathologist (SLP) administers multiple subtests to assess people with aphasia (PWA). The traditional assessment is a resource-intensive process that requires the presence of an SLP. Thus, automating the assessment of aphasia is essential. This paper evaluated and compared custom machine learning (ML) speech recognition algorithms against off-the-shelf platforms using healthy and aphasic speech datasets on the naming and repetition subtests of the aphasia battery. Convolutional neural networks (CNN) and linear discriminant analysis (LDA) are the customized ML algorithms, while Microsoft Azure and Google speech recognition are off-the-shelf platforms. The results of this study demonstrated that CNN-based speech recognition algorithms outperform LDA and off-the-shelf platforms. The ResNet-50 architecture of CNN yielded an accuracy of 99.64 ± 0.26% on the healthy dataset. Even though Microsoft Azure was not trained on the same healthy dataset, it still generated comparable results to the LDA and superior results to Google's speech recognition platform.

摘要

失语症患者的康复治疗主要基于言语障碍的评估。由于每年中风病例的不断增加，自动开发言语障碍评估方法非常重要。传统上，通过使用广为人知的评估量表之一（如西方失语症量表 (WAB)、中国康复研究中心失语症检查表 (CRRCAE) 和波士顿诊断性失语症检查 (BDAE)）来手动评估失语症。在失语症测试中，言语治疗师 (SLP) 通过多项子测试评估失语症患者 (PWA)。传统评估是一个资源密集型过程，需要 SLP 的参与。因此，自动化失语症评估至关重要。本文使用失语症电池的命名和重复子测试中的健康和失语症语音数据集，评估和比较了针对定制机器学习 (ML) 语音识别算法和现成平台的比较。卷积神经网络 (CNN) 和线性判别分析 (LDA) 是定制的 ML 算法，而 Microsoft Azure 和 Google 语音识别是现成的平台。这项研究的结果表明，基于 CNN 的语音识别算法优于 LDA 和现成的平台。CNN 的 ResNet-50 架构在健康数据集上的准确率为 99.64 ± 0.26%。尽管 Microsoft Azure 没有在相同的健康数据上进行训练，但它仍然产生了与 LDA 相当的结果，并且优于 Google 的语音识别平台。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5afe/9863375/a05dc69cf67d/sensors-23-00857-g001.jpg

相似文献

A Comparative Investigation of Automatic Speech Recognition Platforms for Aphasia Assessment Batteries.失语症评估工具的自动语音识别平台比较研究。

Sensors (Basel). 2023 Jan 11;23(2):857. doi: 10.3390/s23020857.

Performance Evaluation of Machine Learning Frameworks for Aphasia Assessment.机器学习框架在失语症评估中的性能评估。

Sensors (Basel). 2021 Apr 7;21(8):2582. doi: 10.3390/s21082582.

An Efficient Deep Learning Based Method for Speech Assessment of Mandarin-Speaking Aphasic Patients.基于深度学习的汉语失语症患者语音评估的有效方法。

IEEE J Biomed Health Inform. 2020 Nov;24(11):3191-3202. doi: 10.1109/JBHI.2020.3011104. Epub 2020 Nov 4.

Development and application of a Chinese Version of the Language Screening Test (CLAST) in post-stroke patients.中文版语言筛查测试（CLAST）在中风后患者中的开发与应用。

Medicine (Baltimore). 2020 Sep 11;99(37):e22165. doi: 10.1097/MD.0000000000022165.

Evaluating Fluency in Aphasia: Fluency Scales, Trichotomous Judgements, or Machine Learning.评估失语症中的语言流畅性：流畅性量表、三分法判断还是机器学习。

Aphasiology. 2024;38(1):168-180. doi: 10.1080/02687038.2023.2171261. Epub 2023 Feb 6.

NUVA: A Naming Utterance Verifier for Aphasia Treatment.NUVA：一种用于失语症治疗的命名发声验证器。

Comput Speech Lang. 2021 Sep;69:None. doi: 10.1016/j.csl.2021.101221.

Automatic Assessment of Speech Impairment in Cantonese-speaking People with Aphasia.粤语失语症患者言语障碍的自动评估

IEEE J Sel Top Signal Process. 2020 Feb;14(2):331-345. doi: 10.1109/JSTSP.2019.2956371. Epub 2019 Nov 28.

The unique role of the frontal aslant tract in speech and language processing.额斜束在言语和语言处理中的独特作用。

Neuroimage Clin. 2022;34:103020. doi: 10.1016/j.nicl.2022.103020. Epub 2022 Apr 26.

Cost-effectiveness of speech and language therapy plus scalp acupuncture versus speech and language therapy alone for community-based patients with Broca's aphasia after stroke: a post hoc analysis of data from a randomised controlled trial.言语和语言治疗联合头皮针刺与单纯言语和语言治疗对社区脑卒中后 Broca 失语症患者的成本效益：一项随机对照试验数据的事后分析。

BMJ Open. 2021 Sep 6;11(9):e046609. doi: 10.1136/bmjopen-2020-046609.

Racial-Ethnic Differences in Word Fluency and Auditory Comprehension Among Persons With Poststroke Aphasia.中风后失语症患者在词语流畅性和听觉理解方面的种族-民族差异

Arch Phys Med Rehabil. 2017 Apr;98(4):681-686. doi: 10.1016/j.apmr.2016.10.010. Epub 2016 Nov 10.

引用本文的文献

A computer-aid speech rehabilitation system with mirrored video generating.具有镜像视频生成功能的计算机辅助言语康复系统。

Technol Health Care. 2024;32(S1):543-553. doi: 10.3233/THC-248047.

本文引用的文献

Predicting Severity in People with Aphasia: A Natural Language Processing and Machine Learning Approach.预测失语症患者的严重程度：一种自然语言处理和机器学习方法。

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2299-2302. doi: 10.1109/EMBC46164.2021.9630694.

Machine learning-based multimodal prediction of language outcomes in chronic aphasia.基于机器学习的慢性失语症语言预后的多模态预测。

Hum Brain Mapp. 2021 Apr 15;42(6):1682-1698. doi: 10.1002/hbm.25321. Epub 2020 Dec 30.

An Efficient Deep Learning Based Method for Speech Assessment of Mandarin-Speaking Aphasic Patients.基于深度学习的汉语失语症患者语音评估的有效方法。

IEEE J Biomed Health Inform. 2020 Nov;24(11):3191-3202. doi: 10.1109/JBHI.2020.3011104. Epub 2020 Nov 4.

Automatic Assessment of Speech Impairment in Cantonese-speaking People with Aphasia.粤语失语症患者言语障碍的自动评估

IEEE J Sel Top Signal Process. 2020 Feb;14(2):331-345. doi: 10.1109/JSTSP.2019.2956371. Epub 2019 Nov 28.

The global burden of stroke: persistent and disabling.全球中风负担：持续存在且导致残疾。

Lancet Neurol. 2019 May;18(5):417-418. doi: 10.1016/S1474-4422(19)30030-4. Epub 2019 Mar 11.

Global Burden of Stroke.全球卒中负担。

Circ Res. 2017 Feb 3;120(3):439-448. doi: 10.1161/CIRCRESAHA.116.308413.

Novel speech signal processing algorithms for high-accuracy classification of Parkinson's disease.新型语音信号处理算法可实现帕金森病的高精度分类。

IEEE Trans Biomed Eng. 2012 May;59(5):1264-71. doi: 10.1109/TBME.2012.2183367. Epub 2012 Jan 9.

Comparison of machine learning methods for classifying aphasic and non-aphasic speakers.基于机器学习的失语症和非失语症患者分类方法比较。

Comput Methods Programs Biomed. 2011 Dec;104(3):349-57. doi: 10.1016/j.cmpb.2011.02.015. Epub 2011 Apr 14.

A meta-analysis of clinical outcomes in the treatment of aphasia.失语症治疗临床结果的荟萃分析。

J Speech Lang Hear Res. 1998 Feb;41(1):172-87. doi: 10.1044/jslhr.4101.172.

The Aachen Aphasia Test.亚琛失语症测试。

Adv Neurol. 1984;42:291-303.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

失语症评估工具的自动语音识别平台比较研究。

A Comparative Investigation of Automatic Speech Recognition Platforms for Aphasia Assessment Batteries.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献