• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习在远程康复语音治疗场景中的应用。

Deep learning applications in telerehabilitation speech therapy scenarios.

机构信息

MIFT Department, University Of Messina, Italy; Campus Bio Medico, University Of Rome, Italy.

Speech Language Pathologist, Messina, Italy.

出版信息

Comput Biol Med. 2022 Sep;148:105864. doi: 10.1016/j.compbiomed.2022.105864. Epub 2022 Jul 12.

DOI:10.1016/j.compbiomed.2022.105864
PMID:35853398
Abstract

Nowadays, many application scenarios benefit from automatic speech recognition (ASR) technology. Within the field of speech therapy, in some cases ASR is exploited in the treatment of dysarthria with the aim of supporting articulation output. However, in presence of atypical speech, standard ASR approaches do not provide any reliable result in terms of voice recognition due to main issues, including: (i) the extreme intra and inter-speakers variability of the speech in presence of speech impairments, such as dysarthria; (ii) the absence of dedicated corpora containing voice samples from users with a speech disability to train a state-of-the-art speech model, particularly in non-English languages. In this paper, we focus on isolated word recognition for native Italian speakers with dysarthria and we exploit an existing mobile app to collect audio data from users with speech disorders while they perform articulation exercises for speech therapy purposes. With this data availability, a convolutional neural network has been trained to spot a small number of keywords within atypical speech, according to a speaker dependent method. Finally, we discuss the benefits of the trained ASR system in tailored telerehabilitation contexts intended for patients with dysarthria who can follow treatment plans under the supervision of remote speech language pathologists.

摘要

如今,许多应用场景都受益于自动语音识别(ASR)技术。在言语治疗领域,在某些情况下,ASR 被用于治疗构音障碍,以支持发音输出。然而,在存在非典型语音的情况下,由于主要问题,标准的 ASR 方法在语音识别方面无法提供任何可靠的结果,这些问题包括:(i)在存在语音障碍(如构音障碍)的情况下,语音的极端内和跨说话者变异性;(ii)缺乏包含来自言语障碍用户的语音样本的专用语料库来训练最先进的语音模型,特别是在非英语语言中。在本文中,我们专注于母语为意大利语的构音障碍患者的孤立单词识别,并利用现有的移动应用程序从言语障碍患者那里收集音频数据,当他们为言语治疗目的进行发音练习时。有了这些数据的可用性,我们根据特定说话者的方法,使用卷积神经网络来识别非典型语音中的少量关键字。最后,我们讨论了经过训练的 ASR 系统在专门为构音障碍患者设计的远程康复环境中的优势,这些患者可以在远程言语语言病理学家的监督下遵循治疗计划。

相似文献

1
Deep learning applications in telerehabilitation speech therapy scenarios.深度学习在远程康复语音治疗场景中的应用。
Comput Biol Med. 2022 Sep;148:105864. doi: 10.1016/j.compbiomed.2022.105864. Epub 2022 Jul 12.
2
Dysarthric Speech Transformer: A Sequence-to-Sequence Dysarthric Speech Recognition System.构音障碍语音转换器:一种序列到序列的构音障碍语音识别系统。
IEEE Trans Neural Syst Rehabil Eng. 2023;31:3407-3416. doi: 10.1109/TNSRE.2023.3307020. Epub 2023 Aug 29.
3
Optimising Speaker-Dependent Feature Extraction Parameters to Improve Automatic Speech Recognition Performance for People with Dysarthria.优化说话人相关特征提取参数,以提高构音障碍患者的自动语音识别性能。
Sensors (Basel). 2021 Sep 27;21(19):6460. doi: 10.3390/s21196460.
4
Speech Vision: An End-to-End Deep Learning-Based Dysarthric Automatic Speech Recognition System.言语视觉:基于端到端深度学习的构音障碍自动语音识别系统。
IEEE Trans Neural Syst Rehabil Eng. 2021;29:852-861. doi: 10.1109/TNSRE.2021.3076778. Epub 2021 May 7.
5
The use of speech recognition technology by people living with amyotrophic lateral sclerosis: a scoping review.肌萎缩侧索硬化症患者使用语音识别技术:范围综述。
Disabil Rehabil Assist Technol. 2023 Oct;18(7):1043-1055. doi: 10.1080/17483107.2021.1974961. Epub 2021 Sep 11.
6
Machine learning based sample extraction for automatic speech recognition using dialectal Assamese speech.基于机器学习的方言阿萨姆语语音自动识别样本提取。
Neural Netw. 2016 Jun;78:97-111. doi: 10.1016/j.neunet.2015.12.010. Epub 2015 Dec 30.
7
An Internet-based telerehabilitation system for the assessment of motor speech disorders: a pilot study.一种用于评估运动性言语障碍的基于互联网的远程康复系统:一项试点研究。
Am J Speech Lang Pathol. 2006 Feb;15(1):45-56. doi: 10.1044/1058-0360(2006/006).
8
A speech-controlled environmental control system for people with severe dysarthria.一种用于严重构音障碍患者的语音控制环境控制系统。
Med Eng Phys. 2007 Jun;29(5):586-93. doi: 10.1016/j.medengphy.2006.06.009. Epub 2006 Oct 17.
9
Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders.自动语音识别(ASR)及其作为评估或治疗嗓音、言语和语言障碍的工具的应用。
Logoped Phoniatr Vocol. 2009;34(2):91-6. doi: 10.1080/14015430802657216.
10
Evaluation of an Automatic Speech Recognition Platform for Dysarthric Speech.用于构音障碍语音的自动语音识别平台评估
Folia Phoniatr Logop. 2021;73(5):432-441. doi: 10.1159/000511042. Epub 2020 Nov 13.

引用本文的文献

1
Assistive Technologies for Individuals with a Disability from a Neurological Condition: A Narrative Review on the Multimodal Integration.针对患有神经系统疾病的残疾人的辅助技术:关于多模态整合的叙述性综述
Healthcare (Basel). 2025 Jul 1;13(13):1580. doi: 10.3390/healthcare13131580.
2
Co-designing the integration of voice-based conversational AI and web augmentation to amplify web inclusivity.共同设计基于语音的对话式人工智能与网页增强的整合,以增强网页的包容性。
Sci Rep. 2024 Jul 13;14(1):16162. doi: 10.1038/s41598-024-66725-3.
3
AFM signal model for dysarthric speech classification using speech biomarkers.
基于语音生物标志物的构音障碍语音分类的原子力显微镜信号模型。
Front Hum Neurosci. 2024 Feb 20;18:1346297. doi: 10.3389/fnhum.2024.1346297. eCollection 2024.
4
Models and Approaches for Comprehension of Dysarthric Speech Using Natural Language Processing: Systematic Review.使用自然语言处理理解构音障碍语音的模型与方法:系统综述
JMIR Rehabil Assist Technol. 2023 Oct 27;10:e44489. doi: 10.2196/44489.