Suppr超能文献

不同时频表示在构音障碍语音可懂度评估中的研究。

Investigation of Different Time-Frequency Representations for Intelligibility Assessment of Dysarthric Speech.

出版信息

IEEE Trans Neural Syst Rehabil Eng. 2020 Dec;28(12):2880-2889. doi: 10.1109/TNSRE.2020.3035392. Epub 2021 Jan 28.

Abstract

Speech disorders linked to neurological problems affect person's ability to communicate through speech. Dysarthria is one of the speech disorders caused due to muscle weakness producing slow, slurred and less intelligible speech. Automatic intelligibility assessment of dysarthria from speech can be used as a promising clinical tool in treatment. This paper explores the use of perceptually enhanced Fourier transform spectrograms and Constant-Q transform spectrograms with CNN to assess word level and sentence level intelligibility of dysarthric speech from UA and TORGO databases. Constant-Q transform and perceptually enhanced mel warped STFT spectrograms performed better in the classification task.

摘要

与神经问题相关的言语障碍会影响一个人通过言语进行交流的能力。构音障碍是由于肌肉无力导致的言语缓慢、含糊不清且清晰度降低的一种言语障碍。对构音障碍语音的自动可懂度评估可以作为一种很有前途的临床治疗工具。本文探索了使用基于感知的增强傅里叶变换声谱图和恒 Q 变换声谱图与 CNN 相结合,从 UA 和 TORGO 数据库评估构音障碍语音的单词水平和句子水平的可懂度。恒 Q 变换和基于感知的增强梅尔扭曲 STFT 声谱图在分类任务中表现更好。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验