Towards A Clinical Tool For Automatic Intelligibility Assessment.

Author information

Berisha Visar, Utianski Rene, Liss Julie

Affiliation

Department of Speech and Hearing Science, Arizona State University.

Publication information

Proc IEEE Int Conf Acoust Speech Signal Process. 2013:2825-2828. doi: 10.1109/ICASSP.2013.6638172.

DOI:10.1109/ICASSP.2013.6638172

PMID:25004985

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4082827/

Abstract

An important, yet under-explored, problem in speech processing is the automatic assessment of intelligibility for pathological speech. In practice, intelligibility assessment is often done through subjective tests administered by speech pathologists; however, research has shown that these tests are inconsistent, costly, and exhibit poor reliability. Although some automatic methods for intelligibility assessment for telecommunications exist, research specific to pathological speech has been limited. Here, we propose an algorithm that captures important multi-scale perceptual cues shown to correlate well with intelligibility. Nonlinear classifiers are trained at each time scale and a final intelligibility decision is made using ensemble learning methods from machine learning. Preliminary results indicate a marked improvement in intelligibility assessment over published baseline results.
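The abstract's pipeline (one nonlinear classifier per time scale, combined by an ensemble rule) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the scale names (`short`, `medium`, `long`), the two-dimensional toy features, the k-nearest-neighbour classifier standing in for the unspecified nonlinear classifiers, and the majority vote standing in for the unspecified ensemble method.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Label a query by majority vote among its k nearest training points.

    k-NN is used only as a simple stand-in for a nonlinear classifier.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def ensemble_predict(train_by_scale, query_by_scale, k=3):
    """Run one classifier per time scale, then combine by majority vote."""
    votes = [
        knn_predict(train_by_scale[scale], query_by_scale[scale], k)
        for scale in train_by_scale
    ]
    return Counter(votes).most_common(1)[0][0]


# Hypothetical toy features: (feature_1, feature_2) -> intelligibility label.
TOY_POINTS = [
    ((0.10, 0.20), "low"), ((0.20, 0.10), "low"), ((0.15, 0.15), "low"),
    ((0.90, 0.80), "high"), ((0.80, 0.90), "high"), ((0.85, 0.85), "high"),
]
# Hypothetical scale names; the same toy data is reused at every scale.
TRAIN = {scale: list(TOY_POINTS) for scale in ("short", "medium", "long")}

# One utterance described at three scales; two scales suggest "high".
QUERY = {"short": (0.8, 0.8), "medium": (0.2, 0.2), "long": (0.9, 0.9)}

if __name__ == "__main__":
    print(ensemble_predict(TRAIN, QUERY))
```

With an odd number of scales and two labels the vote cannot tie; a weighted or stacked combiner would be a natural alternative if the per-scale classifiers differ in reliability.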

Similar articles

Modeling Pathological Speech Perception From Data With Similarity Labels.

Proc IEEE Int Conf Acoust Speech Signal Process. 2014 May;2014:915-919. doi: 10.1109/ICASSP.2014.6853730.

Speech technology-based assessment of phoneme intelligibility in dysarthria.

Int J Lang Commun Disord. 2009 Sep-Oct;44(5):716-30. doi: 10.1080/13682820802342062.

Validation and cross-linguistic adaptation of the Frenchay Dysarthria Assessment (FDA-2) speech intelligibility tests: Hebrew version.

Int J Lang Commun Disord. 2022 Sep;57(5):1023-1049. doi: 10.1111/1460-6984.12737. Epub 2022 Jun 17.

Intelligibility assessment of cleft lip and palate speech using Gaussian posteriograms based on joint spectro-temporal features.

J Acoust Soc Am. 2018 Oct;144(4):2413. doi: 10.1121/1.5064463.

Language-independent automatic evaluation of intelligibility of chronically hoarse persons.

Folia Phoniatr Logop. 2014;66(6):219-26. doi: 10.1159/000365969. Epub 2015 Jan 31.

Automatic intelligibility classification of sentence-level pathological speech.

Comput Speech Lang. 2015 Jan;29(1):132-144. doi: 10.1016/j.csl.2014.02.001.

A serious game for speech training in dysarthric speakers with Parkinson's disease: Exploring therapeutic efficacy and patient satisfaction.

Int J Lang Commun Disord. 2022 Jul;57(4):808-821. doi: 10.1111/1460-6984.12722. Epub 2022 Mar 26.

Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery.

IEEE Trans Biomed Eng. 2017 Nov;64(11):2584-2594. doi: 10.1109/TBME.2016.2644258.

Intelligibility of laryngectomees' substitute speech: automatic speech recognition and subjective rating.

Eur Arch Otorhinolaryngol. 2006 Feb;263(2):188-93. doi: 10.1007/s00405-005-0974-6. Epub 2005 Jul 7.

Cited by

Feature engineering and machine learning for computer-assisted screening of children with speech disorders.

PLOS Digit Health. 2022 May 26;1(5):e0000041. doi: 10.1371/journal.pdig.0000041. eCollection 2022 May.

Intelligibility in Context Scale: Sensitivity and specificity in the Jamaican context.

Clin Linguist Phon. 2021 Feb 1;35(2):154-171. doi: 10.1080/02699206.2020.1766574. Epub 2020 May 28.

Toward clinical application of landmark-based speech analysis: Landmark expression in normal adult speech.

J Acoust Soc Am. 2017 Nov;142(5):EL441. doi: 10.1121/1.5009687.

Predicting Intelligibility Gains in Dysarthria Through Automated Speech Feature Analysis.

J Speech Lang Hear Res. 2017 Nov 9;60(11):3058-3068. doi: 10.1044/2017_JSLHR-S-16-0453.

References

Perceptual learning of dysarthric speech: a review of experimental studies.

J Speech Lang Hear Res. 2012 Feb;55(1):290-305. doi: 10.1044/1092-4388(2011/10-0349). Epub 2011 Dec 22.

Discriminating dysarthria type from envelope modulation spectra.

J Speech Lang Hear Res. 2010 Oct;53(5):1246-55. doi: 10.1044/1092-4388(2010/09-0121). Epub 2010 Jul 19.

Speech and language therapy for dysarthria due to non-progressive brain damage.

Cochrane Database Syst Rev. 2005 Jul 20(3):CD002088. doi: 10.1002/14651858.CD002088.pub2.

The effects of familiarization on intelligibility and lexical segmentation in hypokinetic and ataxic dysarthria.

J Acoust Soc Am. 2002 Dec;112(6):3022-30. doi: 10.1121/1.1515793.

Intelligibility as a linear combination of dimensions in dysarthric speech.

J Commun Disord. 2002 May-Jun;35(3):283-92. doi: 10.1016/s0021-9924(02)00065-5.

Dysarthric speech: a comparison of computerized speech recognition and listener intelligibility.

J Rehabil Res Dev. 1997 Jul;34(3):309-16.

Treatment efficacy: dysarthria.

J Speech Hear Res. 1996 Oct;39(5):S46-57. doi: 10.1044/jshr.3905.s46.
