发育性运动言语障碍中单词和句子层面言语可懂度的自动评估：一项跨语言研究。

Automated Assessment of Word- and Sentence-Level Speech Intelligibility in Developmental Motor Speech Disorders: A Cross-Linguistic Investigation.

作者信息

Carl Micalle, Icht Michal

机构信息

Department of Communication Disorders, Ariel University, Ariel 40700, Israel.

出版信息

Diagnostics (Basel). 2025 Jul 28;15(15):1892. doi: 10.3390/diagnostics15151892.

DOI:10.3390/diagnostics15151892

PMID:40804857

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12345943/

Abstract

: Accurate assessment of speech intelligibility is necessary for individuals with motor speech disorders. Transcription or scaled rating methods by naïve listeners are the most reliable tasks for these purposes; however, they are often resource-intensive and time-consuming within clinical contexts. Automatic speech recognition (ASR) systems, which transcribe speech into text, have been increasingly utilized for assessing speech intelligibility. This study investigates the feasibility of using an open-source ASR system to assess speech intelligibility in Hebrew and English speakers with Down syndrome (DS). : Recordings from 65 Hebrew- and English-speaking participants were included: 33 speakers with DS and 32 typically developing (TD) peers. Speech samples (words, sentences) were transcribed using Whisper (OpenAI) and by naïve listeners. The proportion of agreement between ASR transcriptions and those of naïve listeners was compared across speaker groups (TD, DS) and languages (Hebrew, English) for word-level data. Further comparisons for Hebrew speakers were conducted across speaker groups and stimuli (words, sentences). : The strength of the correlation between listener and ASR transcription scores varied across languages, and was higher for English ( = 0.98) than for Hebrew ( = 0.81) for speakers with DS. A higher proportion of listener-ASR agreement was demonstrated for TD speakers, as compared to those with DS (0.94 vs. 0.74, respectively), and for English, in comparison to Hebrew speakers (0.91 for English DS speakers vs. 0.74 for Hebrew DS speakers). Listener-ASR agreement for single words was consistently higher than for sentences among Hebrew speakers. Speakers' intelligibility influenced word-level agreement among Hebrew- but not English-speaking participants with DS. : ASR performance for English closely approximated that of naïve listeners, suggesting potential near-future clinical applicability within single-word intelligibility assessment. In contrast, a lower proportion of agreement between human listeners and ASR for Hebrew speech indicates that broader clinical implementation may require further training of ASR models in this language.

摘要

对于患有运动性言语障碍的个体而言，准确评估言语可懂度是必要的。由未经专业训练的听众进行转录或采用量表评分方法是实现这些目的最可靠的任务；然而，在临床环境中，它们往往资源消耗大且耗时。自动语音识别（ASR）系统可将语音转录为文本，已越来越多地用于评估言语可懂度。本研究调查了使用开源ASR系统评估患有唐氏综合征（DS）的希伯来语和英语使用者言语可懂度的可行性。

纳入了65名讲希伯来语和英语参与者的录音：33名患有DS的说话者和32名发育正常（TD）的同龄人。使用Whisper（OpenAI）和未经专业训练的听众对语音样本（单词、句子）进行转录。针对单词级数据，比较了ASR转录与未经专业训练的听众转录之间的一致比例，涉及说话者群体（TD、DS）和语言（希伯来语、英语）。针对希伯来语使用者，还在说话者群体和刺激类型（单词、句子）之间进行了进一步比较。

听众与ASR转录分数之间的相关强度因语言而异，对于患有DS的说话者，英语的相关性更高（=0.98），高于希伯来语（=0.81）。与患有DS的说话者相比，TD说话者的听众 - ASR一致性比例更高（分别为0.94对0.74），与希伯来语使用者相比，英语使用者的比例更高（英语DS使用者为0.91，希伯来语DS使用者为0.74）。在希伯来语使用者中，单个单词的听众 - ASR一致性始终高于句子。患有DS的希伯来语使用者的可懂度影响单词级一致性，但英语使用者并非如此。

英语的ASR表现与未经专业训练的听众的表现非常接近，表明在单词可懂度评估方面近期可能具有临床适用性。相比之下，希伯来语语音的人类听众与ASR之间的一致性比例较低，这表明更广泛的临床应用可能需要对该语言的ASR模型进行进一步训练。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca1c/12345943/8612213fd799/diagnostics-15-01892-g001.jpg

相似文献

Automated Assessment of Word- and Sentence-Level Speech Intelligibility in Developmental Motor Speech Disorders: A Cross-Linguistic Investigation.

Diagnostics (Basel). 2025 Jul 28;15(15):1892. doi: 10.3390/diagnostics15151892.

The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.

Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.

Prescription of Controlled Substances: Benefits and Risks

The development of a novel, standardized, norm-referenced Arabic Discourse Assessment Tool (ADAT), including an examination of psychometric properties of discourse measures in aphasia.

Int J Lang Commun Disord. 2024 Sep-Oct;59(5):2103-2117. doi: 10.1111/1460-6984.13083. Epub 2024 Jun 18.

Factors affecting judgment accuracy when scoring children's responses to non-word repetition stimuli in real time.

Int J Lang Commun Disord. 2024 Mar-Apr;59(2):678-697. doi: 10.1111/1460-6984.12954. Epub 2023 Oct 9.

Seeing a Talker's Mouth Reduces the Effort of Perceiving Speech and Repairing Perceptual Mistakes for Listeners With Cochlear Implants.

Ear Hear. 2025 Jun 16. doi: 10.1097/AUD.0000000000001683.

Prosodic skills in Spanish-speaking adolescents and young adults with Down syndrome.

Int J Lang Commun Disord. 2024 Jul-Aug;59(4):1284-1295. doi: 10.1111/1460-6984.13001. Epub 2023 Dec 28.

Non-speech oral motor treatment for children with developmental speech sound disorders.

Cochrane Database Syst Rev. 2015 Mar 25;2015(3):CD009383. doi: 10.1002/14651858.CD009383.pub2.

A systematic review of speech, language and communication interventions for children with Down syndrome from 0 to 6 years.

Int J Lang Commun Disord. 2022 Mar;57(2):441-463. doi: 10.1111/1460-6984.12699. Epub 2022 Feb 22.

A scoping review of transcription-less practices for analysis of aphasic discourse and implications for future research.

Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1734-1762. doi: 10.1111/1460-6984.13028. Epub 2024 Mar 23.

本文引用的文献

Perceptual and acoustic predictors of speech intelligibility among Hebrew-speaking young adults with down syndrome.

J Commun Disord. 2025 May-Jun;115:106529. doi: 10.1016/j.jcomdis.2025.106529. Epub 2025 Apr 21.

Can We Trust Our Ears? How Accurate and Reliable Are Speech-Language Pathologists' Estimates of Children's Speech Intelligibility?

Am J Speech Lang Pathol. 2025 Mar 10;34(2):853-867. doi: 10.1044/2024_AJSLP-24-00247. Epub 2025 Feb 24.

Speech Technology for Automatic Recognition and Assessment of Dysarthric Speech: An Overview.

J Speech Lang Hear Res. 2025 Feb 4;68(2):547-577. doi: 10.1044/2024_JSLHR-23-00740. Epub 2025 Jan 15.

Decoding disparities: evaluating automatic speech recognition system performance in transcribing Black and White patient verbal communication with nurses in home healthcare.

JAMIA Open. 2024 Dec 10;7(4):ooae130. doi: 10.1093/jamiaopen/ooae130. eCollection 2024 Dec.

Automatic Speech Recognition in Primary Progressive Apraxia of Speech.

J Speech Lang Hear Res. 2024 Sep 12;67(9):2964-2976. doi: 10.1044/2024_JSLHR-24-00049. Epub 2024 Aug 6.

Accuracy of Speech Sound Analysis: Comparison of an Automatic Artificial Intelligence Algorithm With Clinician Assessment.

J Speech Lang Hear Res. 2024 Sep 12;67(9):3004-3021. doi: 10.1044/2024_JSLHR-24-00009. Epub 2024 Aug 22.

An automatic measure for speech intelligibility in dysarthrias-validation across multiple languages and neurological disorders.

Front Digit Health. 2024 Jul 23;6:1440986. doi: 10.3389/fdgth.2024.1440986. eCollection 2024.

How People Living With Amyotrophic Lateral Sclerosis Use Personalized Automatic Speech Recognition Technology to Support Communication.

J Speech Lang Hear Res. 2024 Nov 7;67(11):4186-4202. doi: 10.1044/2024_JSLHR-24-00097. Epub 2024 Jul 11.

Automatic Speech Recognition of Conversational Speech in Individuals With Disordered Speech.

J Speech Lang Hear Res. 2024 Nov 7;67(11):4176-4185. doi: 10.1044/2024_JSLHR-24-00045. Epub 2024 Jul 4.

Evaluating OpenAI's Whisper ASR: Performance analysis across diverse accents and speaker traits.

JASA Express Lett. 2024 Feb 1;4(2). doi: 10.1121/10.0024876.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

发育性运动言语障碍中单词和句子层面言语可懂度的自动评估：一项跨语言研究。

Automated Assessment of Word- and Sentence-Level Speech Intelligibility in Developmental Motor Speech Disorders: A Cross-Linguistic Investigation.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献