• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

发育性运动言语障碍中单词和句子层面言语可懂度的自动评估:一项跨语言研究。

Automated Assessment of Word- and Sentence-Level Speech Intelligibility in Developmental Motor Speech Disorders: A Cross-Linguistic Investigation.

作者信息

Carl Micalle, Icht Michal

机构信息

Department of Communication Disorders, Ariel University, Ariel 40700, Israel.

出版信息

Diagnostics (Basel). 2025 Jul 28;15(15):1892. doi: 10.3390/diagnostics15151892.

DOI:10.3390/diagnostics15151892
PMID:40804857
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12345943/
Abstract

: Accurate assessment of speech intelligibility is necessary for individuals with motor speech disorders. Transcription or scaled rating methods by naïve listeners are the most reliable tasks for these purposes; however, they are often resource-intensive and time-consuming within clinical contexts. Automatic speech recognition (ASR) systems, which transcribe speech into text, have been increasingly utilized for assessing speech intelligibility. This study investigates the feasibility of using an open-source ASR system to assess speech intelligibility in Hebrew and English speakers with Down syndrome (DS). : Recordings from 65 Hebrew- and English-speaking participants were included: 33 speakers with DS and 32 typically developing (TD) peers. Speech samples (words, sentences) were transcribed using Whisper (OpenAI) and by naïve listeners. The proportion of agreement between ASR transcriptions and those of naïve listeners was compared across speaker groups (TD, DS) and languages (Hebrew, English) for word-level data. Further comparisons for Hebrew speakers were conducted across speaker groups and stimuli (words, sentences). : The strength of the correlation between listener and ASR transcription scores varied across languages, and was higher for English ( = 0.98) than for Hebrew ( = 0.81) for speakers with DS. A higher proportion of listener-ASR agreement was demonstrated for TD speakers, as compared to those with DS (0.94 vs. 0.74, respectively), and for English, in comparison to Hebrew speakers (0.91 for English DS speakers vs. 0.74 for Hebrew DS speakers). Listener-ASR agreement for single words was consistently higher than for sentences among Hebrew speakers. Speakers' intelligibility influenced word-level agreement among Hebrew- but not English-speaking participants with DS. : ASR performance for English closely approximated that of naïve listeners, suggesting potential near-future clinical applicability within single-word intelligibility assessment. In contrast, a lower proportion of agreement between human listeners and ASR for Hebrew speech indicates that broader clinical implementation may require further training of ASR models in this language.

摘要

对于患有运动性言语障碍的个体而言,准确评估言语可懂度是必要的。由未经专业训练的听众进行转录或采用量表评分方法是实现这些目的最可靠的任务;然而,在临床环境中,它们往往资源消耗大且耗时。自动语音识别(ASR)系统可将语音转录为文本,已越来越多地用于评估言语可懂度。本研究调查了使用开源ASR系统评估患有唐氏综合征(DS)的希伯来语和英语使用者言语可懂度的可行性。

纳入了65名讲希伯来语和英语参与者的录音:33名患有DS的说话者和32名发育正常(TD)的同龄人。使用Whisper(OpenAI)和未经专业训练的听众对语音样本(单词、句子)进行转录。针对单词级数据,比较了ASR转录与未经专业训练的听众转录之间的一致比例,涉及说话者群体(TD、DS)和语言(希伯来语、英语)。针对希伯来语使用者,还在说话者群体和刺激类型(单词、句子)之间进行了进一步比较。

听众与ASR转录分数之间的相关强度因语言而异,对于患有DS的说话者,英语的相关性更高(=0.98),高于希伯来语(=0.81)。与患有DS的说话者相比,TD说话者的听众 - ASR一致性比例更高(分别为0.94对0.74),与希伯来语使用者相比,英语使用者的比例更高(英语DS使用者为0.91,希伯来语DS使用者为0.74)。在希伯来语使用者中,单个单词的听众 - ASR一致性始终高于句子。患有DS的希伯来语使用者的可懂度影响单词级一致性,但英语使用者并非如此。

英语的ASR表现与未经专业训练的听众的表现非常接近,表明在单词可懂度评估方面近期可能具有临床适用性。相比之下,希伯来语语音的人类听众与ASR之间的一致性比例较低,这表明更广泛的临床应用可能需要对该语言的ASR模型进行进一步训练。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca1c/12345943/aa3d487c137f/diagnostics-15-01892-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca1c/12345943/8612213fd799/diagnostics-15-01892-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca1c/12345943/b573229a4cb3/diagnostics-15-01892-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca1c/12345943/aa3d487c137f/diagnostics-15-01892-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca1c/12345943/8612213fd799/diagnostics-15-01892-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca1c/12345943/b573229a4cb3/diagnostics-15-01892-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca1c/12345943/aa3d487c137f/diagnostics-15-01892-g003.jpg

相似文献

1
Automated Assessment of Word- and Sentence-Level Speech Intelligibility in Developmental Motor Speech Disorders: A Cross-Linguistic Investigation.发育性运动言语障碍中单词和句子层面言语可懂度的自动评估:一项跨语言研究。
Diagnostics (Basel). 2025 Jul 28;15(15):1892. doi: 10.3390/diagnostics15151892.
2
The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.儿科言语和语言治疗师转写语音样本的音标转录的一致性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
The development of a novel, standardized, norm-referenced Arabic Discourse Assessment Tool (ADAT), including an examination of psychometric properties of discourse measures in aphasia.开发一种新型、标准化、基于常模的阿拉伯语语篇评估工具(ADAT),包括评估失语症患者语篇测量的心理测量特性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):2103-2117. doi: 10.1111/1460-6984.13083. Epub 2024 Jun 18.
5
Factors affecting judgment accuracy when scoring children's responses to non-word repetition stimuli in real time.实时评分儿童对非词重复刺激反应时影响判断准确性的因素。
Int J Lang Commun Disord. 2024 Mar-Apr;59(2):678-697. doi: 10.1111/1460-6984.12954. Epub 2023 Oct 9.
6
Seeing a Talker's Mouth Reduces the Effort of Perceiving Speech and Repairing Perceptual Mistakes for Listeners With Cochlear Implants.看到说话者的嘴部动作可减轻人工耳蜗佩戴者感知语音和纠正感知错误的难度。
Ear Hear. 2025 Jun 16. doi: 10.1097/AUD.0000000000001683.
7
Prosodic skills in Spanish-speaking adolescents and young adults with Down syndrome.西班牙语为母语的唐氏综合征青少年和成年人的韵律技能。
Int J Lang Commun Disord. 2024 Jul-Aug;59(4):1284-1295. doi: 10.1111/1460-6984.13001. Epub 2023 Dec 28.
8
Non-speech oral motor treatment for children with developmental speech sound disorders.针对发育性语音障碍儿童的非言语口腔运动治疗。
Cochrane Database Syst Rev. 2015 Mar 25;2015(3):CD009383. doi: 10.1002/14651858.CD009383.pub2.
9
A systematic review of speech, language and communication interventions for children with Down syndrome from 0 to 6 years.对0至6岁唐氏综合征儿童言语、语言和沟通干预措施的系统评价。
Int J Lang Commun Disord. 2022 Mar;57(2):441-463. doi: 10.1111/1460-6984.12699. Epub 2022 Feb 22.
10
A scoping review of transcription-less practices for analysis of aphasic discourse and implications for future research.无转录语料分析的范围综述及对未来研究的启示
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1734-1762. doi: 10.1111/1460-6984.13028. Epub 2024 Mar 23.

本文引用的文献

1
Perceptual and acoustic predictors of speech intelligibility among Hebrew-speaking young adults with down syndrome.
J Commun Disord. 2025 May-Jun;115:106529. doi: 10.1016/j.jcomdis.2025.106529. Epub 2025 Apr 21.
2
Can We Trust Our Ears? How Accurate and Reliable Are Speech-Language Pathologists' Estimates of Children's Speech Intelligibility?我们能相信自己的耳朵吗?言语语言病理学家对儿童言语清晰度的评估有多准确和可靠?
Am J Speech Lang Pathol. 2025 Mar 10;34(2):853-867. doi: 10.1044/2024_AJSLP-24-00247. Epub 2025 Feb 24.
3
Speech Technology for Automatic Recognition and Assessment of Dysarthric Speech: An Overview.
J Speech Lang Hear Res. 2025 Feb 4;68(2):547-577. doi: 10.1044/2024_JSLHR-23-00740. Epub 2025 Jan 15.
4
Decoding disparities: evaluating automatic speech recognition system performance in transcribing Black and White patient verbal communication with nurses in home healthcare.解码差异:评估自动语音识别系统在转录家庭医疗保健中黑人和白人患者与护士的言语交流方面的性能。
JAMIA Open. 2024 Dec 10;7(4):ooae130. doi: 10.1093/jamiaopen/ooae130. eCollection 2024 Dec.
5
Automatic Speech Recognition in Primary Progressive Apraxia of Speech.原发性进行性运动性构音障碍的自动语音识别。
J Speech Lang Hear Res. 2024 Sep 12;67(9):2964-2976. doi: 10.1044/2024_JSLHR-24-00049. Epub 2024 Aug 6.
6
Accuracy of Speech Sound Analysis: Comparison of an Automatic Artificial Intelligence Algorithm With Clinician Assessment.语音分析的准确性:自动人工智能算法与临床医生评估的比较。
J Speech Lang Hear Res. 2024 Sep 12;67(9):3004-3021. doi: 10.1044/2024_JSLHR-24-00009. Epub 2024 Aug 22.
7
An automatic measure for speech intelligibility in dysarthrias-validation across multiple languages and neurological disorders.一种用于构音障碍中言语可懂度的自动测量方法——跨多种语言和神经系统疾病的验证
Front Digit Health. 2024 Jul 23;6:1440986. doi: 10.3389/fdgth.2024.1440986. eCollection 2024.
8
How People Living With Amyotrophic Lateral Sclerosis Use Personalized Automatic Speech Recognition Technology to Support Communication.肌萎缩侧索硬化症患者如何使用个性化自动语音识别技术来支持交流。
J Speech Lang Hear Res. 2024 Nov 7;67(11):4186-4202. doi: 10.1044/2024_JSLHR-24-00097. Epub 2024 Jul 11.
9
Automatic Speech Recognition of Conversational Speech in Individuals With Disordered Speech.口语障碍者会话语音的自动语音识别。
J Speech Lang Hear Res. 2024 Nov 7;67(11):4176-4185. doi: 10.1044/2024_JSLHR-24-00045. Epub 2024 Jul 4.
10
Evaluating OpenAI's Whisper ASR: Performance analysis across diverse accents and speaker traits.评估OpenAI的Whisper自动语音识别技术:不同口音和说话者特征下的性能分析。
JASA Express Lett. 2024 Feb 1;4(2). doi: 10.1121/10.0024876.