• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于注意力的深度学习人工智能模型对肌萎缩侧索硬化症中的构音障碍进行临床评估和解读。

Clinical assessment and interpretation of dysarthria in ALS using attention based deep learning AI models.

作者信息

Merler Michele, Agurto Carla, Peller Julian, Roitberg Esteban, Taitz Alan, Trevisan Marcos A, Navar Indu, Berry James D, Fraenkel Ernest, Ostrow Lyle W, Cecchi Guillermo A, Norel Raquel

机构信息

IBM Research, Yorktown Heights, NY, USA.

EverythingALS, Peter Cohen Foundation, Los Altos, CA, USA.

出版信息

NPJ Digit Med. 2025 May 8;8(1):260. doi: 10.1038/s41746-025-01654-7.

DOI:10.1038/s41746-025-01654-7
PMID:40341287
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12062206/
Abstract

Speech dysarthria is a key symptom of neurological conditions like ALS, yet existing AI models designed to analyze it from audio signal rely on handcrafted features with limited inference performance. Deep learning approaches improve accuracy but lack interpretability. We propose an attention-based deep learning AI model to assess dysarthria severity based on listener effort ratings. Using 2,102 recordings from 125 participants, rated by three speech-language pathologists on a 100-point scale, we trained models directly from recordings collected remotely. Our best model achieved R of 0.92 and RMSE of 6.78. Attention-based interpretability identified key phonemes, such as vowel sounds influenced by 'r' (e.g., "car," "more"), and isolated inspiration sounds as markers of speech deterioration. This model enhances precision in dysarthria assessment while maintaining clinical interpretability. By improving sensitivity to subtle speech changes, it offers a valuable tool for research and patient care in ALS and other neurological disorders.

摘要

言语构音障碍是肌萎缩侧索硬化症(ALS)等神经系统疾病的关键症状,但现有的旨在从音频信号分析该症状的人工智能模型依赖于手工制作的特征,推理性能有限。深度学习方法提高了准确性,但缺乏可解释性。我们提出了一种基于注意力的深度学习人工智能模型,以根据听众的努力程度评分来评估构音障碍的严重程度。我们使用了125名参与者的2102份录音,由三名言语语言病理学家以100分制进行评分,我们直接从远程收集的录音中训练模型。我们最好的模型的R值为0.92,均方根误差为6.78。基于注意力的可解释性确定了关键音素,如受“r”影响的元音(如“car”、“more”),以及孤立的吸气音作为言语恶化的标志。该模型提高了构音障碍评估的精度,同时保持了临床可解释性。通过提高对细微语音变化的敏感性,它为ALS和其他神经系统疾病的研究及患者护理提供了一个有价值的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/6ebe7e277927/41746_2025_1654_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/d2b7005a2a21/41746_2025_1654_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/5aaa202418b3/41746_2025_1654_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/e635063242b4/41746_2025_1654_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/6ebe7e277927/41746_2025_1654_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/d2b7005a2a21/41746_2025_1654_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/5aaa202418b3/41746_2025_1654_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/e635063242b4/41746_2025_1654_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c3/12062206/6ebe7e277927/41746_2025_1654_Fig4_HTML.jpg

相似文献

1
Clinical assessment and interpretation of dysarthria in ALS using attention based deep learning AI models.使用基于注意力的深度学习人工智能模型对肌萎缩侧索硬化症中的构音障碍进行临床评估和解读。
NPJ Digit Med. 2025 May 8;8(1):260. doi: 10.1038/s41746-025-01654-7.
2
Listener effort quantifies clinically meaningful progression of dysarthria in people living with amyotrophic lateral sclerosis.倾听努力程度量化了肌萎缩侧索硬化症患者构音障碍具有临床意义的进展情况。
medRxiv. 2024 Jun 1:2024.05.31.24308140. doi: 10.1101/2024.05.31.24308140.
3
Subclinical articulatory changes of vowel parameters in Korean amyotrophic lateral sclerosis patients with perceptually normal voices.韩国肌萎缩侧索硬化症患者在可感知的正常嗓音中,其元音参数的亚临床发音变化。
PLoS One. 2023 Oct 13;18(10):e0292460. doi: 10.1371/journal.pone.0292460. eCollection 2023.
4
An exploratory study of student, speech-language pathologist and emergency worker impressions of speakers with dysarthria.一项关于学生、言语语言病理学家和急救人员对构音障碍患者印象的探索性研究。
Int J Speech Lang Pathol. 2021 Jun;23(3):265-274. doi: 10.1080/17549507.2020.1768286. Epub 2020 Jul 14.
5
Sensitivity and specificity of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised to detect dysarthria in individuals with amyotrophic lateral sclerosis.肌萎缩侧索硬化功能评定量表修订版检测肌萎缩侧索硬化患者构音障碍的敏感性和特异性。
Muscle Nerve. 2023 Sep;68(3):296-302. doi: 10.1002/mus.27923. Epub 2023 Jun 21.
6
Automatic Assessment of Intelligibility in Noise in Parkinson Disease: Validation Study.帕金森病噪声环境下言语可懂度的自动评估:验证研究。
J Med Internet Res. 2022 Oct 20;24(10):e40567. doi: 10.2196/40567.
7
Digital markers of motor speech impairments in spontaneous speech of patients with ALS-FTD spectrum disorders.ALS-FTD 谱系障碍患者自发言语中运动言语障碍的数字标志物。
Amyotroph Lateral Scler Frontotemporal Degener. 2024 May;25(3-4):317-325. doi: 10.1080/21678421.2023.2288106. Epub 2023 Dec 5.
8
Acoustic vowel analysis and speech intelligibility in young adult Hebrew speakers: Developmental dysarthria versus typical development.青年希伯来语说话者的声学元音分析和言语可懂度:发育性构音障碍与典型发育。
Int J Lang Commun Disord. 2021 Mar;56(2):283-298. doi: 10.1111/1460-6984.12598. Epub 2021 Feb 1.
9
Digital markers of motor speech impairments in natural speech of patients with ALS-FTD spectrum disorders.肌萎缩侧索硬化-额颞叶痴呆谱系障碍患者自然言语中运动性言语障碍的数字标志物。
medRxiv. 2023 May 1:2023.04.29.23289308. doi: 10.1101/2023.04.29.23289308.
10
Deep-learning models reveal how context and listener attention shape electrophysiological correlates of speech-to-language transformation.深度学习模型揭示了语境和听众注意力如何塑造言语到语言转换的电生理相关性。
PLoS Comput Biol. 2024 Nov 11;20(11):e1012537. doi: 10.1371/journal.pcbi.1012537. eCollection 2024 Nov.

引用本文的文献

1
Acoustic signatures of bulbar ALS: Predictive modeling with sustained vowels and LightGBM.延髓性肌萎缩侧索硬化症的声学特征:基于持续元音和LightGBM的预测建模
eNeurologicalSci. 2025 Jul 30;40:100579. doi: 10.1016/j.ensci.2025.100579. eCollection 2025 Sep.

本文引用的文献

1
An automatic measure for speech intelligibility in dysarthrias-validation across multiple languages and neurological disorders.一种用于构音障碍中言语可懂度的自动测量方法——跨多种语言和神经系统疾病的验证
Front Digit Health. 2024 Jul 23;6:1440986. doi: 10.3389/fdgth.2024.1440986. eCollection 2024.
2
Voice signals database of ALS patients with different dysarthria severity and healthy controls.肌萎缩侧索硬化症患者不同构音障碍严重程度与健康对照者的语音信号数据库。
Sci Data. 2024 Jul 19;11(1):800. doi: 10.1038/s41597-024-03597-2.
3
Practical Considerations and Applied Examples of Cross-Validation for Model Development and Evaluation in Health Care: Tutorial.
医疗保健中模型开发与评估的交叉验证的实际考量与应用示例:教程
JMIR AI. 2023 Dec 18;2:e49023. doi: 10.2196/49023.
4
Word Frequency and Predictability Dissociate in Naturalistic Reading.自然阅读中单词频率与可预测性相互分离。
Open Mind (Camb). 2024 Mar 5;8:177-201. doi: 10.1162/opmi_a_00119. eCollection 2024.
5
AFM signal model for dysarthric speech classification using speech biomarkers.基于语音生物标志物的构音障碍语音分类的原子力显微镜信号模型。
Front Hum Neurosci. 2024 Feb 20;18:1346297. doi: 10.3389/fnhum.2024.1346297. eCollection 2024.
6
Comparison of Vowel and Sentence Intelligibility in People With Dysarthria Secondary to Amyotrophic Lateral Sclerosis.肌萎缩侧索硬化症继发构音障碍患者的元音和句子清晰度比较
J Speech Lang Hear Res. 2024 Feb 20:1-10. doi: 10.1044/2024_JSLHR-23-00497.
7
Driving and suppressing the human language network using large language models.使用大型语言模型驱动和抑制人类语言网络。
Nat Hum Behav. 2024 Mar;8(3):544-561. doi: 10.1038/s41562-023-01783-7. Epub 2024 Jan 3.
8
A Guide to Cross-Validation for Artificial Intelligence in Medical Imaging.医学成像中人工智能的交叉验证指南
Radiol Artif Intell. 2023 May 24;5(4):e220232. doi: 10.1148/ryai.220232. eCollection 2023 Jul.
9
Rate of speech decline in individuals with amyotrophic lateral sclerosis.肌萎缩侧索硬化症患者言语下降率。
Sci Rep. 2022 Sep 20;12(1):15713. doi: 10.1038/s41598-022-19651-1.
10
Phonemes based detection of parkinson's disease for telehealth applications.基于音素的帕金森病检测,用于远程医疗应用。
Sci Rep. 2022 Jun 11;12(1):9687. doi: 10.1038/s41598-022-13865-z.