Wav2DDK：基于远程采集语音的自动弹舌率估算算法的分析和临床验证。

Wav2DDK: Analytical and Clinical Validation of an Automated Diadochokinetic Rate Estimation Algorithm on Remotely Collected Speech.

机构信息

School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe.

Aural Analytics Inc., Tempe, AZ.

出版信息

J Speech Lang Hear Res. 2023 Aug 17;66(8S):3166-3181. doi: 10.1044/2023_JSLHR-22-00282. Epub 2023 Aug 9.

DOI:10.1044/2023_JSLHR-22-00282

PMID:37556308

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10555468/

Abstract

PURPOSE

Oral diadochokinesis is a useful task in assessment of speech motor function in the context of neurological disease. Remote collection of speech tasks provides a convenient alternative to in-clinic visits, but scoring these assessments can be a laborious process for clinicians. This work describes Wav2DDK, an automated algorithm for estimating the diadochokinetic (DDK) rate on remotely collected audio from healthy participants and participants with amyotrophic lateral sclerosis (ALS).

METHOD

Wav2DDK was developed using a corpus of 970 DDK assessments from healthy and ALS speakers where ground truth DDK rates were provided manually by trained annotators. The clinical utility of the algorithm was demonstrated on a corpus of 7,919 assessments collected longitudinally from 26 healthy controls and 82 ALS speakers. Corpora were collected via the participants' own mobile device, and instructions for speech elicitation were provided via a mobile app. DDK rate was estimated by parsing the character transcript from a deep neural network transformer acoustic model trained on healthy and ALS speech.

RESULTS

Algorithm estimated DDK rates are highly accurate, achieving .98 correlation with manual annotation, and an average error of only 0.071 syllables per second. The rate exactly matched ground truth for 83% of files and was within 0.5 syllables per second for 95% of files. Estimated rates achieve a high test-retest reliability ( = .95) and show good correlation with the revised ALS functional rating scale speech subscore ( = .67).

CONCLUSION

We demonstrate a system for automated DDK estimation that increases efficiency of calculation beyond manual annotation. Thorough analytical and clinical validation demonstrates that the algorithm is not only highly accurate, but also provides a convenient, clinically relevant metric for tracking longitudinal decline in ALS, serving to promote participation and diversity of participants in clinical research.

SUPPLEMENTAL MATERIAL

https://doi.org/10.23641/asha.23787033.

摘要

目的

口腔交替运动是评估神经疾病患者言语运动功能的一项有用任务。远程采集语音任务为临床医生提供了一种方便的替代方法，但对这些评估进行评分可能是一个繁琐的过程。本研究描述了 Wav2DDK，这是一种用于估计健康参与者和肌萎缩侧索硬化症（ALS）患者远程采集音频的交替运动（DDK）率的自动化算法。

方法

Wav2DDK 是使用来自健康和 ALS 说话者的 970 个 DDK 评估语料库开发的，其中由经过培训的注释者手动提供 DDK 率的真实值。该算法的临床实用性在 26 名健康对照者和 82 名 ALS 患者的 7919 次纵向采集语料库中得到了证明。语料库是通过参与者自己的移动设备收集的，语音激发的说明是通过移动应用程序提供的。DDK 率是通过解析从健康和 ALS 语音训练的深度神经网络转换器声学模型的字符转写来估计的。

结果

算法估计的 DDK 率非常准确，与手动注释的相关性达到.98，平均误差仅为每秒 0.071 个音节。对于 83%的文件，估计的速率与真实值完全匹配，对于 95%的文件，估计的速率与真实值相差在 0.5 个音节/秒以内。估计的速率具有较高的测试-重测可靠性（r =.95），与修订后的 ALS 功能评定量表言语子评分呈良好相关性（r =.67）。

结论

我们展示了一种用于自动 DDK 估计的系统，该系统提高了计算效率，超越了手动注释。彻底的分析和临床验证表明，该算法不仅高度准确，而且还提供了一种方便的、与临床相关的指标，用于跟踪 ALS 的纵向下降，从而促进了 ALS 临床研究中参与者的多样性和参与度。

补充材料

https://doi.org/10.23641/asha.23787033.

相似文献

Wav2DDK: Analytical and Clinical Validation of an Automated Diadochokinetic Rate Estimation Algorithm on Remotely Collected Speech.Wav2DDK：基于远程采集语音的自动弹舌率估算算法的分析和临床验证。

J Speech Lang Hear Res. 2023 Aug 17;66(8S):3166-3181. doi: 10.1044/2023_JSLHR-22-00282. Epub 2023 Aug 9.

Automated Acoustic Analysis of Oral Diadochokinesis to Assess Bulbar Motor Involvement in Amyotrophic Lateral Sclerosis.利用口腔弹舌音的自动声学分析评估肌萎缩性侧索硬化的延髓运动参与。

J Speech Lang Hear Res. 2020 Jan 15;63(1):59-73. doi: 10.1044/2019_JSLHR-19-00178. Print 2020 Jan 22.

Speech timing and monosyllabic diadochokinesis measures in the assessment of amyotrophic lateral sclerosis in Canadian French.在加拿大法语中，评估肌萎缩侧索硬化症时的言语时程和单音节交替发音测量。

Int J Speech Lang Pathol. 2024 Apr;26(2):267-277. doi: 10.1080/17549507.2023.2214706. Epub 2023 Jun 5.

A speech measure for early stratification of fast and slow progressors of bulbar amyotrophic lateral sclerosis: lip movement jitter.一种用于球部肌萎缩侧索硬化症快速进展者和缓慢进展者早期分层的言语测量方法：唇运动抖动。

Amyotroph Lateral Scler Frontotemporal Degener. 2020 Feb;21(1-2):34-41. doi: 10.1080/21678421.2019.1681454. Epub 2019 Nov 7.

Comparison of Automated Acoustic Methods for Oral Diadochokinesis Assessment in Amyotrophic Lateral Sclerosis.肌萎缩侧索硬化症患者口腔轮替运动评估的自动声学方法比较

J Speech Lang Hear Res. 2020 Oct 16;63(10):3453-3460. doi: 10.1044/2020_JSLHR-20-00109. Epub 2020 Sep 21.

Oral diadochokinetic production in children with typical speech development and speech-sound disorders.儿童在典型言语发育和言语障碍方面的口腔交替运动产生。

Int J Lang Commun Disord. 2023 Sep-Oct;58(5):1783-1798. doi: 10.1111/1460-6984.12908. Epub 2023 May 25.

Validating Automatic Diadochokinesis Analysis Methods Across Dysarthria Severity and Syllable Task in Amyotrophic Lateral Sclerosis.验证自动言语速率分析方法在肌萎缩侧索硬化症不同构音障碍严重程度和音节任务中的有效性。

J Speech Lang Hear Res. 2022 Mar 8;65(3):940-953. doi: 10.1044/2021_JSLHR-21-00503. Epub 2022 Feb 16.

Oral-diadochokinesis between Parkinson's disease and neurotypical elderly among Malaysian-Malay speakers.马来西亚-马来语使用者中帕金森病与神经典型老年人的口部交替运动。

Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1701-1714. doi: 10.1111/1460-6984.13025. Epub 2024 Mar 7.

Oral-diadochokinetic rates for Hebrew-speaking healthy ageing population: non-word versus real-word repetition.说希伯来语的健康老年人群的口腔轮替运动速率：非单词与真实单词重复

Int J Lang Commun Disord. 2017 May;52(3):301-310. doi: 10.1111/1460-6984.12272. Epub 2016 Jul 18.

Diadochokinetic rates in healthy young and elderly Greek-speaking adults: The effect of types of stimuli.健康的年轻和老年希腊语成年人的交替运动率：刺激类型的影响。

Int J Lang Commun Disord. 2022 Sep;57(5):1085-1097. doi: 10.1111/1460-6984.12747. Epub 2022 Jun 15.

引用本文的文献

An Introduction to Machine Learning for Speech-Language Pathologists: Concepts, Terminology, and Emerging Applications.面向言语语言病理学家的机器学习导论：概念、术语及新兴应用

Perspect ASHA Spec Interest Groups. 2025 Apr;10(2):432-450. doi: 10.1044/2024_persp-24-00037. Epub 2025 Apr 1.

Enhancing online speech and language assessment: Item development for the remote adult language experiment (ReAL-E) tool.加强在线言语和语言评估：远程成人语言实验（ReAL-E）工具的项目开发。

J Commun Disord. 2025 Mar-Apr;114:106496. doi: 10.1016/j.jcomdis.2025.106496. Epub 2025 Jan 22.

Smartphone automated motor and speech analysis for early detection of Alzheimer's disease and Parkinson's disease: Validation of TapTalk across 20 different devices.用于早期检测阿尔茨海默病和帕金森病的智能手机自动运动和语音分析：TapTalk在20种不同设备上的验证

Alzheimers Dement (Amst). 2024 Oct 23;16(4):e70025. doi: 10.1002/dad2.70025. eCollection 2024 Oct-Dec.

本文引用的文献

Diadochokinetic rates in healthy young and elderly Greek-speaking adults: The effect of types of stimuli.健康的年轻和老年希腊语成年人的交替运动率：刺激类型的影响。

Int J Lang Commun Disord. 2022 Sep;57(5):1085-1097. doi: 10.1111/1460-6984.12747. Epub 2022 Jun 15.

J Speech Lang Hear Res. 2022 Mar 8;65(3):940-953. doi: 10.1044/2021_JSLHR-21-00503. Epub 2022 Feb 16.

Oral and Laryngeal Diadochokinesis Across the Life Span: A Scoping Review of Methods, Reference Data, and Clinical Applications.全生命周期的口腔和喉部交替运动：方法、参考数据及临床应用的范围综述

J Speech Lang Hear Res. 2022 Feb 9;65(2):574-623. doi: 10.1044/2021_JSLHR-21-00396. Epub 2021 Dec 27.

Digital medicine and the curse of dimensionality.数字医学与维度诅咒

NPJ Digit Med. 2021 Oct 28;4(1):153. doi: 10.1038/s41746-021-00521-5.

A Virtual, Randomized, Control Trial of a Digital Therapeutic for Speech, Language, and Cognitive Intervention in Post-stroke Persons With Aphasia.一项针对中风后失语症患者进行言语、语言和认知干预的数字疗法的虚拟随机对照试验。

Front Neurol. 2021 Feb 12;12:626780. doi: 10.3389/fneur.2021.626780. eCollection 2021.

Early detection and tracking of bulbar changes in ALS via frequent and remote speech analysis.通过频繁的远程语音分析早期检测和跟踪肌萎缩侧索硬化症中的延髓变化。

NPJ Digit Med. 2020 Oct 13;3:132. doi: 10.1038/s41746-020-00335-x. eCollection 2020.

Comparison of Automated Acoustic Methods for Oral Diadochokinesis Assessment in Amyotrophic Lateral Sclerosis.肌萎缩侧索硬化症患者口腔轮替运动评估的自动声学方法比较

J Speech Lang Hear Res. 2020 Oct 16;63(10):3453-3460. doi: 10.1044/2020_JSLHR-20-00109. Epub 2020 Sep 21.

DeepDDK: A Deep Learning based Oral-Diadochokinesis Analysis Software.DeepDDK：一款基于深度学习的口腔轮替运动分析软件。

IEEE EMBS Int Conf Biomed Health Inform. 2019;2019:1-4. doi: 10.1109/bhi.2019.8834506. Epub 2019 Sep 12.

Verification, analytical validation, and clinical validation (V3): the foundation of determining fit-for-purpose for Biometric Monitoring Technologies (BioMeTs).验证、分析验证和临床验证（V3）：确定生物识别监测技术（BioMeTs）适用性的基础。

NPJ Digit Med. 2020 Apr 14;3:55. doi: 10.1038/s41746-020-0260-4. eCollection 2020.

Assessment of speech impairment in patients with Parkinson's disease from acoustic quantifications of oral diadochokinetic sequences.通过口腔快速重复运动序列的声学量化评估帕金森病患者的言语障碍

J Acoust Soc Am. 2020 Feb;147(2):839. doi: 10.1121/10.0000581.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验