Evaluating acoustic representations and normalization for rhoticity classification in children with speech sound disorders.

Affiliations

Communication Sciences & Disorders, Syracuse University, Syracuse, New York 13244, USA.

Electrical and Computer Engineering, University of Maryland, College Park, Maryland 20742, USA.

Publication information

JASA Express Lett. 2024 Feb 1;4(2). doi: 10.1121/10.0024632.

DOI:10.1121/10.0024632

PMID:38299984

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11522988/

Abstract

The effects of different acoustic representations and normalizations were compared for classifiers predicting perception of children's rhotic versus derhotic /ɹ/. Formant and Mel frequency cepstral coefficient (MFCC) representations for 350 speakers were z-standardized, either relative to values in the same utterance or age-and-sex data for typical /ɹ/. Statistical modeling indicated age-and-sex normalization significantly increased classifier performances. Clinically interpretable formants performed similarly to MFCCs and were endorsed for deep neural network engineering, achieving mean test-participant-specific F1-score = 0.81 after personalization and replication (σx = 0.10, med = 0.83, n = 48). Shapley additive explanations analysis indicated the third formant most influenced fully rhotic predictions.
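
The two normalizations and the participant-level F1 summary described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the formant values, and the reference table `HYPOTHETICAL_NORMS` are all invented for illustration, and the choice of population standard deviation and of `1` as the positive (rhotic) label are assumptions.

```python
import numpy as np


def z_within_utterance(frames):
    """Standardize each feature column against the utterance's own mean and SD."""
    frames = np.asarray(frames, dtype=float)
    mean = frames.mean(axis=0)
    sd = frames.std(axis=0)  # population SD (assumption)
    sd = np.where(sd == 0, 1.0, sd)  # avoid dividing by zero on constant columns
    return (frames - mean) / sd


def z_age_sex(frames, norms, age, sex):
    """Standardize against reference mean/SD for typical /r/ at this age and sex."""
    ref_mean, ref_sd = norms[(age, sex)]
    frames = np.asarray(frames, dtype=float)
    return (frames - np.asarray(ref_mean, dtype=float)) / np.asarray(ref_sd, dtype=float)


def f1_binary(y_true, y_pred, positive=1):
    """F1 for one participant: 2*TP / (2*TP + FP + FN); 0.0 if undefined."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def participant_f1_summary(by_participant):
    """Mean and median of per-participant F1 scores."""
    scores = [f1_binary(t, p) for t, p in by_participant.values()]
    return float(np.mean(scores)), float(np.median(scores))


# Hypothetical F1, F2, F3 values in Hz for three frames of one utterance.
UTTERANCE = np.array([
    [520.0, 1450.0, 2900.0],
    [500.0, 1400.0, 2800.0],
    [480.0, 1350.0, 2700.0],
])

# Made-up reference (mean, SD) per formant, keyed by (age in years, sex).
HYPOTHETICAL_NORMS = {(10, "F"): ([500.0, 1400.0, 2000.0], [50.0, 100.0, 200.0])}

z_utt = z_within_utterance(UTTERANCE)
z_ref = z_age_sex(UTTERANCE, HYPOTHETICAL_NORMS, age=10, sex="F")

# Hypothetical (perceptual label, prediction) lists for two test participants.
PREDICTIONS = {
    "p1": ([1, 1, 0, 0], [1, 0, 0, 1]),
    "p2": ([1, 0], [1, 0]),
}
mean_f1, median_f1 = participant_f1_summary(PREDICTIONS)
```

In this sketch, within-utterance scaling centers every column on zero, so it discards how far the utterance sits from a typical target, whereas scaling against the reference table preserves that offset (here the third column stays well above zero).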

Similar articles

Evaluating acoustic representations and normalization for rhoticity classification in children with speech sound disorders.

JASA Express Lett. 2024 Feb 1;4(2). doi: 10.1121/10.0024632.

Acoustic Characteristics of Rhotic Vowel Productions of Young Children.

Folia Phoniatr Logop. 2021;73(2):89-100. doi: 10.1159/000504250. Epub 2019 Dec 13.

Deriving individualised /r/ targets from the acoustics of children's non-rhotic vowels.

Clin Linguist Phon. 2018;32(1):70-87. doi: 10.1080/02699206.2017.1330898. Epub 2017 Jul 13.

Evaluation of formant-like features on an automatic vowel classification task.

J Acoust Soc Am. 2004 Sep;116(3):1781-92. doi: 10.1121/1.1781620.

Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures.

J Acoust Soc Am. 2008 Dec;124(6):3989-4000. doi: 10.1121/1.2997436.

Deep learning in automatic detection of dysphonia: Comparing acoustic features and developing a generalizable framework.

Int J Lang Commun Disord. 2023 Mar;58(2):279-294. doi: 10.1111/1460-6984.12783. Epub 2022 Sep 18.

Acoustic and perceptual evaluation of category goodness of /t/ and /k/ in typical and misarticulated children's speech.

J Acoust Soc Am. 2015 Jun;137(6):3422-35. doi: 10.1121/1.4921033.

Rhotic vowel accuracy and error patterns in young children with and without Speech Sound Disorders.

J Commun Disord. 2019 Jul-Aug;80:18-34. doi: 10.1016/j.jcomdis.2019.03.003. Epub 2019 Mar 22.

Automated Dysarthria Severity Classification: A Study on Acoustic Features and Deep Learning Techniques.

IEEE Trans Neural Syst Rehabil Eng. 2022;30:1147-1157. doi: 10.1109/TNSRE.2022.3169814. Epub 2022 May 4.

A Comparative Study of Features for Acoustic Cough Detection Using Deep Architectures.

Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:2601-2605. doi: 10.1109/EMBC.2019.8856412.

Cited by

Artificial Intelligence-Assisted Speech Therapy for /ɹ/: A Single-Case Experimental Study.

Am J Speech Lang Pathol. 2024 Sep 18;33(5):2461-2486. doi: 10.1044/2024_AJSLP-23-00448. Epub 2024 Aug 22.

References

Reproducible Speech Research With the Artificial Intelligence-Ready PERCEPT Corpora.

J Speech Lang Hear Res. 2023 Jun 20;66(6):1986-2009. doi: 10.1044/2023_JSLHR-22-00343. Epub 2023 Jun 15.

Classification of accurate and misarticulated /r/ for ultrasound biofeedback using tongue part displacement trajectories.

Clin Linguist Phon. 2023 Feb 1;37(2):196-222. doi: 10.1080/02699206.2022.2039777. Epub 2022 Mar 7.

Comparing Biofeedback Types for Children With Residual /ɹ/ Errors in American English: A Single-Case Randomization Design.

Am J Speech Lang Pathol. 2021 Jul 14;30(4):1819-1845. doi: 10.1044/2021_AJSLP-20-00216. Epub 2021 Jul 7.

Mobile apps for treatment of speech disorders in children: An evidence-based analysis of quality and efficacy.

PLoS One. 2018 Aug 9;13(8):e0201513. doi: 10.1371/journal.pone.0201513. eCollection 2018.

Automated speech analysis tools for children's speech production: A systematic literature review.

Int J Speech Lang Pathol. 2018 Nov;20(6):583-598. doi: 10.1080/17549507.2018.1477991. Epub 2018 Jul 11.

Selecting an acoustic correlate for automated measurement of American English rhotic production in children.

Int J Speech Lang Pathol. 2018 Nov;20(6):635-643. doi: 10.1080/17549507.2017.1359334. Epub 2017 Aug 10.

Comparing measurement errors for formants in synthetic and natural vowels.

J Acoust Soc Am. 2016 Feb;139(2):713-27. doi: 10.1121/1.4940665.

Optimizing Vowel Formant Measurements in Four Acoustic Analysis Systems for Diverse Speaker Groups.

Am J Speech Lang Pathol. 2016 Aug 1;25(3):335-54. doi: 10.1044/2015_AJSLP-15-0020.

Adolescent outcomes of children with early speech sound disorders with and without language impairment.

Am J Speech Lang Pathol. 2015 May;24(2):150-63. doi: 10.1044/2014_AJSLP-14-0075.

A multidimensional investigation of children's /r/ productions: perceptual, ultrasound, and acoustic measures.

Am J Speech Lang Pathol. 2013 Aug;22(3):540-53. doi: 10.1044/1058-0360(2013/12-0137). Epub 2013 Jun 28.
