• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探究影响儿童语音自动声学分析可行性的因素。

Examining Factors Influencing the Viability of Automatic Acoustic Analysis of Child Speech.

作者信息

Knowles Thea, Clayards Meghan, Sonderegger Morgan

机构信息

School of Communication Sciences & Disorders, Western University, London, Ontario, Canada.

Health & Rehabilitation Sciences, Western University, London, Ontario, Canada.

出版信息

J Speech Lang Hear Res. 2018 Oct 26;61(10):2487-2501. doi: 10.1044/2018_JSLHR-S-17-0275.

DOI:10.1044/2018_JSLHR-S-17-0275
PMID:30458531
Abstract

PURPOSE

Heterogeneous child speech was force-aligned to investigate whether (a) manipulating specific parameters could improve alignment accuracy and (b) forced alignment could be used to replicate published results on acoustic characteristics of /s/ production by children.

METHOD

In Part 1, child speech from 2 corpora was force-aligned with a trainable aligner (Prosodylab-Aligner) under different conditions that systematically manipulated input training data and the type of transcription used. Alignment accuracy was determined by comparing hand and automatic alignments as to how often they overlapped (%-Match) and absolute differences in duration and boundary placements. Using mixed-effects regression, accuracy was modeled as a function of alignment conditions, as well as segment and child age. In Part 2, forced alignments derived from a subset of the alignment conditions in Part 1 were used to extract spectral center of gravity of /s/ productions from young children. These findings were compared to published results that used manual alignments of the same data.

RESULTS

Overall, the results of Part 1 demonstrated that using training data more similar to the data to be aligned as well as phonetic transcription led to improvements in alignment accuracy. Speech from older children was aligned more accurately than younger children. In Part 2, /s/ center of gravity extracted from force-aligned segments was found to diverge in the speech of male and female children, replicating the pattern found in previous work using manually aligned segments. This was true even for the least accurate forced alignment method.

CONCLUSIONS

Alignment accuracy of child speech can be improved by using more specific training and transcription. However, poor alignment accuracy was not found to impede acoustic analysis of /s/ produced by even very young children. Thus, forced alignment presents a useful tool for the analysis of child speech.

SUPPLEMENTAL MATERIAL

https://doi.org/10.23641/asha.7070105.

摘要

目的

将儿童异质语音进行强制对齐,以研究(a)操纵特定参数是否能提高对齐精度,以及(b)强制对齐是否可用于复制已发表的关于儿童/s/发音声学特征的研究结果。

方法

在第一部分中,来自2个语料库的儿童语音在不同条件下使用可训练对齐器(Prosodylab - Aligner)进行强制对齐,这些条件系统地操纵了输入训练数据和所用转录类型。通过比较人工对齐和自动对齐在重叠频率(%-匹配)以及时长和边界位置的绝对差异来确定对齐精度。使用混合效应回归,将精度建模为对齐条件以及音段和儿童年龄的函数。在第二部分中,利用第一部分中部分对齐条件得出的强制对齐结果,从幼儿的/s/发音中提取频谱重心。将这些结果与使用相同数据的人工对齐得出的已发表结果进行比较。

结果

总体而言,第一部分的结果表明,使用与待对齐数据更相似的训练数据以及语音转录可提高对齐精度。年龄较大儿童的语音比对年龄较小儿童的语音对齐得更准确。在第二部分中,从强制对齐音段中提取的/s/重心在男童和女童的语音中存在差异,这与之前使用人工对齐音段的研究结果一致。即使是最不准确的强制对齐方法也是如此。

结论

通过使用更具体的训练和转录可以提高儿童语音的对齐精度。然而,即使对于非常年幼的儿童,对齐精度较差也并未妨碍对其/s/发音的声学分析。因此,强制对齐是分析儿童语音的一种有用工具。

补充材料

https://doi.org/10.23641/asha.7070105

相似文献

1
Examining Factors Influencing the Viability of Automatic Acoustic Analysis of Child Speech.探究影响儿童语音自动声学分析可行性的因素。
J Speech Lang Hear Res. 2018 Oct 26;61(10):2487-2501. doi: 10.1044/2018_JSLHR-S-17-0275.
2
Performance of Forced-Alignment Algorithms on Children's Speech.强制对齐算法在儿童语音上的性能
J Speech Lang Hear Res. 2021 Jun 18;64(6S):2213-2222. doi: 10.1044/2020_JSLHR-20-00268. Epub 2021 Mar 11.
3
The Mason-Alberta Phonetic Segmenter: a forced alignment system based on deep neural networks and interpolation.梅森-阿尔伯塔音标分段器:一种基于深度神经网络和插值的强制对齐系统。
Phonetica. 2024 Sep 5;81(5):451-508. doi: 10.1515/phon-2024-0015. Print 2024 Oct 28.
4
Automatic alignment for New Englishes: Applying state-of-the-art aligners to Trinidadian English.新英语的自动对齐:将最先进的对齐工具应用于特立尼达英语。
J Acoust Soc Am. 2020 Apr;147(4):2283. doi: 10.1121/10.0001069.
5
Compensatory Strategies in the Developmental Patterns of English /s/: Gender and Vowel Context Effects.英语/s/发音发展模式中的补偿策略:性别与元音语境效应
J Speech Lang Hear Res. 2017 Mar 1;60(3):571-591. doi: 10.1044/2016_JSLHR-L-15-0381.
6
Acoustic and spectral patterns in young children's stop consonant productions.幼儿塞音发音中的声学和频谱模式。
J Acoust Soc Am. 2009 Sep;126(3):1369-78. doi: 10.1121/1.3192350.
7
Using automatic alignment to analyze endangered language data: testing the viability of untrained alignment.使用自动对齐分析濒危语言数据:测试未训练对齐的可行性。
J Acoust Soc Am. 2013 Sep;134(3):2235-46. doi: 10.1121/1.4816491.
8
Acoustic Analysis of Spatiotemporal Variability in Children With Childhood Apraxia of Speech.儿童言语运动障碍的时空可变性声学分析。
J Speech Lang Hear Res. 2024 Oct 8;67(10):3536-3548. doi: 10.1044/2024_JSLHR-24-00079. Epub 2024 Sep 11.
9
Acoustic and spectral characteristics of young children's fricative productions: a developmental perspective.幼儿擦音发音的声学和频谱特征:发展视角
J Acoust Soc Am. 2005 Oct;118(4):2570-8. doi: 10.1121/1.2010407.
10
Acoustic and perceptual evaluation of category goodness of /t/ and /k/ in typical and misarticulated children's speech.典型儿童和发音错误儿童语音中/t/和/k/音类良好度的声学和感知评估。
J Acoust Soc Am. 2015 Jun;137(6):3422-35. doi: 10.1121/1.4921033.

引用本文的文献

1
How Does Alignment Error Affect Automated Pronunciation Scoring in Children's Speech?对齐误差如何影响儿童语音的自动发音评分?
Interspeech. 2024 Sep;2024:5133-5137. doi: 10.21437/interspeech.2024-2239.
2
A Tunable Forced Alignment System Based on Deep Learning: Applications to Child Speech.一种基于深度学习的可调谐强制对齐系统:在儿童语音中的应用。
J Speech Lang Hear Res. 2025 Jul 29;68(7S):3583-3601. doi: 10.1044/2024_JSLHR-24-00347. Epub 2025 Mar 31.
3
Relating Acoustic Measures to Listener Ratings of Children's Productions of Word-Initial /ɹ/ and /w/.
将声学测量与听众对儿童单词首 /ɹ/ 和 /w/ 发音的评分相关联。
J Speech Lang Hear Res. 2023 Sep 13;66(9):3413-3427. doi: 10.1044/2023_JSLHR-22-00713. Epub 2023 Aug 17.
4
Robust Estimation of Hypernasality in Dysarthria with Acoustic Model Likelihood Features.基于声学模型似然特征的构音障碍鼻音过重的稳健估计
IEEE/ACM Trans Audio Speech Lang Process. 2020;28:2511-2522. doi: 10.1109/taslp.2020.3015035. Epub 2020 Aug 7.
5
Performance of Forced-Alignment Algorithms on Children's Speech.强制对齐算法在儿童语音上的性能
J Speech Lang Hear Res. 2021 Jun 18;64(6S):2213-2222. doi: 10.1044/2020_JSLHR-20-00268. Epub 2021 Mar 11.
6
A Deep Learning Algorithm for Objective Assessment of Hypernasality in Children With Cleft Palate.一种用于客观评估腭裂儿童过度鼻音的深度学习算法。
IEEE Trans Biomed Eng. 2021 Oct;68(10):2986-2996. doi: 10.1109/TBME.2021.3058424. Epub 2021 Sep 20.