• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于声学的新型发音距离度量。

A New Acoustic-Based Pronunciation Distance Measure.

作者信息

Bartelds Martijn, Richter Caitlin, Liberman Mark, Wieling Martijn

机构信息

Center for Language and Cognition, Faculty of Arts, University of Groningen, Groningen, Netherlands.

Department of Linguistics, University of Pennsylvania, Philadelphia, PA, United States.

出版信息

Front Artif Intell. 2020 May 29;3:39. doi: 10.3389/frai.2020.00039. eCollection 2020.

DOI:10.3389/frai.2020.00039
PMID:33733156
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7861290/
Abstract

We present an acoustic distance measure for comparing pronunciations, and apply the measure to assess foreign accent strength in American-English by comparing speech of non-native American-English speakers to a collection of native American-English speakers. An acoustic-only measure is valuable as it does not require the time-consuming and error-prone process of phonetically transcribing speech samples which is necessary for current edit distance-based approaches. We minimize speaker variability in the data set by employing speaker-based cepstral mean and variance normalization, and compute word-based acoustic distances using the dynamic time warping algorithm. Our results indicate a strong correlation of = -0.71 ( < 0.0001) between the acoustic distances and human judgments of native-likeness provided by more than 1,100 native American-English raters. Therefore, the convenient acoustic measure performs only slightly lower than the state-of-the-art transcription-based performance of = -0.77. We also report the results of several small experiments which show that the acoustic measure is not only sensitive to segmental differences, but also to intonational differences and durational differences. However, it is not immune to unwanted differences caused by using a different recording device.

摘要

我们提出了一种用于比较发音的声学距离度量方法,并通过将非美国英语母语者的语音与美国英语母语者的语音集合进行比较,应用该度量方法来评估美国英语中的外国口音强度。仅基于声学的度量方法很有价值,因为它不需要当前基于编辑距离的方法所必需的对语音样本进行耗时且容易出错的语音转录过程。我们通过采用基于说话者的倒谱均值和方差归一化来最小化数据集中的说话者变异性,并使用动态时间规整算法计算基于单词的声学距离。我们的结果表明,声学距离与1100多名美国英语母语评分者对母语相似度的人类判断之间存在很强的相关性,相关系数为 = -0.71( < 0.0001)。因此,这种便捷的声学度量方法的表现仅略低于基于转录的最先进方法的表现 = -0.77。我们还报告了几个小型实验的结果,这些结果表明,声学度量方法不仅对音段差异敏感,而且对语调差异和时长差异也敏感。然而,它无法避免因使用不同录音设备而产生的不必要差异。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30f6/7861290/896cf31c43cd/frai-03-00039-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30f6/7861290/95af428bf242/frai-03-00039-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30f6/7861290/5759f11e9d40/frai-03-00039-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30f6/7861290/896cf31c43cd/frai-03-00039-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30f6/7861290/95af428bf242/frai-03-00039-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30f6/7861290/5759f11e9d40/frai-03-00039-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30f6/7861290/896cf31c43cd/frai-03-00039-g0003.jpg

相似文献

1
A New Acoustic-Based Pronunciation Distance Measure.一种基于声学的新型发音距离度量。
Front Artif Intell. 2020 May 29;3:39. doi: 10.3389/frai.2020.00039. eCollection 2020.
2
Listening Effort by Native and Nonnative Listeners Due to Noise, Reverberation, and Talker Foreign Accent During English Speech Perception.母语和非母语听者在英语语音感知中因噪声、混响和说话者外国口音而产生的听力努力。
J Speech Lang Hear Res. 2019 Apr 15;62(4):1068-1081. doi: 10.1044/2018_JSLHR-H-17-0423.
3
A cognitively grounded measure of pronunciation distance.一种基于认知的发音距离度量方法。
PLoS One. 2014 Jan 9;9(1):e75734. doi: 10.1371/journal.pone.0075734. eCollection 2014.
4
Is There a Foreign Effect on Moral Judgment?国外因素对道德判断有影响吗?
Brain Sci. 2021 Dec 10;11(12):1631. doi: 10.3390/brainsci11121631.
5
Detecting Foreign Accents in Song.检测歌曲中的外国口音。
Phonetica. 2019;76(6):429-447. doi: 10.1159/000500187. Epub 2019 Jul 9.
6
Comparative Acoustic Analyses of L2 English: The Search for Systematic Variation.二语英语的比较声学分析:系统变异的探寻。
Phonetica. 2020;77(6):441-479. doi: 10.1159/000508387. Epub 2020 Jul 21.
7
Word Durations in Non-Native English.非英语母语者的单词时长
J Phon. 2011 Jan 1;39(1):1-17. doi: 10.1016/j.wocn.2010.10.006.
8
Acoustic and Articulatory Characteristics of English Semivowels /ɹ, l, w/ Produced by Adult Second-Language Speakers.成人二语者产出的英语半元音/ɹ, l, w/的声学和发音特征。
J Speech Lang Hear Res. 2022 Mar 8;65(3):890-905. doi: 10.1044/2021_JSLHR-21-00152. Epub 2022 Feb 1.
9
Acoustic characteristics of Korean-English bilingual speakers' /l/ and the relationship to their foreign accent ratings.韩英双语者/l/的声学特征及其与外国口音评级的关系。
J Commun Disord. 2021 Nov-Dec;94:106157. doi: 10.1016/j.jcomdis.2021.106157. Epub 2021 Oct 6.
10
Shhh… I Need Quiet! Children's Understanding of American, British, and Japanese-accented English Speakers.嘘……我需要安静!儿童对带有美国、英国和日本口音的英语使用者的理解。
Lang Speech. 2018 Dec;61(4):657-673. doi: 10.1177/0023830918754598. Epub 2018 Feb 5.

引用本文的文献

1
Differential weighting of information during aloud and silent reading: Evidence from representational similarity analysis of fMRI data.出声阅读和默读过程中信息的差异加权:来自功能磁共振成像数据表征相似性分析的证据。
Imaging Neurosci (Camb). 2025 Jan 13;3. doi: 10.1162/imag_a_00428. eCollection 2025.
2
The Mason-Alberta Phonetic Segmenter: a forced alignment system based on deep neural networks and interpolation.梅森-阿尔伯塔音标分段器:一种基于深度神经网络和插值的强制对齐系统。
Phonetica. 2024 Sep 5;81(5):451-508. doi: 10.1515/phon-2024-0015. Print 2024 Oct 28.

本文引用的文献

1
An ERP investigation of regional and foreign accent processing.一项关于区域和外国口音处理的 ERP 研究。
Brain Lang. 2012 Aug;122(2):92-102. doi: 10.1016/j.bandl.2012.04.017. Epub 2012 Jun 12.
2
Quantitative social dialectology: explaining linguistic variation geographically and socially.定量社会方言学:从地理和社会角度解释语言变异。
PLoS One. 2011;6(9):e23613. doi: 10.1371/journal.pone.0023613. Epub 2011 Sep 1.
3
A comparison of vowel normalization procedures for language variation research.用于语言变异研究的元音归一化程序比较
J Acoust Soc Am. 2004 Nov;116(5):3099-107. doi: 10.1121/1.1795335.
4
The writer independent online handwriting recognition system frog on hand and cluster generative statistical dynamic time warping.作者独立的在线手写识别系统,即基于手部的青蛙模型和聚类生成统计动态时间规整。 (但这段英文表述似乎不太准确和清晰,可能影响理解。)
IEEE Trans Pattern Anal Mach Intell. 2004 Mar;26(3):299-310. doi: 10.1109/tpami.2004.1262308.
5
Spectral balance as an acoustic correlate of linguistic stress.频谱平衡作为语言重音的声学关联因素。
J Acoust Soc Am. 1996 Oct;100(4 Pt 1):2471-85. doi: 10.1121/1.417955.
6
Multidimensional scaling of similarity.相似度的多维缩放
Psychometrika. 1965 Dec;30(4):379-93. doi: 10.1007/BF02289530.
7
Factors affecting degree of perceived foreign accent in English sentences.影响英语句子中外国口音感知程度的因素。
J Acoust Soc Am. 1988 Jul;84(1):70-9. doi: 10.1121/1.396876.