将语调标签与基频的数学描述联系起来。

Connecting intonation labels to mathematical descriptions of fundamental frequency.

作者信息

Grabe Esther, Kochanski Greg, Coleman John

机构信息

University of Oxford, UK.

出版信息

Lang Speech. 2007;50(Pt 3):281-310. doi: 10.1177/00238309070500030101.

DOI:10.1177/00238309070500030101

PMID:17974321

Abstract

The mathematical models of intonation used in speech technology are often inaccessible to linguists. By the same token, phonological descriptions of intonation are rarely used by speech technologists, as they cannot be implemented directly in applications. Consequently, these research communities do not benefit much from each other's insights. In this paper, we explore the interface between the disciplines, in search of bridges between intonational phonology and speech technology. In a corpus of speech data from seven dialects of English, we hand-labeled over 700 sentences and identified seven nuclear accent types. Then we fitted a third-order polynomial to the fundamental frequency (F0) contour in the region around the accent mark. The polynomial captures the local shape (time-dependence) of F0 in a few numbers, in our case, four coefficients. The coefficients were subjected to statistical analysis. Nineteen of the 21 pairs of accent types differed significantly in one or more coefficients. Our approach bridges the gap between intonational phonology and speech technology. It provides quantitative, empirically testable models of intonation labels that can be implemented in applications.

摘要

语音技术中使用的语调数学模型往往是语言学家无法接触到的。同样，语音技术专家很少使用语调的音系学描述，因为它们无法直接在应用程序中实现。因此，这些研究群体无法从彼此的见解中获得太多益处。在本文中，我们探索了这两个学科之间的接口，寻找语调音系学和语音技术之间的桥梁。在一个来自七种英语方言的语音数据语料库中，我们手动标注了700多个句子，并识别出七种核心重音类型。然后我们在重音标记周围的区域对基频（F0）轮廓拟合了一个三阶多项式。该多项式用几个数字（在我们的例子中是四个系数）捕捉F0的局部形状（时间依赖性）。对这些系数进行了统计分析。21对重音类型中的19对在一个或多个系数上有显著差异。我们的方法弥合了语调音系学和语音技术之间的差距。它提供了可以在应用程序中实现的、定量的、可通过实证检验的语调标签模型。

相似文献

Connecting intonation labels to mathematical descriptions of fundamental frequency.

Lang Speech. 2007;50(Pt 3):281-310. doi: 10.1177/00238309070500030101.

Polynomial modeling of child and adult intonation in German spontaneous speech.

Lang Speech. 2011 Jun;54(Pt 2):199-223. doi: 10.1177/0023830910397495.

Phonological and phonetic marking of information status in Foreign Accent Syndrome.

Int J Lang Commun Disord. 2012 Nov-Dec;47(6):738-49. doi: 10.1111/j.1460-6984.2012.00184.x. Epub 2012 Sep 27.

Phonological theory informs the analysis of intonational exaggeration in Japanese infant-directed speech.

J Acoust Soc Am. 2013 Aug;134(2):1283-94. doi: 10.1121/1.4812755.

Intonation in neurogenic foreign accent syndrome.

J Commun Disord. 2012 Jan-Feb;45(1):1-11. doi: 10.1016/j.jcomdis.2011.10.002. Epub 2011 Nov 6.

Characterizing intonation deficit in motor speech disorders: an autosegmental-metrical analysis of spontaneous speech in hypokinetic dysarthria, ataxic dysarthria, and foreign accent syndrome.

J Speech Lang Hear Res. 2012 Oct;55(5):1472-84. doi: 10.1044/1092-4388(2012/11-0263).

When pitch Accents Encode Speaker Commitment: Evidence from French Intonation.

Lang Speech. 2016 Jun;59(Pt 2):266-93. doi: 10.1177/0023830915587337.

Production and perception of speech intonation in pediatric cochlear implant recipients and individuals with normal hearing.

Ear Hear. 2008 Jun;29(3):336-51. doi: 10.1097/AUD.0b013e318168d94d.

Prosodic peak estimation under segmental perturbations.

J Acoust Soc Am. 2010 Feb;127(2):862-73. doi: 10.1121/1.3268511.

Phonetics and phonology of thematic contrast in German.

Lang Speech. 2006;49(Pt 4):451-93. doi: 10.1177/00238309060490040201.

引用本文的文献

Internal structure of intonational categories: The (dis)appearance of a perceptual magnet effect.

Front Psychol. 2023 Jan 17;13:911349. doi: 10.3389/fpsyg.2022.911349. eCollection 2022.

Echoes of Past Contact: Venetian Influence on Cretan Greek Intonation.

Lang Speech. 2024 Jun;67(2):528-560. doi: 10.1177/00238309221091939. Epub 2022 May 13.

Identification, Analysis and Characterization of Base Units of Bird Vocal Communication: The White Spectacled Bulbul () as a Case Study.

Front Behav Neurosci. 2022 Feb 14;15:812939. doi: 10.3389/fnbeh.2021.812939. eCollection 2021.

Correction by Focus: Cleft Constructions and the Cross-Linguistic Variation in Phonological Form.

Front Psychol. 2021 Nov 29;12:648478. doi: 10.3389/fpsyg.2021.648478. eCollection 2021.

Unifying Amplitude and Phase Analysis: A Compositional Data Approach to Functional Multivariate Mixed-Effects Modeling of Mandarin Chinese.

J Am Stat Assoc. 2015 Apr 3;110(510):545-559. doi: 10.1080/01621459.2015.1006729. Epub 2015 Jul 6.

Emotion identification using extremely low frequency components of speech feature contours.

ScientificWorldJournal. 2014;2014:757121. doi: 10.1155/2014/757121. Epub 2014 May 20.

Accent detection is a slippery slope: Direction and rate of F0 change drives listeners' comprehension.

Lang Cogn Process. 2010 Jan 1;25(7-9):1178-1200. doi: 10.1080/01690961003783699.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

将语调标签与基频的数学描述联系起来。

Connecting intonation labels to mathematical descriptions of fundamental frequency.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献