一种用与语言区别特征相对应的声学线索标注语音的框架。

A framework for labeling speech with acoustic cues to linguistic distinctive features.

机构信息

Speech Communication Group, Research Laboratory of Electronics, Massachusetts Institute of Technology, 50 Vassar Street, Cambridge, Massachusetts 02139,

出版信息

J Acoust Soc Am. 2019 Aug;146(2):EL184. doi: 10.1121/1.5121717.

DOI:10.1121/1.5121717

PMID:31472587

Abstract

Acoustic cues are characteristic patterns in the speech signal that provide lexical, prosodic, or additional information, such as speaker identity. In particular, acoustic cues related to linguistic distinctive features can be extracted and marked from the speech signal. These acoustic cues can be used to infer the intended underlying phoneme sequence in an utterance. This study describes a framework for labeling acoustic cues in speech, including a suite of canonical cue prediction algorithms that facilitates manual labeling and provides a standard for analyzing variations in the surface realizations. A brief examination of subsets of annotated speech data shows that labeling acoustic cues opens the possibility of detailed analyses of cue modification patterns in speech.

摘要

声学线索是语音信号中的特征模式，提供词汇、韵律或其他信息，如说话人身份。特别是，与语言区别特征相关的声学线索可以从语音信号中提取并标记出来。这些声学线索可用于推断言语中预期的潜在音素序列。本研究描述了一种用于标记语音中声学线索的框架，包括一系列规范的线索预测算法，这些算法有助于手动标记，并为分析表面实现中的变化提供了标准。对标注语音数据的子集进行的简要检查表明，标记声学线索为详细分析语音中线索修改模式提供了可能。