Suppr超能文献

为发音语音合成建模协同发音。

Modeling consonant-vowel coarticulation for articulatory speech synthesis.

机构信息

Department of Phoniatrics, Pedaudiology, and Communication Disorders, University Hospital Aachen and RWTH Aachen University, Aachen, Germany.

出版信息

PLoS One. 2013 Apr 16;8(4):e60603. doi: 10.1371/journal.pone.0060603. Print 2013.

Abstract

A central challenge for articulatory speech synthesis is the simulation of realistic articulatory movements, which is critical for the generation of highly natural and intelligible speech. This includes modeling coarticulation, i.e., the context-dependent variation of the articulatory and acoustic realization of phonemes, especially of consonants. Here we propose a method to simulate the context-sensitive articulation of consonants in consonant-vowel syllables. To achieve this, the vocal tract target shape of a consonant in the context of a given vowel is derived as the weighted average of three measured and acoustically-optimized reference vocal tract shapes for that consonant in the context of the corner vowels /a/, /i/, and /u/. The weights are determined by mapping the target shape of the given context vowel into the vowel subspace spanned by the corner vowels. The model was applied for the synthesis of consonant-vowel syllables with the consonants /b/, /d/, /g/, /l/, /r/, /m/, /n/ in all combinations with the eight long German vowels. In a perception test, the mean recognition rate for the consonants in the isolated syllables was 82.4%. This demonstrates the potential of the approach for highly intelligible articulatory speech synthesis.

摘要

对于发音语音合成来说,一个核心挑战是对真实发音动作的模拟,这对于生成高度自然和可理解的语音至关重要。这包括对协同发音的建模,即音位(尤其是辅音)的发音和声学表现的上下文相关变化。在这里,我们提出了一种方法来模拟辅音-元音音节中辅音的上下文敏感发音。为了实现这一点,给定元音环境下的辅音的声道目标形状被推导为该辅音在角元音 /a/、/i/ 和 /u/ 环境下三个经过测量和声学优化的参考声道形状的加权平均值。权重通过将给定上下文元音的目标形状映射到由角元音张成的元音子空间来确定。该模型应用于合成辅音-元音音节,其中辅音 /b/、/d/、/g/、/l/、/r/、/m/、/n/ 与八个长德语元音中的每一个进行组合。在感知测试中,孤立音节中辅音的平均识别率为 82.4%。这证明了该方法在高度可理解的发音语音合成方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/103f/3628899/e8d1db0fc2c9/pone.0060603.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验