The clear speech intelligibility benefit for text-to-speech voices: Effects of speaking style and visual guise.

Affiliation

Department of Linguistics, University of California, Davis, 469 Kerr Hall, One Shields Avenue, Davis, California 95616, USA

Publication information

JASA Express Lett. 2022 Apr;2(4):045204. doi: 10.1121/10.0010274.

Abstract

This study examined how speaking style and guise influence the intelligibility of text-to-speech (TTS) and naturally produced human voices. Results showed that TTS voices were less intelligible overall. Although using a clear speech style improved intelligibility for both human and TTS voices (using "newscaster" neural TTS), the clear speech effect was stronger for TTS voices. Finally, a visual device guise decreased intelligibility, regardless of voice type. The results suggest that both speaking style and visual guise affect intelligibility of human and TTS voices. Findings are discussed in terms of theories about the role of social information in speech perception.
