Suppr超能文献

人机错误解决过程中全局和局部过度清晰发音的建模。

Modeling global and focal hyperarticulation during human-computer error resolution.

作者信息

Oviatt S, Levow G A, Moreton E, MacEachern M

机构信息

Department of Computer Science, Oregon Graduate Institute of Science and Technology, Portland 97291, USA.

出版信息

J Acoust Soc Am. 1998 Nov;104(5):3080-98. doi: 10.1121/1.423888.

Abstract

When resolving errors with interactive systems, people sometimes hyperarticulate--or adopt a clarified style of speech that has been associated with increased recognition errors. The primary goals of the present study were: (1) to provide a comprehensive analysis of acoustic, prosodic, and phonological adaptations to speech during human-computer error resolution after different types of recognition error; and (2) to examine changes in speech during both global and focal utterance repairs. A semi-automatic simulation method with a novel error-generation capability was used to compare speech immediately before and after system recognition errors. Matched original-repeat utterance pairs then were analyzed for type and magnitude of linguistic adaption during global and focal repairs. Results indicated that the primary hyperarticulate changes in speech following all error types were durational, with increases in number and length of pauses most noteworthy. Speech also was adapted toward a more deliberate and hyperclear articulatory style. During focal error repairs, large durational effects functioned together with pitch and amplitude to provide selective prominence marking of the repair region. These results corroborate and generalize the computer-elicited hyperarticulate adaptation model (CHAM). Implications are discussed for improved error handling in next-generation spoken language and multimodal systems.

摘要

在解决交互式系统的错误时,人们有时会过度清晰表达——或者采用一种与识别错误增加相关的清晰的言语风格。本研究的主要目标是:(1)全面分析在不同类型的识别错误后,人机错误解决过程中语音的声学、韵律和音系学调整;(2)研究全局和局部话语修复过程中语音的变化。一种具有新颖错误生成能力的半自动模拟方法被用于比较系统识别错误前后的语音。然后,对匹配的原始-重复话语对进行分析,以确定全局和局部修复过程中语言调整的类型和程度。结果表明,所有错误类型后语音的主要过度清晰表达变化是时长方面的,停顿的数量和长度增加最为显著。语音也朝着更刻意、更清晰的发音风格调整。在局部错误修复过程中,大的时长效应与音高和振幅共同作用,为修复区域提供选择性的突出标记。这些结果证实并推广了计算机引发的过度清晰表达适应模型(CHAM)。讨论了对下一代口语和多模态系统中改进错误处理的启示。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验