Expansion in speech time can restore comprehension in a simultaneously speaking bilingual robot.

Author information

Pourfannan Hamed, Mahzoon Hamed, Yoshikawa Yuichiro, Ishiguro Hiroshi

Affiliations

Intelligent Robotics Laboratory (Hiroshi Ishiguro's Laboratory), Department of Systems Innovation, Graduate School of Engineering Science, Osaka University, Osaka, Japan.

Institute for Open and Transdisciplinary Research Initiatives (OTRI), Osaka University, Osaka, Japan.

Publication information

Front Robot AI. 2023 Mar 1;9:1032811. doi: 10.3389/frobt.2022.1032811. eCollection 2022.

Abstract

This study was motivated by the goal of developing a social robot capable of delivering speech in more than one language simultaneously. However, the negative effect of background noise on speech comprehension is well documented in previous work, and this detrimental effect is more pronounced when the background noise has speech-like properties. Hence, the presence of speech as background noise in a simultaneously speaking bilingual robot can severely impair speech comprehension for each person listening to the robot. To improve speech comprehension, and consequently user experience, in the intended bilingual robot, the effect of time expansion on speech comprehension in a multi-talker speech scenario was investigated. Sentence recognition, speech comprehension, and subjective evaluation tasks were implemented in the study. The results suggest that a reduced speech rate, which expands the speech time, combined with increased pause duration in both the target and background speech, can lead to statistically significant improvements in both sentence recognition and speech comprehension. More interestingly, participants scored higher in the time-expanded multi-talker speech than in the standard-speed single-talker speech in both the speech comprehension and the sentence recognition tasks. However, this positive effect could not be attributed merely to time expansion, as the same effect could not be reproduced in time-expanded single-talker speech. These results suggest a facilitating effect of background speech in a simultaneously speaking bilingual robot, provided that both languages are presented in a time-expanded manner. The implications of such a simultaneously speaking robot are discussed.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9471/10014467/401320d6a58d/frobt-09-1032811-g001.jpg
