延长说话时间可以恢复同时说话的双语机器人的理解能力。

Expansion in speech time can restore comprehension in a simultaneously speaking bilingual robot.

作者信息

Pourfannan Hamed, Mahzoon Hamed, Yoshikawa Yuichiro, Ishiguro Hiroshi

机构信息

Intelligent Robotics Laboratory (Hiroshi Ishiguro's Laboratory), Department of Systems Innovation, Graduate School of Engineering Science, Osaka University, Osaka, Japan.

Institute for Open and Transdisciplinary Research Initiatives (OTRI), Osaka University, Osaka, Japan.

出版信息

Front Robot AI. 2023 Mar 1;9:1032811. doi: 10.3389/frobt.2022.1032811. eCollection 2022.

DOI:10.3389/frobt.2022.1032811

PMID:36935651

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10014467/

Abstract

In this study, the development of a social robot, capable of giving speech simultaneously in more than one language was in mind. However, the negative effect of background noise on speech comprehension is well-documented in previous works. This deteriorating effect is more highlighted when the background noise has speech-like properties. Hence, the presence of speech as the background noise in a simultaneously speaking bilingual robot can be fatal for the speech comprehension of each person listening to the robot. To improve speech comprehension and consequently, user experience in the intended bilingual robot, the effect of time expansion on speech comprehension in a multi-talker speech scenario was investigated. Sentence recognition, speech comprehension, and subjective evaluation tasks were implemented in the study. The obtained results suggest that a reduced speech rate, leading to an expansion in the speech time, in addition to increased pause duration in both the target and background speeches can lead to statistically significant improvement in both sentence recognition, and speech comprehension of participants. More interestingly, participants got a higher score in the time-expanded multi-talker speech than in the standard-speed single-talker speech in the speech comprehension and, in the sentence recognition task. However, this positive effect could not be attributed merely to the time expansion, as we could not repeat the same positive effect in a time-expanded single-talker speech. The results obtained in this study suggest a facilitating effect of the presence of the background speech in a simultaneously speaking bilingual robot provided that both languages are presented in a time-expanded manner. The implications of such a simultaneously speaking robot are discussed.

摘要

在本研究中，我们设想开发一种能够同时用多种语言进行语音输出的社交机器人。然而，背景噪声对语音理解的负面影响在以往的研究中已有充分记录。当背景噪声具有类似语音的特性时，这种恶化效应会更加明显。因此，在一个同时说两种语言的机器人中，存在类似语音的背景噪声可能会对每个听机器人讲话的人的语音理解造成致命影响。为了提高预期的双语机器人的语音理解能力，并进而提升用户体验，我们研究了时间扩展对多说话者语音场景中语音理解的影响。本研究实施了句子识别、语音理解和主观评价任务。所得结果表明，降低语速（导致语音时间延长），以及增加目标语音和背景语音中的停顿持续时间，均可在统计上显著提高参与者的句子识别能力和语音理解能力。更有趣的是，在语音理解和句子识别任务中，参与者在时间扩展的多说话者语音中获得的分数高于标准语速的单说话者语音。然而，这种积极效果不能仅仅归因于时间扩展，因为我们在时间扩展的单说话者语音中无法重复同样的积极效果。本研究所得结果表明，在一个同时说两种语言的机器人中，只要两种语言都以时间扩展的方式呈现，背景语音的存在会产生促进作用。我们还讨论了这种同时说话机器人的意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9471/10014467/401320d6a58d/frobt-09-1032811-g001.jpg

相似文献

Expansion in speech time can restore comprehension in a simultaneously speaking bilingual robot.延长说话时间可以恢复同时说话的双语机器人的理解能力。

Front Robot AI. 2023 Mar 1;9:1032811. doi: 10.3389/frobt.2022.1032811. eCollection 2022.

Talker- and language-specific effects on speech intelligibility in noise assessed with bilingual talkers: Which language is more robust against noise and reverberation?使用双语者评估噪声环境下特定说话者和特定语言对言语可懂度的影响：哪种语言对噪声和混响更具抗性？

Int J Audiol. 2015;54 Suppl 2:23-34. doi: 10.3109/14992027.2015.1088174. Epub 2015 Oct 21.

Towards a simultaneously speaking bilingual robot: Primary study on the effect of gender and pitch of the robot's voice.迈向同时会说双语的机器人：关于机器人声音性别和音高效果的初步研究。

PLoS One. 2022 Dec 28;17(12):e0278852. doi: 10.1371/journal.pone.0278852. eCollection 2022.

Listening Effort by Native and Nonnative Listeners Due to Noise, Reverberation, and Talker Foreign Accent During English Speech Perception.母语和非母语听者在英语语音感知中因噪声、混响和说话者外国口音而产生的听力努力。

J Speech Lang Hear Res. 2019 Apr 15;62(4):1068-1081. doi: 10.1044/2018_JSLHR-H-17-0423.

Effects of Language History on Sentence Recognition in Noise or Two-Talker Speech: Monolingual, Early Bilingual, and Late Bilingual Speakers of English.语言历史对噪声环境或双说话者语音中句子识别的影响：单语、早期双语和晚期双语英语使用者

Am J Audiol. 2019 Dec 16;28(4):935-946. doi: 10.1044/2019_AJA-18-0194. Epub 2019 Nov 7.

Working-memory disruption by task-irrelevant talkers depends on degree of talker familiarity.与任务无关的交谈者对工作记忆的干扰取决于交谈者的熟悉程度。

Atten Percept Psychophys. 2019 May;81(4):1108-1118. doi: 10.3758/s13414-019-01727-2.

Bilingualism and Speech Understanding in Noise: Auditory and Linguistic Factors.双语与噪声环境下的言语理解：听觉与语言因素

J Am Acad Audiol. 2019 Feb;30(2):115-130. doi: 10.3766/jaaa.17082. Epub 2018 Jan 10.

Revisiting the talker recognition advantage in bilingual infants.重新审视双语婴儿在说话人识别方面的优势。

J Exp Child Psychol. 2022 Feb;214:105276. doi: 10.1016/j.jecp.2021.105276. Epub 2021 Sep 8.

Rerouting Hearing Aid Systems for Overcoming Simulated Unilateral Hearing in Dynamic Listening Situations.用于在动态聆听情境中克服模拟单侧听力的重新路由助听器系统。

Ear Hear. 2020 Jul/Aug;41(4):790-803. doi: 10.1097/AUD.0000000000000800.

Implicit Talker Training Improves Comprehension of Auditory Speech in Noise.隐性说话者训练可提高噪声环境下听觉言语的理解能力。

Front Psychol. 2017 Sep 14;8:1584. doi: 10.3389/fpsyg.2017.01584. eCollection 2017.

本文引用的文献

PLoS One. 2022 Dec 28;17(12):e0278852. doi: 10.1371/journal.pone.0278852. eCollection 2022.

How Pause Duration Influences Impressions of English Speech: Comparison Between Native and Non-native Speakers.停顿时长如何影响对英语演讲的印象：以母语者和非母语者为例的比较

Front Psychol. 2022 Feb 11;13:778018. doi: 10.3389/fpsyg.2022.778018. eCollection 2022.

Linguistic processing of task-irrelevant speech at a cocktail party.鸡尾酒会上与任务无关的言语的语言处理。

Elife. 2021 May 4;10:e65096. doi: 10.7554/eLife.65096.

Distinct sensitivity to spectrotemporal modulation supports brain asymmetry for speech and melody.不同的频谱和时变调制敏感性支持大脑对言语和旋律的不对称性。

Science. 2020 Feb 28;367(6481):1043-1047. doi: 10.1126/science.aaz3468.

The Effect of Noise Exposure on Cognitive Performance and Brain Activity Patterns.噪声暴露对认知表现和脑活动模式的影响。

Open Access Maced J Med Sci. 2019 Aug 30;7(17):2924-2931. doi: 10.3889/oamjms.2019.742. eCollection 2019 Sep 15.

Immediate Passage Comprehension and Encoding of Information Into Long-Term Memory in Children With Normal Hearing: The Effect of Voice Quality and Multitalker Babble Noise.听力正常儿童对信息的即时段落理解及编码至长期记忆：嗓音质量和多说话者嘈杂噪音的影响

Am J Audiol. 2018 Jun 8;27(2):231-237. doi: 10.1044/2018_AJA-17-0061.

Listening Effort: How the Cognitive Consequences of Acoustic Challenge Are Reflected in Brain and Behavior.聆听努力：听觉挑战的认知后果如何在大脑和行为中反映出来。

Ear Hear. 2018 Mar/Apr;39(2):204-214. doi: 10.1097/AUD.0000000000000494.

Relatively effortless listening promotes understanding and recall of medical instructions in older adults.相对轻松的听力有助于老年人理解和记住医学指示。

Front Psychol. 2015 Jun 9;6:778. doi: 10.3389/fpsyg.2015.00778. eCollection 2015.

The slower the better? Does the speaker's speech rate influence children's performance on a language comprehension test?

Int J Speech Lang Pathol. 2014 Apr;16(2):181-90. doi: 10.3109/17549507.2013.845690. Epub 2013 Oct 25.

Effects of irrelevant speech and traffic noise on speech perception and cognitive performance in elementary school children.无关言语和交通噪音对小学生言语感知及认知表现的影响。

Noise Health. 2007 Jul-Sep;9(36):64-74. doi: 10.4103/1463-1741.36982.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

延长说话时间可以恢复同时说话的双语机器人的理解能力。

Expansion in speech time can restore comprehension in a simultaneously speaking bilingual robot.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献