• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

听觉换位思考。

Auditory perspective taking.

机构信息

U.S. Naval Research Laboratory, Washington, DC 20375, USA.

出版信息

IEEE Trans Cybern. 2013 Jun;43(3):957-69. doi: 10.1109/TSMCB.2012.2219524. Epub 2012 Oct 18.

DOI:10.1109/TSMCB.2012.2219524
PMID:23096077
Abstract

Effective communication with a mobile robot using speech is a difficult problem even when you can control the auditory scene. Robot self-noise or ego noise, echoes and reverberation, and human interference are all common sources of decreased intelligibility. Moreover, in real-world settings, these problems are routinely aggravated by a variety of sources of background noise. Military scenarios can be punctuated by high decibel noise from materiel and weaponry that would easily overwhelm a robot's normal speaking volume. Moreover, in nonmilitary settings, fans, computers, alarms, and transportation noise can cause enough interference to make a traditional speech interface unusable. This work presents and evaluates a prototype robotic interface that uses perspective taking to estimate the effectiveness of its own speech presentation and takes steps to improve intelligibility for human listeners.

摘要

即使可以控制听觉场景,使用语音与移动机器人进行有效通信也是一个难题。机器人自身噪声或本底噪声、回声和混响以及人为干扰都是可懂度降低的常见原因。此外,在现实环境中,这些问题通常会因各种背景噪声源而加剧。军事场景中,材料和武器会产生高分贝的噪声,很容易盖过机器人的正常说话音量。此外,在非军事环境中,风扇、计算机、警报器和交通噪声会造成足够的干扰,使传统的语音界面无法使用。本工作提出并评估了一种使用换位思考来估计自身语音表达有效性的机器人接口原型,并采取措施提高人类听众的可懂度。

相似文献

1
Auditory perspective taking.听觉换位思考。
IEEE Trans Cybern. 2013 Jun;43(3):957-69. doi: 10.1109/TSMCB.2012.2219524. Epub 2012 Oct 18.
2
Application of real-time loudness models can improve speech recognition for cochlear implant users.实时响度模型的应用可以提高人工耳蜗使用者的语音识别能力。
IEEE Trans Neural Syst Rehabil Eng. 2013 Jan;21(1):81-7. doi: 10.1109/TNSRE.2012.2213841. Epub 2012 Sep 3.
3
Multiexpert automatic speech recognition using acoustic and myoelectric signals.使用声学和肌电信号的多专家自动语音识别
IEEE Trans Biomed Eng. 2006 Apr;53(4):676-85. doi: 10.1109/TBME.2006.870224.
4
Human-inspired sound environment recognition system for assistive vehicles.用于辅助车辆的仿人声音环境识别系统。
J Neural Eng. 2015 Feb;12(1):016012. doi: 10.1088/1741-2560/12/1/016012. Epub 2015 Jan 14.
5
Joint attention by gaze interpolation and saliency.基于注视插值和显著度的共同注意。
IEEE Trans Cybern. 2013 Jun;43(3):829-42. doi: 10.1109/TSMCB.2012.2216979. Epub 2012 Oct 3.
6
Language bootstrapping: learning word meanings from perception-action association.语言自引导:从感知 - 行动关联中学习词义。
IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):660-71. doi: 10.1109/TSMCB.2011.2172420. Epub 2011 Nov 16.
7
A generalized time-frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system.一种基于人类听觉系统小波滤波器组建模的广义时频减法鲁棒语音增强方法。
IEEE Trans Syst Man Cybern B Cybern. 2007 Aug;37(4):877-89. doi: 10.1109/tsmcb.2007.895365.
8
Discrimination of pathological voices using a time-frequency approach.使用时频方法鉴别病理性嗓音。
IEEE Trans Biomed Eng. 2005 Mar;52(3):421-30. doi: 10.1109/TBME.2004.842962.
9
Telephony-based voice pathology assessment using automated speech analysis.基于电话的语音病理学评估:使用自动语音分析
IEEE Trans Biomed Eng. 2006 Mar;53(3):468-77. doi: 10.1109/TBME.2005.869776.
10
Dysphonic speech reconstruction: construction of a novel system for effective and efficient communication.
IEEE Eng Med Biol Mag. 2010 Mar-Apr;29(2):135-8. doi: 10.1109/MEMB.2009.935725.

引用本文的文献

1
Active Volume Control in Smart Phones Based on User Activity and Ambient Noise.基于用户活动和环境噪声的智能手机主动音量控制。
Sensors (Basel). 2020 Jul 24;20(15):4117. doi: 10.3390/s20154117.