CONVOLUTIONAL RECURRENT NEURAL NETWORK BASED DIRECTION OF ARRIVAL ESTIMATION METHOD USING TWO MICROPHONES FOR HEARING STUDIES.

Authors

Küçük Abdullah, Panahi Issa M S

Affiliation

The University of Texas at Dallas Department of Electrical and Computer Engineering, 800 West Campbell Richardson, TX 75080, USA.

Publication information

IEEE Int Workshop Mach Learn Signal Process. 2020 Sep;2020. doi: 10.1109/mlsp49062.2020.9231693. Epub 2020 Oct 20.

DOI:10.1109/mlsp49062.2020.9231693
PMID:33972890
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8106976/
Abstract

This work proposes a convolutional recurrent neural network (CRNN) based direction of arrival (DOA) angle estimation method, implemented on an Android smartphone for hearing aid applications. The proposed app provides a 'visual' indication of the direction of a talker on the screen of Android smartphones for improving the hearing of people with hearing disorders. We use the real and imaginary parts of the short-time Fourier transform (STFT) as a feature set for the proposed CRNN architecture for DOA angle estimation. Real smartphone recordings are utilized for assessing the performance of the proposed method. The accuracy of the proposed method reaches 87.33% for unseen (untrained) environments. This work also presents real-time inference of the proposed method, which is done on an Android smartphone using only its two built-in microphones and no additional component or external hardware. The real-time implementation also proves the generalization and robustness of the proposed CRNN based model.
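
The feature set named in the abstract, the real and imaginary parts of the STFT from two microphones, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the frame length, hop size, Hann window, channel layout, and the function name `stft_features` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import numpy as np


def stft_features(stereo, frame_len=512, hop=256):
    """Stack real/imaginary STFT parts of a two-channel signal.

    stereo: array of shape (num_samples, 2), one column per microphone.
    Returns an array of shape (num_frames, frame_len // 2 + 1, 4) whose
    last axis is [real ch0, imag ch0, real ch1, imag ch1].
    frame_len, hop, and the Hann window are illustrative assumptions.
    """
    stereo = np.asarray(stereo, dtype=np.float64)
    if stereo.ndim != 2 or stereo.shape[1] != 2:
        raise ValueError("expected an array of shape (num_samples, 2)")
    if stereo.shape[0] < frame_len:
        raise ValueError("signal is shorter than one frame")

    window = np.hanning(frame_len)
    num_frames = 1 + (stereo.shape[0] - frame_len) // hop

    parts = []
    for ch in range(2):
        # Slice the channel into overlapping, windowed frames.
        frames = np.stack([
            stereo[i * hop:i * hop + frame_len, ch] * window
            for i in range(num_frames)
        ])
        # One-sided FFT of each frame: shape (num_frames, frame_len // 2 + 1).
        spec = np.fft.rfft(frames, axis=1)
        parts.append(spec.real)
        parts.append(spec.imag)

    return np.stack(parts, axis=-1)


# Synthetic example: noise on one microphone, a 3-sample delayed copy on the other.
rng = np.random.default_rng(0)
mic0 = rng.standard_normal(16000)
mic1 = np.roll(mic0, 3)
feats = stft_features(np.column_stack([mic0, mic1]))
print(feats.shape)  # (61, 257, 4)
```

Keeping the real and imaginary parts, rather than the magnitude alone, preserves the inter-microphone phase differences that carry the direction cue; a tensor of this kind would then be fed to a network such as the CRNN described above.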



