Hybrid neural network based on novel audio feature for vehicle type identification.

Affiliations

School of Instrument and Electronics, North University of China, Taiyuan, 030051, China.

Key Laboratory of Instrumentation Science and Dynamic Measurement (North University of China), Ministry of Education, Taiyuan, 030051, China.

Publication information

Sci Rep. 2021 Apr 7;11(1):7648. doi: 10.1038/s41598-021-87399-1.

DOI:10.1038/s41598-021-87399-1
PMID:33828216
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8027866/
Abstract

Because the audio signatures of different vehicle types are distinct, a vehicle's type can be identified accurately from its audio signal. In practice, determining the vehicle type therefore requires only audio information, not visual information. In this paper, we extract and concatenate features from three aspects: Mel-frequency cepstral coefficients (perceptual characteristics), pitch class profile (psychoacoustic characteristics), and short-term energy (acoustic characteristics). In addition, we improve the neural network classifier by fusing an LSTM unit into a convolutional neural network. Finally, we feed the novel feature into the hybrid neural network to recognize different vehicles. The results suggest that the proposed feature increases the recognition rate by 7%; that randomly corrupting the training data by superimposing different kinds of noise improves the noise robustness of the identification system; and that, because LSTM is well suited to modeling time series, adding it to the network improves the recognition rate by 3.39%.
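Two of the steps named in the abstract, short-term energy extraction and noise superposition on training data, can be illustrated with a minimal standard-library sketch. This is not the authors' implementation. The function names, the frame length and hop size, the choice of white Gaussian noise, and the SNR parameterization are all assumptions made for illustration. The MFCC and pitch class profile features are not computed here; `stitch` only shows how per-frame vectors from several extractors might be concatenated.

```python
import math
import random


def short_term_energy(signal, frame_len, hop):
    """Sum of squared samples in each frame (one value per frame)."""
    return [
        sum(x * x for x in signal[start:start + frame_len])
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]


def add_noise_at_snr(signal, snr_db, rng):
    """Superimpose white Gaussian noise scaled to a target SNR in dB.

    Assumption: the abstract does not say which noise types or levels
    were used; white noise at a chosen SNR is a stand-in.
    """
    signal_power = sum(x * x for x in signal) / len(signal)
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in signal]


def stitch(*per_frame_features):
    """Concatenate per-frame feature vectors from several extractors.

    Each argument is a list with one feature vector per frame; all
    arguments must cover the same frames in the same order.
    """
    return [
        sum((list(vec) for vec in frame), [])
        for frame in zip(*per_frame_features)
    ]


# Hypothetical usage: a 1-second, 50 Hz tone sampled at 8 kHz.
rng = random.Random(0)
clean = [math.sin(2 * math.pi * 50 * n / 8000) for n in range(8000)]
noisy = add_noise_at_snr(clean, snr_db=10, rng=rng)
energy = short_term_energy(noisy, frame_len=400, hop=200)
```

In a fuller pipeline, the per-frame energy values would be stitched together with MFCC and pitch class profile vectors computed over the same frames before being passed to the classifier.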
