• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于资源自适应卷积神经网络的快速环境声音分类

Fast environmental sound classification based on resource adaptive convolutional neural network.

作者信息

Fang Zheng, Yin Bo, Du Zehua, Huang Xianqing

机构信息

College of Information Science and Engineering, Ocean University of China, Qingdao, China.

Pilot National Laboratory for Marine Science and Technology, Qingdao, China.

出版信息

Sci Rep. 2022 Apr 22;12(1):6599. doi: 10.1038/s41598-022-10382-x.

DOI:10.1038/s41598-022-10382-x
PMID:35459273
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9033781/
Abstract

Recently, with the construction of smart city, the research on environmental sound classification (ESC) has attracted the attention of academia and industry. The development of convolutional neural network (CNN) makes the accuracy of ESC reach a higher level, but the accuracy improvement brought by CNN is often accompanied by the deepening of network layers, which leads to the rapid growth of parameters and floating-point operations (FLOPs). Therefore, it is difficult to transplant CNN model to embedded devices, and the classification speed is also difficult to accept. In order to reduce the hardware requirements of running CNN and improve the speed of ESC, this paper proposes a resource adaptive convolutional neural network (RACNN). RACNN uses a novel resource adaptive convolutional (RAC) module, which can generate the same number of feature maps as conventional convolution operations more cheaply, and extract the time and frequency features of audio efficiently. The RAC block based on the RAC module is designed to build the lightweight RACNN model, and the RAC module can also be used to upgrade the existing CNN model. Experiments based on public datasets show that RACNN achieves higher performance than the state-of-the-art methods with lower computational complexity.

摘要

近年来,随着智慧城市的建设,环境声音分类(ESC)研究受到学术界和工业界的关注。卷积神经网络(CNN)的发展使ESC的准确率达到了更高水平,但CNN带来的准确率提升往往伴随着网络层数的加深,这导致参数和浮点运算(FLOP)快速增长。因此,难以将CNN模型移植到嵌入式设备上,分类速度也难以令人接受。为了降低运行CNN的硬件要求并提高ESC的速度,本文提出了一种资源自适应卷积神经网络(RACNN)。RACNN使用了一种新颖的资源自适应卷积(RAC)模块,该模块能够以更低的成本生成与传统卷积操作相同数量的特征图,并有效地提取音频的时间和频率特征。基于RAC模块设计的RAC块用于构建轻量级的RACNN模型,RAC模块还可用于升级现有的CNN模型。基于公共数据集的实验表明,RACNN在较低的计算复杂度下比现有方法具有更高的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/82623e220b16/41598_2022_10382_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/8dd282b8b39a/41598_2022_10382_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/05a11855fc6f/41598_2022_10382_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/711cc5a9dec2/41598_2022_10382_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/de3a726af22a/41598_2022_10382_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/92b33a0be0d1/41598_2022_10382_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/d09639cce106/41598_2022_10382_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/bb0063c62d53/41598_2022_10382_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/adcc11552939/41598_2022_10382_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/e987ad585c86/41598_2022_10382_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/6c927b93f191/41598_2022_10382_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/2273ce3daa6a/41598_2022_10382_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/b21d75f3465d/41598_2022_10382_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/e2aab5c25a05/41598_2022_10382_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/e12cc6af8b5c/41598_2022_10382_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/82623e220b16/41598_2022_10382_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/8dd282b8b39a/41598_2022_10382_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/05a11855fc6f/41598_2022_10382_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/711cc5a9dec2/41598_2022_10382_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/de3a726af22a/41598_2022_10382_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/92b33a0be0d1/41598_2022_10382_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/d09639cce106/41598_2022_10382_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/bb0063c62d53/41598_2022_10382_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/adcc11552939/41598_2022_10382_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/e987ad585c86/41598_2022_10382_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/6c927b93f191/41598_2022_10382_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/2273ce3daa6a/41598_2022_10382_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/b21d75f3465d/41598_2022_10382_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/e2aab5c25a05/41598_2022_10382_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/e12cc6af8b5c/41598_2022_10382_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1248/9033781/82623e220b16/41598_2022_10382_Fig15_HTML.jpg

相似文献

1
Fast environmental sound classification based on resource adaptive convolutional neural network.基于资源自适应卷积神经网络的快速环境声音分类
Sci Rep. 2022 Apr 22;12(1):6599. doi: 10.1038/s41598-022-10382-x.
2
DM-CNN: Dynamic Multi-scale Convolutional Neural Network with uncertainty quantification for medical image classification.DM-CNN:具有不确定性量化的动态多尺度卷积神经网络,用于医学图像分类。
Comput Biol Med. 2024 Jan;168:107758. doi: 10.1016/j.compbiomed.2023.107758. Epub 2023 Nov 29.
3
ESC-NAS: Environment Sound Classification Using Hardware-Aware Neural Architecture Search for the Edge.ESC-NAS:利用硬件感知神经架构搜索进行边缘环境声音分类
Sensors (Basel). 2024 Jun 9;24(12):3749. doi: 10.3390/s24123749.
4
CEModule: A Computation Efficient Module for Lightweight Convolutional Neural Networks.CEModule:一种用于轻量级卷积神经网络的计算高效模块。
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):6069-6080. doi: 10.1109/TNNLS.2021.3133127. Epub 2023 Sep 1.
5
An Image Classification Method Based on Adaptive Attention Mechanism and Feature Extraction Network.基于自适应注意力机制和特征提取网络的图像分类方法。
Comput Intell Neurosci. 2023 Feb 17;2023:4305594. doi: 10.1155/2023/4305594. eCollection 2023.
6
fMRI volume classification using a 3D convolutional neural network robust to shifted and scaled neuronal activations.使用对移位和缩放神经元激活具有鲁棒性的 3D 卷积神经网络进行 fMRI 体积分类。
Neuroimage. 2020 Dec;223:117328. doi: 10.1016/j.neuroimage.2020.117328. Epub 2020 Sep 5.
7
Lightweight deep convolutional neural network for background sound classification in speech signals.用于语音信号中背景声音分类的轻量级深度卷积神经网络。
J Acoust Soc Am. 2022 Apr;151(4):2773. doi: 10.1121/10.0010257.
8
BSN-ESC: A Big-Small Network-Based Environmental Sound Classification Method for AIoT Applications.BSN-ESC:一种用于物联网应用的基于大小网络的环境声音分类方法。
Sensors (Basel). 2023 Jul 28;23(15):6767. doi: 10.3390/s23156767.
9
A Hyperspectral Image Classification Method Based on the Nonlocal Attention Mechanism of a Multiscale Convolutional Neural Network.基于多尺度卷积神经网络的非局部注意力机制的高光谱图像分类方法。
Sensors (Basel). 2023 Mar 16;23(6):3190. doi: 10.3390/s23063190.
10
Attention-aided lightweight networks friendly to smart weeding robot hardware resources for crops and weeds semantic segmentation.面向作物与杂草语义分割的、对智能除草机器人硬件资源友好的注意力辅助轻量级网络。
Front Plant Sci. 2023 Dec 21;14:1320448. doi: 10.3389/fpls.2023.1320448. eCollection 2023.

引用本文的文献

1
Automated detection of wolf howls using audio spectrogram transformers.使用音频频谱图变换器自动检测狼嚎声。
Sci Rep. 2025 Jul 22;15(1):26641. doi: 10.1038/s41598-025-11413-z.
2
A Comparative Study of Preprocessing and Model Compression Techniques in Deep Learning for Forest Sound Classification.深度学习中用于森林声音分类的预处理和模型压缩技术的比较研究
Sensors (Basel). 2024 Feb 9;24(4):1149. doi: 10.3390/s24041149.
3
Artificial Intelligence for Automatic Monitoring of Respiratory Health Conditions in Smart Swine Farming.智能养猪场中用于自动监测呼吸健康状况的人工智能

本文引用的文献

1
Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture.基于开源深度学习架构的原始声音波形的鸟类叫声的生物声学分类。
Sci Rep. 2021 Aug 3;11(1):15733. doi: 10.1038/s41598-021-95076-6.
2
Hybrid neural network based on novel audio feature for vehicle type identification.基于新型音频特征的混合神经网络车辆类型识别
Sci Rep. 2021 Apr 7;11(1):7648. doi: 10.1038/s41598-021-87399-1.
3
Squeeze-and-Excitation Networks.挤压激励网络。
Animals (Basel). 2023 Jun 2;13(11):1860. doi: 10.3390/ani13111860.
IEEE Trans Pattern Anal Mach Intell. 2020 Aug;42(8):2011-2023. doi: 10.1109/TPAMI.2019.2913372. Epub 2019 Apr 29.
4
Environment Sound Classification Using a Two-Stream CNN Based on Decision-Level Fusion.基于决策级融合的双流卷积神经网络环境声音分类
Sensors (Basel). 2019 Apr 11;19(7):1733. doi: 10.3390/s19071733.