• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估预训练卷积神经网络在嵌入式系统上进行音频分类的性能,以实现智能城市中的异常检测。

Evaluating the Performance of Pre-Trained Convolutional Neural Network for Audio Classification on Embedded Systems for Anomaly Detection in Smart Cities.

机构信息

Department of Engineering Sciences and Technology (INDI), Vrije Universiteit Brussel (VUB), 1050 Brussels, Belgium.

SIGL Laboratory, National School of Applied Sciences of Tetuan, Abdelmalek Essaadi University, Tetuan 93000, Morocco.

出版信息

Sensors (Basel). 2023 Jul 7;23(13):6227. doi: 10.3390/s23136227.

DOI:10.3390/s23136227
PMID:37448075
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10347208/
Abstract

Environmental Sound Recognition (ESR) plays a crucial role in smart cities by accurately categorizing audio using well-trained Machine Learning (ML) classifiers. This application is particularly valuable for cities that analyzed environmental sounds to gain insight and data. However, deploying deep learning (DL) models on resource-constrained embedded devices, such as Raspberry Pi (RPi) or Tensor Processing Units (TPUs), poses challenges. In this work, an evaluation of an existing pre-trained model for deployment on Raspberry Pi (RPi) and TPU platforms other than a laptop is proposed. We explored the impact of the retraining parameters and compared the sound classification performance across three datasets: ESC-10, BDLib, and Urban Sound. Our results demonstrate the effectiveness of the pre-trained model for transfer learning in embedded systems. On laptops, the accuracy rates reached 96.6% for ESC-10, 100% for BDLib, and 99% for Urban Sound. On RPi, the accuracy rates were 96.4% for ESC-10, 100% for BDLib, and 95.3% for Urban Sound, while on RPi with Coral TPU, the rates were 95.7% for ESC-10, 100% for BDLib and 95.4% for the Urban Sound. Utilizing pre-trained models reduces the computational requirements, enabling faster inference. Leveraging pre-trained models in embedded systems accelerates the development, deployment, and performance of various real-time applications.

摘要

环境声音识别 (ESR) 通过使用经过良好训练的机器学习 (ML) 分类器准确地对音频进行分类,在智慧城市中发挥着至关重要的作用。这种应用对于分析环境声音以获取洞察和数据的城市特别有价值。然而,在资源受限的嵌入式设备(如 Raspberry Pi (RPi) 或 Tensor Processing Units (TPU))上部署深度学习 (DL) 模型存在挑战。在这项工作中,提出了一种在 Raspberry Pi (RPi) 和 TPU 平台上而不是笔记本电脑上部署现有预训练模型的评估方法。我们探讨了重新训练参数的影响,并比较了三个数据集(ESC-10、BDLib 和 Urban Sound)的声音分类性能。我们的结果表明,该预训练模型在嵌入式系统中的迁移学习中是有效的。在笔记本电脑上,ESC-10 的准确率达到 96.6%,BDLib 的准确率达到 100%,Urban Sound 的准确率达到 99%。在 RPi 上,ESC-10 的准确率为 96.4%,BDLib 的准确率为 100%,Urban Sound 的准确率为 95.3%,而在带有 Coral TPU 的 RPi 上,ESC-10 的准确率为 95.7%,BDLib 的准确率为 100%,Urban Sound 的准确率为 95.4%。使用预训练模型可以降低计算要求,实现更快的推断。在嵌入式系统中利用预训练模型可以加速各种实时应用的开发、部署和性能。

相似文献

1
Evaluating the Performance of Pre-Trained Convolutional Neural Network for Audio Classification on Embedded Systems for Anomaly Detection in Smart Cities.评估预训练卷积神经网络在嵌入式系统上进行音频分类的性能,以实现智能城市中的异常检测。
Sensors (Basel). 2023 Jul 7;23(13):6227. doi: 10.3390/s23136227.
2
Sound Classification and Processing of Urban Environments: A Systematic Literature Review.城市环境中的声音分类与处理:系统文献综述。
Sensors (Basel). 2022 Nov 8;22(22):8608. doi: 10.3390/s22228608.
3
An Automatic Classification System for Environmental Sound in Smart Cities.智能城市中环境声音的自动分类系统
Sensors (Basel). 2023 Jul 31;23(15):6823. doi: 10.3390/s23156823.
4
Fast environmental sound classification based on resource adaptive convolutional neural network.基于资源自适应卷积神经网络的快速环境声音分类
Sci Rep. 2022 Apr 22;12(1):6599. doi: 10.1038/s41598-022-10382-x.
5
ESC-NAS: Environment Sound Classification Using Hardware-Aware Neural Architecture Search for the Edge.ESC-NAS:利用硬件感知神经架构搜索进行边缘环境声音分类
Sensors (Basel). 2024 Jun 9;24(12):3749. doi: 10.3390/s24123749.
6
Graph-Based Audio Classification Using Pre-Trained Models and Graph Neural Networks.使用预训练模型和图神经网络的基于图的音频分类
Sensors (Basel). 2024 Mar 26;24(7):2106. doi: 10.3390/s24072106.
7
BSN-ESC: A Big-Small Network-Based Environmental Sound Classification Method for AIoT Applications.BSN-ESC:一种用于物联网应用的基于大小网络的环境声音分类方法。
Sensors (Basel). 2023 Jul 28;23(15):6767. doi: 10.3390/s23156767.
8
DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing From Decentralized Data.深度频谱精简版:一种用于从分散数据进行嵌入式语音和音频处理的高效节能迁移学习框架。
Front Artif Intell. 2022 Mar 17;5:856232. doi: 10.3389/frai.2022.856232. eCollection 2022.
9
Transfer of Learning from Vision to Touch: A Hybrid Deep Convolutional Neural Network for Visuo-Tactile 3D Object Recognition.从视觉到触觉的迁移学习:用于视触 3D 物体识别的混合深度卷积神经网络。
Sensors (Basel). 2020 Dec 27;21(1):113. doi: 10.3390/s21010113.
10
Transformers for Urban Sound Classification-A Comprehensive Performance Evaluation.用于城市声音分类的变压器-全面性能评估。
Sensors (Basel). 2022 Nov 16;22(22):8874. doi: 10.3390/s22228874.

引用本文的文献

1
A dataset for environmental sound recognition in embedded systems for autonomous vehicles.用于自动驾驶车辆嵌入式系统中环境声音识别的数据集。
Sci Data. 2025 Jul 5;12(1):1148. doi: 10.1038/s41597-025-05446-2.

本文引用的文献

1
Environmental sound classification using temporal-frequency attention based convolutional neural network.基于时频注意力的卷积神经网络的环境声音分类。
Sci Rep. 2021 Nov 3;11(1):21552. doi: 10.1038/s41598-021-01045-4.
2
A Digital Signal Processor Based Acoustic Sensor for Outdoor Noise Monitoring in Smart Cities.基于数字信号处理器的智能城市户外噪声监测声传感器。
Sensors (Basel). 2020 Jan 22;20(3):605. doi: 10.3390/s20030605.
3
Machine Learning in Agriculture: A Review.农业中的机器学习:综述。
Sensors (Basel). 2018 Aug 14;18(8):2674. doi: 10.3390/s18082674.