• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

枪击和类枪击音频事件的数据收集、建模和分类:案例研究。

Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study.

机构信息

EECS Department, Florida Atlantic University, Boca Raton, FL 33431, USA.

ECE Department, Georgia Tech, Atlanta, GA 30332, USA.

出版信息

Sensors (Basel). 2021 Nov 3;21(21):7320. doi: 10.3390/s21217320.

DOI:10.3390/s21217320
PMID:34770635
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8587567/
Abstract

Distinguishing between a dangerous audio event like a gun firing and other non-life-threatening events, such as a plastic bag bursting, can mean the difference between life and death and, therefore, the necessary and unnecessary deployment of public safety personnel. Sounds generated by plastic bag explosions are often confused with real gunshot sounds, by either humans or computer algorithms. As a case study, the research reported in this paper offers insight into sounds of plastic bag explosions and gunshots. An experimental study in this research reveals that a deep learning-based classification model trained with a popular urban sound dataset containing gunshot sounds cannot distinguish plastic bag pop sounds from gunshot sounds. This study further shows that the same deep learning model, if trained with a dataset containing plastic pop sounds, can effectively detect the non-life-threatening sounds. For this purpose, first, a collection of plastic bag-popping sounds was recorded in different environments with varying parameters, such as plastic bag size and distance from the recording microphones. The audio clips' duration ranged from 400 ms to 600 ms. This collection of data was then used, together with a gunshot sound dataset, to train a classification model based on a convolutional neural network (CNN) to differentiate life-threatening gunshot events from non-life-threatening plastic bag explosion events. A comparison between two feature extraction methods, the Mel-frequency cepstral coefficients (MFCC) and Mel-spectrograms, was also done. Experimental studies conducted in this research show that once the plastic bag pop sounds are injected into model training, the CNN classification model performs well in distinguishing actual gunshot sounds from plastic bag sounds.

摘要

区分危险的音频事件,如枪声和其他非危及生命的事件,如塑料袋爆裂,可以决定公共安全人员的必要和不必要部署。塑料袋爆炸产生的声音经常被人类或计算机算法误认为是真实的枪声。作为一个案例研究,本文所报告的研究提供了对塑料袋爆炸和枪声的深入了解。本研究中的一项实验研究表明,使用包含枪声的流行城市声音数据集训练的基于深度学习的分类模型无法区分塑料袋爆裂声和枪声。这项研究进一步表明,如果使用包含塑料袋爆裂声的数据集来训练相同的深度学习模型,就可以有效地检测到非危及生命的声音。为此,首先在不同的环境中记录了不同尺寸塑料袋的爆裂声,这些环境的参数有所不同,例如塑料袋的尺寸和与录音麦克风的距离。音频剪辑的持续时间从 400 毫秒到 600 毫秒不等。然后,将此数据集与枪声数据集一起用于训练基于卷积神经网络(CNN)的分类模型,以区分危及生命的枪击事件和非危及生命的塑料袋爆炸事件。还比较了两种特征提取方法,梅尔频率倒谱系数(MFCC)和梅尔频谱图。本研究中的实验研究表明,一旦将塑料袋爆裂声注入模型训练中,CNN 分类模型就能很好地区分实际枪声和塑料袋声。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/7b640da7bf38/sensors-21-07320-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/2be95d1f152b/sensors-21-07320-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/805d5f43d647/sensors-21-07320-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/33cad7b576d6/sensors-21-07320-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/172cc1fd35b4/sensors-21-07320-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/becebfa99766/sensors-21-07320-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/e3fe5d4863ff/sensors-21-07320-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/bdc13e47c888/sensors-21-07320-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/e8640f522df2/sensors-21-07320-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/ac9a6987af70/sensors-21-07320-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/7b640da7bf38/sensors-21-07320-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/2be95d1f152b/sensors-21-07320-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/805d5f43d647/sensors-21-07320-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/33cad7b576d6/sensors-21-07320-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/172cc1fd35b4/sensors-21-07320-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/becebfa99766/sensors-21-07320-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/e3fe5d4863ff/sensors-21-07320-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/bdc13e47c888/sensors-21-07320-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/e8640f522df2/sensors-21-07320-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/ac9a6987af70/sensors-21-07320-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f7/8587567/7b640da7bf38/sensors-21-07320-g010.jpg

相似文献

1
Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study.枪击和类枪击音频事件的数据收集、建模和分类:案例研究。
Sensors (Basel). 2021 Nov 3;21(21):7320. doi: 10.3390/s21217320.
2
Measurements, Analysis, Classification, and Detection of Gunshot and Gunshot-like Sounds.枪声和类枪声的测量、分析、分类和检测。
Sensors (Basel). 2022 Nov 25;22(23):9170. doi: 10.3390/s22239170.
3
Crackle Detection In Lung Sounds Using Transfer Learning And Multi-Input Convolutional Neural Networks.使用迁移学习和多输入卷积神经网络进行肺部声音的爆裂音检测。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:80-83. doi: 10.1109/EMBC46164.2021.9630577.
4
Heart sound classification based on improved MFCC features and convolutional recurrent neural networks.基于改进 MFCC 特征和卷积循环神经网络的心音分类。
Neural Netw. 2020 Oct;130:22-32. doi: 10.1016/j.neunet.2020.06.015. Epub 2020 Jun 23.
5
Comparative study of respiratory sounds classification methods based on cepstral analysis and artificial neural networks.基于倒谱分析和人工神经网络的呼吸音分类方法的比较研究。
Comput Biol Med. 2024 Mar;171:108190. doi: 10.1016/j.compbiomed.2024.108190. Epub 2024 Feb 20.
6
DCNN for Pig Vocalization and Non-Vocalization Classification: Evaluate Model Robustness with New Data.用于猪发声与非发声分类的深度卷积神经网络:使用新数据评估模型稳健性
Animals (Basel). 2024 Jul 9;14(14):2029. doi: 10.3390/ani14142029.
7
Comparative Study of Popular Deep Learning Models for Machining Roughness Classification Using Sound and Force Signals.基于声音和力信号的常用深度学习模型用于加工粗糙度分类的比较研究
Micromachines (Basel). 2021 Nov 29;12(12):1484. doi: 10.3390/mi12121484.
8
Heart sound classification based on equal scale frequency cepstral coefficients and deep learning.基于等比例频率倒谱系数和深度学习的心音分类。
Biomed Tech (Berl). 2023 Feb 15;68(3):285-295. doi: 10.1515/bmt-2021-0254. Print 2023 Jun 27.
9
Feature-Based Fusion Using CNN for Lung and Heart Sound Classification.基于特征融合的 CNN 用于心肺音分类。
Sensors (Basel). 2022 Feb 16;22(4):1521. doi: 10.3390/s22041521.
10
Environment Sound Classification Using a Two-Stream CNN Based on Decision-Level Fusion.基于决策级融合的双流卷积神经网络环境声音分类
Sensors (Basel). 2019 Apr 11;19(7):1733. doi: 10.3390/s19071733.

引用本文的文献

1
A multi-firearm, multi-orientation audio dataset of gunshots.一个包含多种枪支、多种射击方向的枪声音频数据集。
Data Brief. 2023 Mar 25;48:109091. doi: 10.1016/j.dib.2023.109091. eCollection 2023 Jun.
2
Measurements, Analysis, Classification, and Detection of Gunshot and Gunshot-like Sounds.枪声和类枪声的测量、分析、分类和检测。
Sensors (Basel). 2022 Nov 25;22(23):9170. doi: 10.3390/s22239170.