• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DEW:一种用于检测罕见声音事件的小波方法。

DEW: A wavelet approach of rare sound event detection.

机构信息

Department of Electrical Engineering, University of Engineering and Technology, Peshawar, Pakistan.

Intelligent Information Processing Lab, National Center of Artificial Intelligence, University of Engineering and Technology, Peshawar, Pakistan.

出版信息

PLoS One. 2024 Mar 28;19(3):e0300444. doi: 10.1371/journal.pone.0300444. eCollection 2024.

DOI:10.1371/journal.pone.0300444
PMID:38547253
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10977878/
Abstract

This paper presents a novel sound event detection (SED) system for rare events occurring in an open environment. Wavelet multiresolution analysis (MRA) is used to decompose the input audio clip of 30 seconds into five levels. Wavelet denoising is then applied on the third and fifth levels of MRA to filter out the background. Significant transitions, which may represent the onset of a rare event, are then estimated in these two levels by combining the peak-finding algorithm with the K-medoids clustering algorithm. The small portions of one-second duration, called 'chunks' are cropped from the input audio signal corresponding to the estimated locations of the significant transitions. Features from these chunks are extracted by the wavelet scattering network (WSN) and are given as input to a support vector machine (SVM) classifier, which classifies them. The proposed SED framework produces an error rate comparable to the SED systems based on convolutional neural network (CNN) architecture. Also, the proposed algorithm is computationally efficient and lightweight as compared to deep learning models, as it has no learnable parameter. It requires only a single epoch of training, which is 5, 10, 200, and 600 times lesser than the models based on CNNs and deep neural networks (DNNs), CNN with long short-term memory (LSTM) network, convolutional recurrent neural network (CRNN), and CNN respectively. The proposed model neither requires concatenation with previous frames for anomaly detection nor any additional training data creation needed for other comparative deep learning models. It needs to check almost 360 times fewer chunks for the presence of rare events than the other baseline systems used for comparison in this paper. All these characteristics make the proposed system suitable for real-time applications on resource-limited devices.

摘要

本文提出了一种新颖的用于开放环境中罕见事件检测(SED)的系统。小波多分辨率分析(MRA)用于将 30 秒的输入音频剪辑分解为五个级别。然后,在 MRA 的第三级和第五级应用小波去噪来过滤背景。通过将峰值查找算法与 K-均值聚类算法相结合,在这两个级别中估计可能表示罕见事件开始的显著转换。然后,从输入音频信号中对应于估计的显著转换位置的一小部分,持续时间为 1 秒的部分,称为“块”。从这些块中提取特征,由小波散射网络(WSN)提取,并作为输入提供给支持向量机(SVM)分类器,对其进行分类。所提出的 SED 框架产生的错误率可与基于卷积神经网络(CNN)架构的 SED 系统相媲美。此外,与深度学习模型相比,所提出的算法在计算上更有效且更轻量级,因为它没有可学习的参数。它只需要一个训练的单个时期,这比基于 CNN 和深度神经网络(DNN)、具有长短期记忆(LSTM)网络的 CNN、卷积递归神经网络(CRNN)和 CNN 的模型分别少 5、10、200 和 600 倍。所提出的模型既不需要为异常检测与前一帧拼接,也不需要为其他比较的深度学习模型创建额外的训练数据。与本文用于比较的其他基线系统相比,它需要检查罕见事件存在的块数几乎少 360 倍。所有这些特性使得所提出的系统适合在资源有限的设备上进行实时应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/973b3b9a6827/pone.0300444.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/bdf65c28dd83/pone.0300444.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/6635e26497cc/pone.0300444.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/20fe9999db60/pone.0300444.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/716a56a9ea69/pone.0300444.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/c71c0359df71/pone.0300444.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/973b3b9a6827/pone.0300444.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/bdf65c28dd83/pone.0300444.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/6635e26497cc/pone.0300444.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/20fe9999db60/pone.0300444.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/716a56a9ea69/pone.0300444.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/c71c0359df71/pone.0300444.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e0/10977878/973b3b9a6827/pone.0300444.g006.jpg

相似文献

1
DEW: A wavelet approach of rare sound event detection.DEW:一种用于检测罕见声音事件的小波方法。
PLoS One. 2024 Mar 28;19(3):e0300444. doi: 10.1371/journal.pone.0300444. eCollection 2024.
2
Convolutional Recurrent Neural Network-Based Event Detection in Tunnels Using Multiple Microphones.基于卷积递归神经网络的多麦克风隧道事件检测
Sensors (Basel). 2019 Jun 14;19(12):2695. doi: 10.3390/s19122695.
3
A Novel Method for Classifying Liver and Brain Tumors Using Convolutional Neural Networks, Discrete Wavelet Transform and Long Short-Term Memory Networks.基于卷积神经网络、离散小波变换和长短时记忆网络的肝脏和脑肿瘤分类新方法。
Sensors (Basel). 2019 Apr 28;19(9):1992. doi: 10.3390/s19091992.
4
Deep Learning with LPC and Wavelet Algorithms for Driving Fault Diagnosis.基于 LPC 和小波算法的深度学习在驾驶故障诊断中的应用。
Sensors (Basel). 2022 Sep 19;22(18):7072. doi: 10.3390/s22187072.
5
Mixed convolutional and long short-term memory network for the detection of lethal ventricular arrhythmia.混合卷积和长短时记忆网络用于致命性室性心律失常检测。
PLoS One. 2019 May 20;14(5):e0216756. doi: 10.1371/journal.pone.0216756. eCollection 2019.
6
End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis.端到端使用深度神经网络进行多模态临床抑郁症识别:比较分析。
Comput Methods Programs Biomed. 2021 Nov;211:106433. doi: 10.1016/j.cmpb.2021.106433. Epub 2021 Sep 28.
7
Fundamental Heart Sound Classification using the Continuous Wavelet Transform and Convolutional Neural Networks.基于连续小波变换和卷积神经网络的基本心音分类
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:409-412. doi: 10.1109/EMBC.2018.8512284.
8
Real driving environment EEG-based detection of driving fatigue using the wavelet scattering network.基于小波散射网络的真实驾驶环境中驾驶疲劳的脑电检测。
J Neurosci Methods. 2023 Dec 1;400:109983. doi: 10.1016/j.jneumeth.2023.109983. Epub 2023 Oct 13.
9
[Research on muscle fatigue recognition model based on improved wavelet denoising and long short-term memory].基于改进小波去噪和长短期记忆的肌肉疲劳识别模型研究
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2022 Jun 25;39(3):507-515. doi: 10.7507/1001-5515.202107024.
10
ColoRectalCADx: Expeditious Recognition of Colorectal Cancer with Integrated Convolutional Neural Networks and Visual Explanations Using Mixed Dataset Evidence.结直肠癌计算机辅助诊断(ColoRectalCADx):利用集成卷积神经网络和基于混合数据集证据的可视化解释快速识别结直肠癌
Comput Math Methods Med. 2022 Nov 10;2022:8723957. doi: 10.1155/2022/8723957. eCollection 2022.

本文引用的文献

1
Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion.基于子波散射、并行集成分类器和非线性融合的双耳声场景分类。
Sensors (Basel). 2022 Feb 16;22(4):1535. doi: 10.3390/s22041535.
2
Wavelet Scattering Transform for ECG Beat Classification.基于小波散射变换的心拍分类。
Comput Math Methods Med. 2020 Oct 9;2020:3215681. doi: 10.1155/2020/3215681. eCollection 2020.
3
Residual Error Based Anomaly Detection Using Auto-Encoder in SMD Machine Sound.基于残差误差的自动编码器在贴片机声音中的异常检测
Sensors (Basel). 2018 Apr 24;18(5):1308. doi: 10.3390/s18051308.
4
The electronic stethoscope.电子听诊器。
Biomed Eng Online. 2015 Jul 10;14:66. doi: 10.1186/s12938-015-0056-y.
5
Methods of EEG signal features extraction using linear analysis in frequency and time-frequency domains.在频率域和时频域中使用线性分析提取脑电信号特征的方法。
ISRN Neurosci. 2014 Feb 13;2014:730218. doi: 10.1155/2014/730218. eCollection 2014.
6
Neural classification of lung sounds using wavelet coefficients.基于小波系数的肺音神经分类
Comput Biol Med. 2004 Sep;34(6):523-37. doi: 10.1016/S0010-4825(03)00092-1.