• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度递归神经网络的声异常检测自编码器。

Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection.

机构信息

Machine Intelligence & Signal Processing Group, Technische Universität München, Munich, Germany; audEERING GmbH, Gilching, Germany; Chair of Complex & Intelligent Systems, University of Passau, Passau, Germany.

A3LAB, Department of Information Engineering, Università Politecnica delle Marche, Ancona, Italy.

出版信息

Comput Intell Neurosci. 2017;2017:4694860. doi: 10.1155/2017/4694860. Epub 2017 Jan 15.

DOI:10.1155/2017/4694860
PMID:28182121
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5274684/
Abstract

In the emerging field of acoustic novelty detection, most research efforts are devoted to probabilistic approaches such as mixture models or state-space models. Only recent studies introduced (pseudo-)generative models for acoustic novelty detection with recurrent neural networks in the form of an autoencoder. In these approaches, auditory spectral features of the next short term frame are predicted from the previous frames by means of Long-Short Term Memory recurrent denoising autoencoders. The reconstruction error between the input and the output of the autoencoder is used as activation signal to detect novel events. There is no evidence of studies focused on comparing previous efforts to automatically recognize novel events from audio signals and giving a broad and in depth evaluation of recurrent neural network-based autoencoders. The present contribution aims to consistently evaluate our recent novel approaches to fill this white spot in the literature and provide insight by extensive evaluations carried out on three databases: A3Novelty, PASCAL CHiME, and PROMETHEUS. Besides providing an extensive analysis of novel and state-of-the-art methods, the article shows how RNN-based autoencoders outperform statistical approaches up to an absolute improvement of 16.4% average -measure over the three databases.

摘要

在新兴的声学新颖性检测领域,大多数研究工作都致力于概率方法,如混合模型或状态空间模型。只有最近的研究才引入了(伪)生成模型,通过递归神经网络以自动编码器的形式进行声学新颖性检测。在这些方法中,通过长短期记忆递归去噪自动编码器,从先前的帧预测下一个短期帧的听觉频谱特征。自动编码器的输入和输出之间的重建误差用作激活信号来检测新事件。没有证据表明有研究集中于比较以前的努力,以自动从音频信号中识别新颖事件,并对基于递归神经网络的自动编码器进行广泛而深入的评估。本研究旨在一致地评估我们最近的新颖方法,以填补文献中的空白,并通过在三个数据库上进行广泛的评估提供深入的见解:A3Novelty、PASCAL CHiME 和 PROMETHEUS。除了对新颖和最先进的方法进行广泛分析外,本文还展示了基于 RNN 的自动编码器如何在三个数据库上平均提高 16.4%的绝对指标,优于统计方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/a3c37a1ea100/CIN2017-4694860.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/f9d11f441b2d/CIN2017-4694860.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/7a733ca23f8c/CIN2017-4694860.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/c27f5b13da70/CIN2017-4694860.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/152f948bd24a/CIN2017-4694860.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/59785950e892/CIN2017-4694860.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/fcdedd246d2a/CIN2017-4694860.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/a3c37a1ea100/CIN2017-4694860.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/f9d11f441b2d/CIN2017-4694860.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/7a733ca23f8c/CIN2017-4694860.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/c27f5b13da70/CIN2017-4694860.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/152f948bd24a/CIN2017-4694860.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/59785950e892/CIN2017-4694860.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/fcdedd246d2a/CIN2017-4694860.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb6d/5274684/a3c37a1ea100/CIN2017-4694860.007.jpg

相似文献

1
Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection.基于深度递归神经网络的声异常检测自编码器。
Comput Intell Neurosci. 2017;2017:4694860. doi: 10.1155/2017/4694860. Epub 2017 Jan 15.
2
An Incremental Class-Learning Approach with Acoustic Novelty Detection for Acoustic Event Recognition.基于声学新颖性检测的增量式类学习方法在声学事件识别中的应用。
Sensors (Basel). 2021 Oct 5;21(19):6622. doi: 10.3390/s21196622.
3
Fault diagnosis of rolling bearings with recurrent neural network-based autoencoders.基于循环神经网络自编码器的滚动轴承故障诊断。
ISA Trans. 2018 Jun;77:167-178. doi: 10.1016/j.isatra.2018.04.005. Epub 2018 Apr 19.
4
A Scheme with Acoustic Emission Hit Removal for the Remaining Useful Life Prediction of Concrete Structures.基于声发射去击中的剩余使用寿命预测的混凝土结构方案。
Sensors (Basel). 2021 Nov 22;21(22):7761. doi: 10.3390/s21227761.
5
A new approach for arrhythmia classification using deep coded features and LSTM networks.基于深度编码特征和长短期记忆网络的心律失常分类新方法。
Comput Methods Programs Biomed. 2019 Jul;176:121-133. doi: 10.1016/j.cmpb.2019.05.004. Epub 2019 May 10.
6
Deep-Learning-Based Approach to Anomaly Detection Techniques for Large Acoustic Data in Machine Operation.基于深度学习的机器运行中大型声学数据异常检测技术方法。
Sensors (Basel). 2021 Aug 12;21(16):5446. doi: 10.3390/s21165446.
7
Structural Health Monitoring of Dams Based on Acoustic Monitoring, Deep Neural Networks, Fuzzy Logic and a CUSUM Control Algorithm.基于声监测、深度神经网络、模糊逻辑和CUSUM 控制算法的大坝结构健康监测。
Sensors (Basel). 2022 Mar 24;22(7):2482. doi: 10.3390/s22072482.
8
A deep learning framework for financial time series using stacked autoencoders and long-short term memory.一种使用堆叠自编码器和长短时记忆的金融时间序列深度学习框架。
PLoS One. 2017 Jul 14;12(7):e0180944. doi: 10.1371/journal.pone.0180944. eCollection 2017.
9
Two-Stream Attention Network for Pain Recognition from Video Sequences.基于双流注意力网络的视频序列疼痛识别
Sensors (Basel). 2020 Feb 4;20(3):839. doi: 10.3390/s20030839.
10
An Intelligent Recurrent Neural Network with Long Short-Term Memory (LSTM) BASED Batch Normalization for Medical Image Denoising.基于长短期记忆 (LSTM) 与批量归一化的智能递归神经网络在医学图像去噪中的应用。
J Med Syst. 2019 Jun 15;43(8):234. doi: 10.1007/s10916-019-1371-9.

引用本文的文献

1
Curiosity: primate neural circuits for novelty and information seeking.好奇心:灵长类动物用于寻求新奇和信息的神经回路。
Nat Rev Neurosci. 2024 Mar;25(3):195-208. doi: 10.1038/s41583-023-00784-9. Epub 2024 Jan 23.
2
Deep Neural Networks for Automatic Flower Species Localization and Recognition.基于深度神经网络的自动花卉物种定位与识别
Comput Intell Neurosci. 2022 Apr 29;2022:9359353. doi: 10.1155/2022/9359353. eCollection 2022.
3
An Incremental Class-Learning Approach with Acoustic Novelty Detection for Acoustic Event Recognition.

本文引用的文献

1
Extending the Generalised Pareto Distribution for Novelty Detection in High-Dimensional Spaces.扩展广义帕累托分布用于高维空间中的新奇性检测
J Signal Process Syst. 2014;74(3):323-339. doi: 10.1007/s11265-013-0835-2. Epub 2013 Aug 16.
2
Fast support vector data descriptions for novelty detection.用于异常检测的快速支持向量数据描述
IEEE Trans Neural Netw. 2010 Aug;21(8):1296-313. doi: 10.1109/TNN.2010.2053853. Epub 2010 Jul 15.
3
Factorial switching linear dynamical systems applied to physiological condition monitoring.
基于声学新颖性检测的增量式类学习方法在声学事件识别中的应用。
Sensors (Basel). 2021 Oct 5;21(19):6622. doi: 10.3390/s21196622.
应用于生理状态监测的阶乘切换线性动态系统。
IEEE Trans Pattern Anal Mach Intell. 2009 Sep;31(9):1537-51. doi: 10.1109/TPAMI.2008.191.
4
Face recognition: a convolutional neural-network approach.人脸识别:一种卷积神经网络方法。
IEEE Trans Neural Netw. 1997;8(1):98-113. doi: 10.1109/72.554195.
5
A neural network-based novelty detector for image sequence analysis.一种基于神经网络的用于图像序列分析的新颖性检测器。
IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1664-77. doi: 10.1109/TPAMI.2006.196.
6
Framewise phoneme classification with bidirectional LSTM and other neural network architectures.使用双向长短期记忆网络和其他神经网络架构进行逐帧音素分类。
Neural Netw. 2005 Jun-Jul;18(5-6):602-10. doi: 10.1016/j.neunet.2005.06.042.
7
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.