• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于子波散射、并行集成分类器和非线性融合的双耳声场景分类。

Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion.

机构信息

Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s/n, 4200-465 Porto, Portugal.

Bosch Security Systems S.A., EN109-Zona Industrial de Ovar, 3880-080 Ovar, Portugal.

出版信息

Sensors (Basel). 2022 Feb 16;22(4):1535. doi: 10.3390/s22041535.

DOI:10.3390/s22041535
PMID:35214436
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8878239/
Abstract

The analysis of ambient sounds can be very useful when developing sound base intelligent systems. Acoustic scene classification (ASC) is defined as identifying the area of a recorded sound or clip among some predefined scenes. ASC has huge potential to be used in urban sound event classification systems. This research presents a hybrid method that includes a novel mathematical fusion step which aims to tackle the challenges of ASC accuracy and adaptability of current state-of-the-art models. The proposed method uses a stereo signal, two ensemble classifiers (random subspace), and a novel mathematical fusion step. In the proposed method, a stable, invariant signal representation of the stereo signal is built using Wavelet Scattering Transform (WST). For each mono, i.e., left and right, channel, a different random subspace classifier is trained using WST. A novel mathematical formula for fusion step was developed, its parameters being found using a Genetic algorithm. The results on the DCASE 2017 dataset showed that the proposed method has higher classification accuracy (about 95%), pushing the boundaries of existing methods.

摘要

当开发基于声音的智能系统时,环境声音的分析可能非常有用。声学场景分类(ASC)的定义是在一些预定义的场景中识别记录声音或剪辑的区域。ASC 具有巨大的潜力可用于城市声音事件分类系统。本研究提出了一种混合方法,其中包括一个新的数学融合步骤,旨在解决当前最先进模型的 ASC 准确性和适应性的挑战。所提出的方法使用立体声信号、两个集成分类器(随机子空间)和一个新的数学融合步骤。在提出的方法中,使用小波散射变换(WST)构建立体声信号的稳定、不变的信号表示。对于每个单声道,即左声道和右声道,使用 WST 训练不同的随机子空间分类器。开发了一种新的融合步骤数学公式,其参数使用遗传算法找到。在 DCASE 2017 数据集上的结果表明,所提出的方法具有更高的分类准确性(约 95%),推动了现有方法的边界。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/6d5799b7e481/sensors-22-01535-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/980e46181c4c/sensors-22-01535-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/4bd20171c55b/sensors-22-01535-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/93e29351a68f/sensors-22-01535-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/1b673c7995c8/sensors-22-01535-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/a45c9a6e7bf0/sensors-22-01535-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/0dd5229cc2d8/sensors-22-01535-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/1fb4c6c7c7c7/sensors-22-01535-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/fb7c42465921/sensors-22-01535-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/8ed7dd63b8bb/sensors-22-01535-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/aa2b60c50766/sensors-22-01535-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/0a4f4c0aebc3/sensors-22-01535-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/dc8eea985093/sensors-22-01535-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/476a2d6ad260/sensors-22-01535-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/6d5799b7e481/sensors-22-01535-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/980e46181c4c/sensors-22-01535-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/4bd20171c55b/sensors-22-01535-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/93e29351a68f/sensors-22-01535-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/1b673c7995c8/sensors-22-01535-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/a45c9a6e7bf0/sensors-22-01535-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/0dd5229cc2d8/sensors-22-01535-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/1fb4c6c7c7c7/sensors-22-01535-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/fb7c42465921/sensors-22-01535-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/8ed7dd63b8bb/sensors-22-01535-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/aa2b60c50766/sensors-22-01535-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/0a4f4c0aebc3/sensors-22-01535-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/dc8eea985093/sensors-22-01535-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/476a2d6ad260/sensors-22-01535-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9306/8878239/6d5799b7e481/sensors-22-01535-g014.jpg

相似文献

1
Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion.基于子波散射、并行集成分类器和非线性融合的双耳声场景分类。
Sensors (Basel). 2022 Feb 16;22(4):1535. doi: 10.3390/s22041535.
2
Phenotype recognition with combined features and random subspace classifier ensemble.基于组合特征和随机子空间分类器集成的表型识别。
BMC Bioinformatics. 2011 Apr 30;12:128. doi: 10.1186/1471-2105-12-128.
3
Clustering by Errors: A Self-Organized Multitask Learning Method for Acoustic Scene Classification.基于错误的聚类:用于声场景分类的自组织多任务学习方法。
Sensors (Basel). 2021 Dec 22;22(1):36. doi: 10.3390/s22010036.
4
Classification Framework of the Bearing Faults of an Induction Motor Using Wavelet Scattering Transform-Based Features.基于小波散射变换特征的感应电动机轴承故障分类框架。
Sensors (Basel). 2022 Nov 19;22(22):8958. doi: 10.3390/s22228958.
5
Automatic User Preferences Selection of Smart Hearing Aid Using BioAid.基于 BioAid 的智能助听器自动用户偏好选择
Sensors (Basel). 2022 Oct 20;22(20):8031. doi: 10.3390/s22208031.
6
Classification of heart sounds based on quality assessment and wavelet scattering transform.基于音质评估和小波散射变换的心音分类。
Comput Biol Med. 2021 Oct;137:104814. doi: 10.1016/j.compbiomed.2021.104814. Epub 2021 Aug 28.
7
A flexible analytic wavelet transform based approach for motor-imagery tasks classification in BCI applications.基于灵活分析小波变换的脑机接口应用中运动想象任务分类方法。
Comput Methods Programs Biomed. 2020 Apr;187:105325. doi: 10.1016/j.cmpb.2020.105325. Epub 2020 Jan 18.
8
Acoustic emission corrosion feature extraction and severity prediction using hybrid wavelet packet transform and linear support vector classifier.基于混合小波包变换和线性支持向量分类器的声发射腐蚀特征提取与严重程度预测。
PLoS One. 2021 Dec 16;16(12):e0261040. doi: 10.1371/journal.pone.0261040. eCollection 2021.
9
A changepoint prefilter for sound event detection in long-term bioacoustic recordings.一种用于长期生物声学记录中声音事件检测的变点预滤波器。
J Acoust Soc Am. 2021 Oct;150(4):2469. doi: 10.1121/10.0006534.
10
Fundamental Heart Sound Classification using the Continuous Wavelet Transform and Convolutional Neural Networks.基于连续小波变换和卷积神经网络的基本心音分类
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:409-412. doi: 10.1109/EMBC.2018.8512284.

引用本文的文献

1
Real-Time Acoustic Scene Recognition for Elderly Daily Routines Using Edge-Based Deep Learning.基于边缘深度学习的老年人日常活动实时声学场景识别
Sensors (Basel). 2025 Mar 12;25(6):1746. doi: 10.3390/s25061746.
2
DEW: A wavelet approach of rare sound event detection.DEW:一种用于检测罕见声音事件的小波方法。
PLoS One. 2024 Mar 28;19(3):e0300444. doi: 10.1371/journal.pone.0300444. eCollection 2024.
3
EnViTSA: Ensemble of Vision Transformer with SpecAugment for Acoustic Event Classification.EnViTSA:结合频谱增强的视觉Transformer集成用于声学事件分类

本文引用的文献

1
General audio tagging with ensembling convolutional neural networks and statistical features.基于集成卷积神经网络和统计特征的通用音频标记
J Acoust Soc Am. 2019 Jun;145(6):EL521. doi: 10.1121/1.5111059.
Sensors (Basel). 2023 Nov 10;23(22):9084. doi: 10.3390/s23229084.
4
Classification of Heart Sounds Using Chaogram Transform and Deep Convolutional Neural Network Transfer Learning.基于 Chaogram 变换和深度卷积神经网络迁移学习的心音分类。
Sensors (Basel). 2022 Dec 7;22(24):9569. doi: 10.3390/s22249569.
5
Automatic User Preferences Selection of Smart Hearing Aid Using BioAid.基于 BioAid 的智能助听器自动用户偏好选择
Sensors (Basel). 2022 Oct 20;22(20):8031. doi: 10.3390/s22208031.