利用自适应稀疏复值矩阵分解和 OvsO SVM 实现高效噪声声音事件混合分类。

Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM.

机构信息

School of Software Engineering, Payap University, Chiang Mai 50000, Thailand.

Department of Computer and Information Sciences, Northumbria University, Newcastle upon Tyne NE1 8ST, UK.

出版信息

Sensors (Basel). 2020 Aug 5;20(16):4368. doi: 10.3390/s20164368.

DOI:10.3390/s20164368

PMID:32764362

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7472059/

Abstract

This paper proposes a solution for events classification from a sole noisy mixture that consist of two major steps: a sound-event separation and a sound-event classification. The traditional complex nonnegative matrix factorization (CMF) is extended by cooperation with the optimal adaptive sparsity to decompose a noisy single-channel mixture. The proposed adaptive sparsity CMF algorithm encodes the spectra pattern and estimates the phase of the original signals in time-frequency representation. Their features enhance the temporal decomposition process efficiently. The support vector machine (SVM) based one versus one (OvsO) strategy was applied with a mean supervector to categorize the demixed sound into the matching sound-event class. The first step of the multi-class MSVM method is to segment the separated signal into blocks by sliding demixed signals, then encoding the three features of each block. Mel frequency cepstral coefficients, short-time energy, and short-time zero-crossing rate are learned with multi sound-event classes by the SVM based OvsO method. The mean supervector is encoded from the obtained features. The proposed method has been evaluated with both separation and classification scenarios using real-world single recorded signals and compared with the state-of-the-art separation method. Experimental results confirmed that the proposed method outperformed the state-of-the-art methods.

摘要

本文提出了一种从仅包含两个主要步骤的噪声混合中对事件进行分类的解决方案

声音事件分离和声音事件分类。传统的复杂非负矩阵分解（CMF）通过与最优自适应稀疏性合作进行扩展，以分解噪声单声道混合物。所提出的自适应稀疏 CMF 算法对谱模式进行编码，并估计原始信号在时频表示中的相位。它们的特征有效地增强了时间分解过程。支持向量机（SVM）基于一对一（OvsO）策略与平均超向量一起用于将解混声音分类到匹配的声音事件类中。多类 MSVM 方法的第一步是通过滑动解混信号将分离信号分段，然后对每个块的三个特征进行编码。梅尔频率倒谱系数、短时能量和短时过零率通过基于 SVM 的 OvsO 方法与多个声音事件类进行学习。从获得的特征中对平均超向量进行编码。该方法使用真实记录的单个信号进行了分离和分类场景的评估，并与最先进的分离方法进行了比较。实验结果证实了该方法优于最先进的方法。