基于频谱-时频感受野的描述符和分层级联深度置信网络在吉他演奏技巧分类中的应用。

Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification.

出版信息

IEEE Trans Cybern. 2022 May;52(5):3684-3695. doi: 10.1109/TCYB.2020.3014207. Epub 2022 May 19.

DOI:10.1109/TCYB.2020.3014207

Abstract

Music information retrieval is of great interest in audio signal processing. However, relatively little attention has been paid to the playing techniques of musical instruments. This work proposes an automatic system for classifying guitar playing techniques (GPTs). Automatic classification for GPTs is challenging because some playing techniques differ only slightly from others. This work presents a new framework for GPT classification: it uses a new feature extraction method based on spectral-temporal receptive fields (STRFs) to extract features from guitar sounds. This work applies a supervised deep learning approach to classify GPTs. Specifically, a new deep learning model, called the hierarchical cascade deep belief network (HCDBN), is proposed to perform automatic GPT classification. Several simulations were performed and the datasets of: 1) data on onsets of signals; 2) complete audio signals; and 3) audio signals in a real-world environment are adopted to compare the performance. The proposed system improves upon the F-score by approximately 11.47% in setup 1) and yields an F-score of 96.82% in setup 2). The results in setup 3) demonstrate that the proposed system also works well in a real-world environment. These results show that the proposed system is robust and has very high accuracy in automatic GPT classification.

摘要

音乐信息检索在音频信号处理中非常重要。然而，对于乐器的演奏技巧，相对较少的关注。这项工作提出了一种自动系统来对吉他演奏技巧（GPT）进行分类。由于一些演奏技巧与其他技巧仅略有不同，因此自动对 GPT 进行分类具有挑战性。这项工作提出了一种新的 GPT 分类框架：它使用基于谱时感受野（STRF）的新特征提取方法从吉他声音中提取特征。这项工作应用了监督深度学习方法来对 GPT 进行分类。具体来说，提出了一种新的深度学习模型，称为层次级联深度置信网络（HCDBN），用于执行自动 GPT 分类。进行了多次模拟，并采用了数据集 1）信号起始的数据；2）完整的音频信号；和 3）现实环境中的音频信号来比较性能。所提出的系统在设置 1）中提高了 F 分数约 11.47%，在设置 2）中产生了 96.82%的 F 分数。设置 3）中的结果表明，所提出的系统在现实环境中也能很好地工作。这些结果表明，所提出的系统在自动 GPT 分类中具有稳健性和非常高的准确性。

相似文献

Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification.

IEEE Trans Cybern. 2022 May;52(5):3684-3695. doi: 10.1109/TCYB.2020.3014207. Epub 2022 May 19.

A multimodal dataset for electric guitar playing technique recognition.

Data Brief. 2023 Nov 22;52:109842. doi: 10.1016/j.dib.2023.109842. eCollection 2024 Feb.

Graph-based feature extraction: A new proposal to study the classification of music signals outside the time-frequency domain.

PLoS One. 2020 Nov 12;15(11):e0240915. doi: 10.1371/journal.pone.0240915. eCollection 2020.

Emotion Recognition of Violin Playing Based on Big Data Analysis Technologies.

J Environ Public Health. 2022 Sep 15;2022:8583924. doi: 10.1155/2022/8583924. eCollection 2022.

The Timbre Toolbox: extracting audio descriptors from musical signals.

J Acoust Soc Am. 2011 Nov;130(5):2902-16. doi: 10.1121/1.3642604.

Construction of Intelligent Recognition and Learning Education Platform of National Music Genre Under Deep Learning.

Front Psychol. 2022 May 26;13:843427. doi: 10.3389/fpsyg.2022.843427. eCollection 2022.

Dynamic Orchestration of Brains and Instruments During Free Guitar Improvisation.

Front Integr Neurosci. 2019 Sep 4;13:50. doi: 10.3389/fnint.2019.00050. eCollection 2019.

A Novel Method for Sleep-Stage Classification Based on Sonification of Sleep Electroencephalogram Signals Using Wavelet Transform and Recurrent Neural Network.

Eur Neurol. 2020;83(5):468-486. doi: 10.1159/000511306. Epub 2020 Oct 29.

NLP-based music processing for composer classification.

Sci Rep. 2023 Aug 14;13(1):13228. doi: 10.1038/s41598-023-40332-0.

Musical note onset detection based on a spectral sparsity measure.

EURASIP J Audio Speech Music Process. 2021;2021(1):30. doi: 10.1186/s13636-021-00214-7. Epub 2021 Jul 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于频谱-时频感受野的描述符和分层级联深度置信网络在吉他演奏技巧分类中的应用。

Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification.

出版信息

IEEE Trans Cybern. 2022 May;52(5):3684-3695. doi: 10.1109/TCYB.2020.3014207. Epub 2022 May 19.

DOI:10.1109/TCYB.2020.3014207

PMID:32936758

Abstract

摘要

基于频谱-时频感受野的描述符和分层级联深度置信网络在吉他演奏技巧分类中的应用。

Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification.

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

基于频谱-时频感受野的描述符和分层级联深度置信网络在吉他演奏技巧分类中的应用。

Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification.

出版信息