• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习视阈下的音乐艺术类型分类

The Classification of Music and Art Genres under the Visual Threshold of Deep Learning.

机构信息

School of Music, Henan Vocational Institute of Arts, Zhengzhou, Henan, China.

出版信息

Comput Intell Neurosci. 2022 May 18;2022:4439738. doi: 10.1155/2022/4439738. eCollection 2022.

DOI:10.1155/2022/4439738
PMID:35634048
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9132639/
Abstract

Wireless networks are commonly employed for ambient assisted living applications, and artificial intelligence-enabled event detection and classification processes have become familiar. However, music is a kind of time-series data, and it is challenging to design an effective music genre classification (MGC) system due to a large quantity of music data. Robust MGC techniques necessitate a massive amount of data, which is time-consuming, laborious, and requires expert knowledge. Few studies have focused on the design of music representations extracted directly from input waveforms. In recent times, deep learning (DL) models have been widely used due to their characteristics of automatic extracting advanced features and contextual representation from actual music or processed data. This paper aims to develop a novel deep learning-enabled music genre classification (DLE-MGC) technique. The proposed DLE-MGC technique effectively classifies the music genres into multiple classes by using three subprocesses, namely preprocessing, classification, and hyperparameter optimization. At the initial stage, the Pitch to Vector (Pitch2vec) approach is applied as a preprocessing step where the pitches in the input musical instrument digital interface (MIDI) files are transformed into the vector sequences. Besides, the DLE-MGC technique involves the design of a cat swarm optimization (CSO) with bidirectional long-term memory (BiLSTM) model for the classification process. The DBTMPE technique has gained a moderately increased accuracy of 94.27%, and the DLE-MGC technique has accomplished a better accuracy of 95.87%. The performance validation of the DLE-MGC technique was carried out using the Lakh MIDI music dataset, and the comparative results verified the promising performance of the DLE-MGC technique over current methods.

摘要

无线网络常用于环境辅助生活应用,人工智能支持的事件检测和分类过程已经变得熟悉。然而,音乐是一种时间序列数据,由于音乐数据量大,设计有效的音乐类型分类 (MGC) 系统具有挑战性。稳健的 MGC 技术需要大量的数据,这既耗时、费力,又需要专业知识。很少有研究关注从输入波形中直接提取音乐表示的设计。近年来,由于深度学习 (DL) 模型从实际音乐或处理后的数据中自动提取高级特征和上下文表示的特点,它们得到了广泛的应用。本文旨在开发一种新的基于深度学习的音乐类型分类 (DLE-MGC) 技术。所提出的 DLE-MGC 技术通过使用三个子过程,即预处理、分类和超参数优化,有效地将音乐类型分类为多个类别。在初始阶段,应用音高到向量 (Pitch2vec) 方法作为预处理步骤,将输入乐器数字接口 (MIDI) 文件中的音高转换为向量序列。此外,DLE-MGC 技术还涉及设计具有双向长短期记忆 (BiLSTM) 模型的猫群优化 (CSO) 进行分类过程。DBTMPE 技术的准确率提高到了 94.27%,而 DLE-MGC 技术的准确率提高到了 95.87%。使用 Lakh MIDI 音乐数据集对 DLE-MGC 技术进行了性能验证,比较结果验证了 DLE-MGC 技术优于现有方法的良好性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/75bccfa45e1c/CIN2022-4439738.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/2682b4f09da7/CIN2022-4439738.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/b53660951f31/CIN2022-4439738.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/3f34a0834c5f/CIN2022-4439738.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/fa764ed517c1/CIN2022-4439738.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/1d10253aad7a/CIN2022-4439738.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/75bccfa45e1c/CIN2022-4439738.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/2682b4f09da7/CIN2022-4439738.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/b53660951f31/CIN2022-4439738.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/3f34a0834c5f/CIN2022-4439738.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/fa764ed517c1/CIN2022-4439738.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/1d10253aad7a/CIN2022-4439738.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc0/9132639/75bccfa45e1c/CIN2022-4439738.006.jpg

相似文献

1
The Classification of Music and Art Genres under the Visual Threshold of Deep Learning.深度学习视阈下的音乐艺术类型分类
Comput Intell Neurosci. 2022 May 18;2022:4439738. doi: 10.1155/2022/4439738. eCollection 2022.
2
Optimizing the configuration of deep learning models for music genre classification.优化用于音乐流派分类的深度学习模型配置。
Heliyon. 2024 Jan 17;10(2):e24892. doi: 10.1016/j.heliyon.2024.e24892. eCollection 2024 Jan 30.
3
Construction of Intelligent Recognition and Learning Education Platform of National Music Genre Under Deep Learning.深度学习下民族音乐体裁智能识别与学习教育平台的构建
Front Psychol. 2022 May 26;13:843427. doi: 10.3389/fpsyg.2022.843427. eCollection 2022.
4
An Ensemble of Deep Learning Enabled Brain Stroke Classification Model in Magnetic Resonance Images.基于深度学习的磁共振影像脑卒中风型分类模型的研究
J Healthc Eng. 2022 Nov 18;2022:7815434. doi: 10.1155/2022/7815434. eCollection 2022.
5
A Lightweight Deep Learning-Based Approach for Jazz Music Generation in MIDI Format.一种基于轻量级深度学习的 MIDI 格式爵士音乐生成方法。
Comput Intell Neurosci. 2022 Aug 5;2022:2140895. doi: 10.1155/2022/2140895. eCollection 2022.
6
Music Similarity Detection Guided by Deep Learning Model.深度学习模型指导下的音乐相似度检测
Comput Intell Neurosci. 2023 Feb 20;2023:1263620. doi: 10.1155/2023/1263620. eCollection 2023.
7
NLP-based music processing for composer classification.基于自然语言处理的作曲风格分类研究
Sci Rep. 2023 Aug 14;13(1):13228. doi: 10.1038/s41598-023-40332-0.
8
Music Score Recognition Method Based on Deep Learning.基于深度学习的乐谱识别方法
Comput Intell Neurosci. 2022 Jul 7;2022:3022767. doi: 10.1155/2022/3022767. eCollection 2022.
9
A Multimodal Convolutional Neural Network Model for the Analysis of Music Genre on Children's Emotions Influence Intelligence.用于分析音乐类型对儿童情绪智力影响的多模态卷积神经网络模型。
Comput Intell Neurosci. 2022 Aug 29;2022:5611456. doi: 10.1155/2022/5611456. eCollection 2022.
10
Music Morphology Interaction under Artificial Intelligence in Wireless Network Environment.人工智能在无线网络环境下的音乐形态学交互。
Comput Intell Neurosci. 2022 May 14;2022:9002093. doi: 10.1155/2022/9002093. eCollection 2022.

本文引用的文献

1
Deep Convolutional Neural Network for Inverse Problems in Imaging.基于深度卷积神经网络的医学影像反问题研究
IEEE Trans Image Process. 2017 Sep;26(9):4509-4522. doi: 10.1109/TIP.2017.2713099. Epub 2017 Jun 15.
2
Rationale-Augmented Convolutional Neural Networks for Text Classification.用于文本分类的基于原理增强的卷积神经网络。
Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:795-804. doi: 10.18653/v1/d16-1076.
3
Representation learning: a review and new perspectives.表示学习:综述与新视角。
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1798-828. doi: 10.1109/TPAMI.2013.50.