• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用深度学习方法进行乐器识别。

Musical Instrument Identification Using Deep Learning Approach.

机构信息

Multimedia Systems Department, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Narutowicza 11/12, 80-233 Gdańsk, Poland.

Audio Acoustics Laboratory, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Narutowicza 11/12, 80-233 Gdańsk, Poland.

出版信息

Sensors (Basel). 2022 Apr 15;22(8):3033. doi: 10.3390/s22083033.

DOI:10.3390/s22083033
PMID:35459018
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9025072/
Abstract

The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata description and a review of related works. This is followed by showing the dataset prepared for the experiment and its division into subsets: training, validation, and evaluation. Then, the analyzed architecture of the neural network model is presented. Based on the described model, training is performed, and several quality metrics are determined for the training and validation sets. The results of the evaluation of the trained network on a separate set are shown. Detailed values for precision, recall, and the number of true and false positive and negative detections are presented. The model efficiency is high, with the metric values ranging from 0.86 for the guitar to 0.99 for drums. Finally, a discussion and a summary of the results obtained follows.

摘要

本工作旨在提出一种新方法,使用针对每个测试乐器的一组单个卷积神经网络 (CNN) 自动识别音频摘录中出现的所有乐器。本文首先回顾了与乐器识别相关的任务。它重点介绍了执行的任务、输入类型、使用的算法和使用的指标。本文首先介绍背景,即元数据描述和相关工作回顾。然后展示为实验准备的数据集及其划分为训练集、验证集和评估集。然后,介绍所分析的神经网络模型架构。基于描述的模型进行训练,并确定训练集和验证集的几个质量指标。展示在单独的数据集上评估训练后的网络的结果。呈现了精度、召回率以及真实和假阳性和阴性检测的数量的详细值。该模型的效率很高,吉他的指标值范围从 0.86 到鼓的 0.99。最后,进行讨论并总结获得的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/727ed3d87f50/sensors-22-03033-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/ae1ddb984a1d/sensors-22-03033-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/5bd0565f1c44/sensors-22-03033-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/81b93397e131/sensors-22-03033-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/549197ab3c12/sensors-22-03033-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/8f4225bf4f51/sensors-22-03033-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/55202000a20a/sensors-22-03033-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/3bda0dde98a0/sensors-22-03033-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/aa9bba624742/sensors-22-03033-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/71358bb4a7ce/sensors-22-03033-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/d5ede441e955/sensors-22-03033-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/641718de0b7a/sensors-22-03033-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/727ed3d87f50/sensors-22-03033-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/ae1ddb984a1d/sensors-22-03033-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/5bd0565f1c44/sensors-22-03033-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/81b93397e131/sensors-22-03033-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/549197ab3c12/sensors-22-03033-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/8f4225bf4f51/sensors-22-03033-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/55202000a20a/sensors-22-03033-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/3bda0dde98a0/sensors-22-03033-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/aa9bba624742/sensors-22-03033-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/71358bb4a7ce/sensors-22-03033-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/d5ede441e955/sensors-22-03033-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/641718de0b7a/sensors-22-03033-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5642/9025072/727ed3d87f50/sensors-22-03033-g012.jpg

相似文献

1
Musical Instrument Identification Using Deep Learning Approach.使用深度学习方法进行乐器识别。
Sensors (Basel). 2022 Apr 15;22(8):3033. doi: 10.3390/s22083033.
2
MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.MABAL:一种用于机器辅助骨龄标注的新型深度学习架构。
J Digit Imaging. 2018 Aug;31(4):513-519. doi: 10.1007/s10278-018-0053-3.
3
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
4
Development and Validation of a Model for Laparoscopic Colorectal Surgical Instrument Recognition Using Convolutional Neural Network-Based Instance Segmentation and Videos of Laparoscopic Procedures.基于卷积神经网络实例分割和腹腔镜手术视频的腹腔镜结直肠手术器械识别模型的开发和验证。
JAMA Netw Open. 2022 Aug 1;5(8):e2226265. doi: 10.1001/jamanetworkopen.2022.26265.
5
Convolutional neural network model by deep learning and teaching robot in keyboard musical instrument teaching.深度学习卷积神经网络模型与教学机器人在键盘乐器教学中的应用。
PLoS One. 2023 Oct 26;18(10):e0293411. doi: 10.1371/journal.pone.0293411. eCollection 2023.
6
Metadata information and fundus image fusion neural network for hyperuricemia classification in diabetes.用于糖尿病高尿酸血症分类的元数据信息和眼底图像融合神经网络。
Comput Methods Programs Biomed. 2024 Nov;256:108382. doi: 10.1016/j.cmpb.2024.108382. Epub 2024 Aug 23.
7
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
8
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
9
Hybrid deep learning for computational precision in cardiac MRI segmentation: Integrating Autoencoders, CNNs, and RNNs for enhanced structural analysis.用于心脏磁共振成像分割计算精度的混合深度学习:整合自动编码器、卷积神经网络和循环神经网络以增强结构分析。
Comput Biol Med. 2025 Mar;186:109597. doi: 10.1016/j.compbiomed.2024.109597. Epub 2025 Jan 1.
10
Automatic coronavirus disease 2019 diagnosis based on chest radiography and deep learning - Success story or dataset bias?基于胸部 X 光和深度学习的新型冠状病毒病 2019 自动诊断——成功案例还是数据集偏差?
Med Phys. 2022 Feb;49(2):978-987. doi: 10.1002/mp.15419. Epub 2022 Jan 12.

引用本文的文献

1
Musical instrument classifier for early childhood percussion instruments.用于儿童早期打击乐器的乐器分类器。
PLoS One. 2024 Apr 2;19(4):e0299888. doi: 10.1371/journal.pone.0299888. eCollection 2024.
2
Analytics and Applications of Audio and Image Sensing Techniques.音频和图像感应技术的分析与应用。
Sensors (Basel). 2022 Nov 3;22(21):8443. doi: 10.3390/s22218443.

本文引用的文献

1
Musical Instruments As Sensors.作为传感器的乐器
ACS Omega. 2018 Sep 30;3(9):11026-11032. doi: 10.1021/acsomega.8b01673. Epub 2018 Sep 12.
2
Independent component analysis: algorithms and applications.独立成分分析:算法与应用
Neural Netw. 2000 May-Jun;13(4-5):411-30. doi: 10.1016/s0893-6080(00)00026-5.
3
Computer identification of musical instruments using pattern recognition with cepstral coefficients as features.使用以倒谱系数为特征的模式识别技术对乐器进行计算机识别。
J Acoust Soc Am. 1999 Mar;105(3):1933-41. doi: 10.1121/1.426728.