改进的特征金字塔卷积神经网络，用于有效识别乐谱。

Improved Feature Pyramid Convolutional Neural Network for Effective Recognition of Music Scores.

机构信息

College of Music, Handan University, Handan 056005, Hebei Province, China.

出版信息

Comput Intell Neurosci. 2022 May 9;2022:6071114. doi: 10.1155/2022/6071114. eCollection 2022.

DOI:10.1155/2022/6071114

PMID:35586087

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9110142/

Abstract

Music written by composers and performed by multidimensional instruments is an art form that reflects real-life emotions. Historically, people disseminated music primarily through sheet music recording and oral transmission. Among them, recording music in sheet music form was a great musical invention. It became the carrier of music communication and inheritance, as well as a record of humanity's magnificent music culture. The advent of digital technology solves the problem of difficult musical score storage and distribution. However, there are many drawbacks to using data in image format, and extracting music score information in editable form from image data is currently a challenge. An improved convolutional neural network for musical score recognition is proposed in this paper. Because the traditional convolutional neural network SEGNET misclassifies some pixels, this paper employs the feature pyramid structure. Use additional branch paths to fuse shallow image details, shallow texture features that are beneficial to small objects, and high-level features of global information, enrich the multi-scale semantic information of the model, and alleviate the problem of the lack of multiscale semantic information in the model. Poor recognition performance is caused by semantic information. By comparing the recognition effects of other models, the experimental results show that the proposed musical score recognition model has a higher recognition accuracy and a stronger generalization performance. The improved generalization performance allows the musical score recognition method to be applied to more types of musical score recognition scenarios, and such a recognition model has more practical value.

摘要

由作曲家创作并由多维乐器演奏的音乐是一种反映现实生活情感的艺术形式。历史上，人们主要通过乐谱记录和口头传播来传播音乐。其中，以乐谱形式记录音乐是一项伟大的音乐发明。它成为音乐交流和传承的载体，也是人类壮丽音乐文化的记录。数字技术的出现解决了乐谱存储和分发困难的问题。然而，使用图像格式的数据有很多缺点，从图像数据中提取可编辑形式的乐谱信息是当前面临的挑战。本文提出了一种改进的乐谱识别卷积神经网络。由于传统的卷积神经网络 SEGNET 对一些像素进行了错误分类，因此本文采用了特征金字塔结构。使用附加的分支路径来融合浅层图像细节、有利于小物体的浅层纹理特征以及全局信息的高层特征，丰富模型的多尺度语义信息，并缓解模型中多尺度语义信息不足的问题。语义信息导致识别性能差。通过比较其他模型的识别效果，实验结果表明，所提出的乐谱识别模型具有更高的识别精度和更强的泛化性能。改进的泛化性能使乐谱识别方法能够应用于更多类型的乐谱识别场景，这种识别模型具有更多的实用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c441/9110142/f22196305f0d/CIN2022-6071114.001.jpg

相似文献

Improved Feature Pyramid Convolutional Neural Network for Effective Recognition of Music Scores.改进的特征金字塔卷积神经网络，用于有效识别乐谱。

Comput Intell Neurosci. 2022 May 9;2022:6071114. doi: 10.1155/2022/6071114. eCollection 2022.

Construction of Music Intelligent Creation Model Based on Convolutional Neural Network.基于卷积神经网络的音乐智能创作模型构建。

Comput Intell Neurosci. 2022 Jul 5;2022:2854066. doi: 10.1155/2022/2854066. eCollection 2022.

Construction and Application of a Piano Playing Pitch Recognition Model Based on Neural Network.基于神经网络的钢琴弹奏音高识别模型的构建与应用。

Comput Intell Neurosci. 2022 Sep 17;2022:8431982. doi: 10.1155/2022/8431982. eCollection 2022.

The Importance of Being Familiar: The Role of Semantic Knowledge in the Activation of Emotions and Factual Knowledge from Music in the Semantic Variant of Primary Progressive Aphasia.熟悉的重要性：语义知识在原发性进行性失语症语义变体中对音乐情绪和事实知识激活的作用。

J Alzheimers Dis. 2022;85(1):115-128. doi: 10.3233/JAD-215083.

Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索

IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.

A Music Emotion Classification Model Based on the Improved Convolutional Neural Network.基于改进卷积神经网络的音乐情绪分类模型。

Comput Intell Neurosci. 2022 Feb 14;2022:6749622. doi: 10.1155/2022/6749622. eCollection 2022.

Design of Neural Network Model for Cross-Media Audio and Video Score Recognition Based on Convolutional Neural Network Model.基于卷积神经网络模型的跨媒体音视频评分识别神经网络模型设计。

Comput Intell Neurosci. 2022 Jun 13;2022:4626867. doi: 10.1155/2022/4626867. eCollection 2022.

Intelligent Classification Model of Music Emotional Environment Using Convolutional Neural Networks.基于卷积神经网络的音乐情感环境智能分类模型

J Environ Public Health. 2022 Aug 30;2022:7221064. doi: 10.1155/2022/7221064. eCollection 2022.

Embedding topological features into convolutional neural network salient object detection.将拓扑特征嵌入卷积神经网络显著目标检测中。

Neural Netw. 2020 Jan;121:308-318. doi: 10.1016/j.neunet.2019.09.009. Epub 2019 Sep 25.

A Multi-Modal Convolutional Neural Network Model for Intelligent Analysis of the Influence of Music Genres on Children's Emotions.一种用于智能分析音乐流派对儿童情绪影响的多模态卷积神经网络模型。

Comput Intell Neurosci. 2022 Jul 19;2022:4957085. doi: 10.1155/2022/4957085. eCollection 2022.

本文引用的文献

Two-Stream Attention Network for Pain Recognition from Video Sequences.基于双流注意力网络的视频序列疼痛识别

Sensors (Basel). 2020 Feb 4;20(3):839. doi: 10.3390/s20030839.

Breast ultrasound lesions recognition: end-to-end deep learning approaches.乳腺超声病变识别：端到端深度学习方法

J Med Imaging (Bellingham). 2019 Jan;6(1):011007. doi: 10.1117/1.JMI.6.1.011007. Epub 2018 Oct 10.

Hemispheric asymmetries in setticlavio reading.裂脑阅读的半球不对称性。

Neuropsychology. 2018 Mar;32(3):337-343. doi: 10.1037/neu0000430. Epub 2018 Feb 22.

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN：基于区域建议网络的实时目标检测。

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

改进的特征金字塔卷积神经网络，用于有效识别乐谱。

Improved Feature Pyramid Convolutional Neural Network for Effective Recognition of Music Scores.

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献