Suppr超能文献

改进的特征金字塔卷积神经网络,用于有效识别乐谱。

Improved Feature Pyramid Convolutional Neural Network for Effective Recognition of Music Scores.

机构信息

College of Music, Handan University, Handan 056005, Hebei Province, China.

出版信息

Comput Intell Neurosci. 2022 May 9;2022:6071114. doi: 10.1155/2022/6071114. eCollection 2022.

Abstract

Music written by composers and performed by multidimensional instruments is an art form that reflects real-life emotions. Historically, people disseminated music primarily through sheet music recording and oral transmission. Among them, recording music in sheet music form was a great musical invention. It became the carrier of music communication and inheritance, as well as a record of humanity's magnificent music culture. The advent of digital technology solves the problem of difficult musical score storage and distribution. However, there are many drawbacks to using data in image format, and extracting music score information in editable form from image data is currently a challenge. An improved convolutional neural network for musical score recognition is proposed in this paper. Because the traditional convolutional neural network SEGNET misclassifies some pixels, this paper employs the feature pyramid structure. Use additional branch paths to fuse shallow image details, shallow texture features that are beneficial to small objects, and high-level features of global information, enrich the multi-scale semantic information of the model, and alleviate the problem of the lack of multiscale semantic information in the model. Poor recognition performance is caused by semantic information. By comparing the recognition effects of other models, the experimental results show that the proposed musical score recognition model has a higher recognition accuracy and a stronger generalization performance. The improved generalization performance allows the musical score recognition method to be applied to more types of musical score recognition scenarios, and such a recognition model has more practical value.

摘要

由作曲家创作并由多维乐器演奏的音乐是一种反映现实生活情感的艺术形式。历史上,人们主要通过乐谱记录和口头传播来传播音乐。其中,以乐谱形式记录音乐是一项伟大的音乐发明。它成为音乐交流和传承的载体,也是人类壮丽音乐文化的记录。数字技术的出现解决了乐谱存储和分发困难的问题。然而,使用图像格式的数据有很多缺点,从图像数据中提取可编辑形式的乐谱信息是当前面临的挑战。本文提出了一种改进的乐谱识别卷积神经网络。由于传统的卷积神经网络 SEGNET 对一些像素进行了错误分类,因此本文采用了特征金字塔结构。使用附加的分支路径来融合浅层图像细节、有利于小物体的浅层纹理特征以及全局信息的高层特征,丰富模型的多尺度语义信息,并缓解模型中多尺度语义信息不足的问题。语义信息导致识别性能差。通过比较其他模型的识别效果,实验结果表明,所提出的乐谱识别模型具有更高的识别精度和更强的泛化性能。改进的泛化性能使乐谱识别方法能够应用于更多类型的乐谱识别场景,这种识别模型具有更多的实用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c441/9110142/f22196305f0d/CIN2022-6071114.001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验