
Intelligent auxiliary system for music performance under edge computing and long short-term recurrent neural networks.

Author information

KU School of Music, Lawrence, Kansas, United States of America.

Publication information

PLoS One. 2023 May 8;18(5):e0285496. doi: 10.1371/journal.pone.0285496. eCollection 2023.

Abstract

Music performance action generation, a research hotspot in computer vision and cross-sequence analysis, can be applied in many real-world scenarios. However, current generation methods consistently ignore the connection between music and performance actions, producing a strong sense of separation between the visual and the auditory content. This paper first analyzes the attention mechanism, the Recurrent Neural Network (RNN), and the Long Short-Term Memory (LSTM) RNN; the LSTM RNN is well suited to sequence data with strong temporal correlation. On this basis, the existing learning method is improved and a new model combining an attention mechanism with an LSTM RNN is proposed, which generates performance actions from music beat sequences. In addition, an image-description generative model with an attention mechanism is adopted and, combined with a non-recursive abstract RNN structure, the abstract RNN-LSTM network structure is optimized. Music beat recognition and dance movement extraction are used to allocate and adjust data resources in an edge server architecture. The model's loss function value serves as the evaluation metric. The proposed model's advantages lie mainly in the high accuracy and low consumption rate of dance movement recognition. The experimental results show that the model's loss reaches as low as 0.00026, and the generated video is best when the LSTM module has 3 layers, 256 nodes per layer, and a lookback value of 15. Compared with three other cross-domain sequence analysis models, the new model generates harmonious and rich performance action sequences while ensuring the stability of action generation, and excels at combining music with performance actions.
This work provides a practical reference for applying edge computing technology in intelligent auxiliary systems for music performance.
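The lookback hyperparameter reported above (a window of 15 past beats feeding each predicted action) can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `make_windows` and the toy beat/action sequences are assumptions for illustration only.

```python
LOOKBACK = 15  # window length the experiments report as best


def make_windows(beats, actions, lookback=LOOKBACK):
    """Pair each length-`lookback` window of beat features with the
    action that immediately follows it, sliding one step at a time."""
    xs, ys = [], []
    for t in range(len(beats) - lookback):
        xs.append(beats[t:t + lookback])   # input: beats t .. t+lookback-1
        ys.append(actions[t + lookback])   # target: action at t+lookback
    return xs, ys


# Toy data: 20 scalar beat features and matching action labels.
beats = list(range(20))
acts = [b * 2 for b in range(20)]
X, y = make_windows(beats, acts)
print(len(X), len(X[0]), y[0])  # 5 windows of length 15; first target is acts[15] = 30
```

In a model like the one described, each such window would be fed through the stacked LSTM layers, with attention weighting the per-step hidden states before the action is predicted.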


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd21/10166492/7c36d0e7ff8b/pone.0285496.g001.jpg
