Alrowais Fadwa, Arasi Munya A, Alotaibi Saud S, Alonazi Mohammed, Marzouk Radwa, Salama Ahmed S
Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
Department of Computer Science, Applied College, King Khalid University, RijalAlmaa, Saudi Arabia.
PeerJ Comput Sci. 2025 Jan 24;11:e2265. doi: 10.7717/peerj-cs.2265. eCollection 2025.
Artificial intelligence (AI) in music improvisation offers promising new avenues for developing human creativity. This article addresses the difficulty of composing dynamic, flexible musical pieces in real time. We explore the use of reinforcement learning (RL) techniques to create more interactive and responsive music-creation systems. An RL agent is trained on musical structures to navigate the complex space of musical possibilities and produce improvisations. The melodic framework in the input musical data is first identified using bi-directional gated recurrent units. Musical elements such as notes, chords, and rhythms from the recognised framework are then transformed into a format suitable for RL input. The deep gradient-based reinforcement learning technique used in this research formulates a reward system that directs the agent to compose aesthetically intriguing and harmonically cohesive musical improvisations. The improvised music is then rendered in MIDI format. The Bach Chorales dataset, with six attributes relevant to musical composition, is used to implement the present research. The model was deployed in a containerised cloud environment and managed for smooth load distribution. Five parameters, namely pitch frequency (PF), standard pitch delay (SPD), average distance between peaks (ADP), note duration gradient (NDG), and pitch class gradient (PCG), are used to assess the quality of the improvised music. The proposed model obtains a PF of +0.15, an SPD of -0.43, an ADP of -0.07, and an NDG of 0.0041, better values than those of other improvisation methods.
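The abstract does not include the authors' implementation, so the following is a minimal sketch of the first stage only: a bi-directional GRU that encodes a note sequence into per-step "melodic framework" features suitable as RL input. The vocabulary size, embedding and hidden dimensions, and class name are illustrative assumptions, not values from the paper.

```python
# Minimal sketch (not the authors' code): a bi-directional GRU encoder that
# turns a note sequence into per-step melodic framework features.
import torch
import torch.nn as nn

VOCAB_SIZE = 130   # assumed encoding: 128 MIDI pitches + rest + hold tokens
EMBED_DIM = 64     # illustrative embedding size
HIDDEN_DIM = 128   # illustrative GRU hidden size

class MelodicFrameworkEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB_SIZE, EMBED_DIM)
        # Bi-directional GRU reads the melody in both time directions.
        self.gru = nn.GRU(EMBED_DIM, HIDDEN_DIM, batch_first=True,
                          bidirectional=True)

    def forward(self, notes):                 # notes: (batch, seq_len) int64
        x = self.embed(notes)                 # (batch, seq_len, EMBED_DIM)
        out, _ = self.gru(x)                  # (batch, seq_len, 2*HIDDEN_DIM)
        return out                            # per-step framework features

encoder = MelodicFrameworkEncoder()
frames = encoder(torch.randint(0, VOCAB_SIZE, (1, 32)))
print(frames.shape)  # torch.Size([1, 32, 256])
```

The bidirectional pass lets each time step's features reflect both preceding and following context, which is what makes the recognised framework useful for conditioning the improvisation policy.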
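The abstract names a deep gradient-based RL technique with a formulated reward but gives no details. The sketch below therefore uses plain REINFORCE with a placeholder consonance reward over melodic intervals, then renders the sampled pitches to a MIDI file with pretty_midi; the reward terms, policy head, starting pitch, and note timing are all assumptions for illustration.

```python
# REINFORCE-style sketch of the reward-driven improviser (assumptions labeled).
import torch
import torch.nn as nn
import pretty_midi

CONSONANT = {0, 3, 4, 5, 7, 8, 9}             # consonant intervals, semitones

def consonance_reward(prev_pitch, pitch):
    """+1 for a consonant melodic interval, -1 otherwise (placeholder reward)."""
    return 1.0 if abs(pitch - prev_pitch) % 12 in CONSONANT else -1.0

# Policy head maps 256-dim framework features to a distribution over pitches.
policy = nn.Sequential(nn.Linear(256, 128), nn.ReLU(), nn.Linear(128, 128))
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)

def improvise(frames):                        # frames: (1, T, 256) from encoder
    frames = frames.detach()                  # train only the policy head here
    log_probs, rewards, pitches, prev = [], [], [], 60   # start near middle C
    for t in range(frames.size(1)):
        dist = torch.distributions.Categorical(logits=policy(frames[0, t]))
        action = dist.sample()                # action = next MIDI pitch (0-127)
        log_probs.append(dist.log_prob(action))
        rewards.append(consonance_reward(prev, action.item()))
        pitches.append(action.item())
        prev = action.item()
    # REINFORCE objective: raise log-probabilities of rewarded actions.
    loss = -(torch.stack(log_probs) * torch.tensor(rewards)).sum()
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    return pitches

def render_midi(pitches, path="improvisation.mid", step=0.5):
    """Write sampled pitches as equal-duration notes in a MIDI file."""
    pm = pretty_midi.PrettyMIDI()
    inst = pretty_midi.Instrument(program=0)  # acoustic grand piano
    for i, p in enumerate(pitches):
        inst.notes.append(pretty_midi.Note(velocity=80, pitch=p,
                                           start=i * step, end=(i + 1) * step))
    pm.instruments.append(inst)
    pm.write(path)
```

A single hand-written reward like this is only a stand-in; the paper's reward system combines aesthetic interest with harmonic cohesion, which would correspond to a weighted sum of several such terms.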