• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SunoCaps:一个基于文本提示的带有情感注释的人工智能生成音乐的新颖数据集。

SunoCaps: A novel dataset of text-prompt based AI-generated music with emotion annotations.

作者信息

Civit M, Drai-Zerbib V, Lizcano D, Escalona M J

机构信息

Department of Communication and Education, Universidad Loyola Andalucía. Av. de las Universidades s/n. 41704 Sevilla, Spain.

LEAD - CNRS UMR5022 Université Bourgogne Institut Marey - I3M, 64 rue de Sully, Dijon 21000, France.

出版信息

Data Brief. 2024 Jul 18;55:110743. doi: 10.1016/j.dib.2024.110743. eCollection 2024 Aug.

DOI:10.1016/j.dib.2024.110743
PMID:39161878
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11332804/
Abstract

The SunoCaps dataset aims to provide an innovative contribution to music data. Expert description of human-made musical pieces, from the widely used MusicCaps dataset, are used as prompts for generating complete songs for this dataset. This Automatic Music Generation is done with the state-of-the-art Suno generator of audio-based music. A subset of 64 pieces from MusicCaps is currently included, with a total of 256 generated entries. This total stems from generating four different variations for each human piece; two versions based on the original caption and two versions based on the original aspect description. As an AI-generated music dataset, SunoCaps also includes expert-based information on prompt alignment, with the main differences between prompt and final generation annotated. Furthermore, annotations describing the main discrete emotions induced by the piece. This dataset can have an array of implementations, such as creating and improving music generation validation tools, training systems for multi-layered architectures and the optimization of music emotion estimation systems.

摘要

SunoCaps数据集旨在为音乐数据做出创新性贡献。来自广泛使用的MusicCaps数据集中人工制作音乐作品的专家描述,被用作生成该数据集完整歌曲的提示。这种自动音乐生成是使用基于音频的音乐的最先进的Suno生成器完成的。目前包含来自MusicCaps的64首作品的一个子集,共有256个生成的条目。这个总数来自为每首人工作品生成四种不同的变体;两个基于原始标题的版本和两个基于原始方面描述的版本。作为一个由人工智能生成的音乐数据集,SunoCaps还包括基于专家的提示对齐信息,并标注了提示与最终生成之间的主要差异。此外,还有描述作品引发的主要离散情绪的注释。该数据集可以有一系列应用,比如创建和改进音乐生成验证工具、用于多层架构的训练系统以及优化音乐情感估计系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb8d/11332804/9c02261b7f1b/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb8d/11332804/9c02261b7f1b/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb8d/11332804/9c02261b7f1b/gr1.jpg

相似文献

1
SunoCaps: A novel dataset of text-prompt based AI-generated music with emotion annotations.SunoCaps:一个基于文本提示的带有情感注释的人工智能生成音乐的新颖数据集。
Data Brief. 2024 Jul 18;55:110743. doi: 10.1016/j.dib.2024.110743. eCollection 2024 Aug.
2
MERP: A Music Dataset with Emotion Ratings and Raters' Profile Information.MERP:一个带有情感评级和评级者信息的音乐数据集。
Sensors (Basel). 2022 Dec 29;23(1):382. doi: 10.3390/s23010382.
3
A dataset of text prompts, videos and video quality metrics from generative text-to-video AI models.一个来自生成式文本到视频人工智能模型的文本提示、视频及视频质量指标的数据集。
Data Brief. 2024 May 11;54:110514. doi: 10.1016/j.dib.2024.110514. eCollection 2024 Jun.
4
An artificial intelligence-based classifier for musical emotion expression in media education.一种用于媒体教育中音乐情感表达的基于人工智能的分类器。
PeerJ Comput Sci. 2023 Jul 14;9:e1472. doi: 10.7717/peerj-cs.1472. eCollection 2023.
5
Generating rhythm game music with jukebox.使用自动点唱机生成节奏游戏音乐。
Front Artif Intell. 2024 Jul 5;7:1296034. doi: 10.3389/frai.2024.1296034. eCollection 2024.
6
Long Short-Term Memory-Based Music Analysis System for Music Therapy.用于音乐治疗的基于长短期记忆的音乐分析系统
Front Psychol. 2022 Jun 14;13:928048. doi: 10.3389/fpsyg.2022.928048. eCollection 2022.
7
A Lightweight Deep Learning-Based Approach for Jazz Music Generation in MIDI Format.一种基于轻量级深度学习的 MIDI 格式爵士音乐生成方法。
Comput Intell Neurosci. 2022 Aug 5;2022:2140895. doi: 10.1155/2022/2140895. eCollection 2022.
8
EmotionBox: A music-element-driven emotional music generation system based on music psychology.情感盒子:一种基于音乐心理学的音乐元素驱动的情感音乐生成系统。
Front Psychol. 2022 Aug 29;13:841926. doi: 10.3389/fpsyg.2022.841926. eCollection 2022.
9
Creating musical features using multi-faceted, multi-task encoders based on transformers.基于转换器的多方面、多任务编码器创建音乐特征。
Sci Rep. 2023 Jul 3;13(1):10713. doi: 10.1038/s41598-023-36714-z.
10
Generative artificial intelligence to produce high-fidelity blastocyst-stage embryo images.生成式人工智能生成高保真囊胚期胚胎图像。
Hum Reprod. 2024 Jun 3;39(6):1197-1207. doi: 10.1093/humrep/deae064.

引用本文的文献

1
FakeMusicCaps: A Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models.FakeMusicCaps:一个用于检测和归因通过文本到音乐模型生成的合成音乐的数据集。
J Imaging. 2025 Jul 18;11(7):242. doi: 10.3390/jimaging11070242.

本文引用的文献

1
Ultra-short term heart rate variability as a tool to assess changes in valence.超短期心率变异性可作为评估情绪变化的工具。
Psychiatry Res. 2018 Dec;270:517-522. doi: 10.1016/j.psychres.2018.10.005. Epub 2018 Oct 6.
2
On the Importance of Both Dimensional and Discrete Models of Emotion.论情绪的维度模型与离散模型的重要性。
Behav Sci (Basel). 2017 Sep 29;7(4):66. doi: 10.3390/bs7040066.
3
How much training data for facial action unit detection?面部动作单元检测需要多少训练数据?
IEEE Int Conf Autom Face Gesture Recognit Workshops. 2015 May;1. doi: 10.1109/FG.2015.7163106.
4
Measuring emotion: the Self-Assessment Manikin and the Semantic Differential.测量情绪:自评人偶法与语义差异法。
J Behav Ther Exp Psychiatry. 1994 Mar;25(1):49-59. doi: 10.1016/0005-7916(94)90063-9.