• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

《克里蒂萨米塔》:一个具有主音分类的南印度古典音乐音频片段的机器学习数据集。

KritiSamhita: A machine learning dataset of South Indian classical music audio clips with tonic classification.

作者信息

Konduri Samhita, Pendyala Kriti V, Pendyala Vishnu S

机构信息

Palo Alto High School, 50 Embarcadero Road, Palo Alto, CA 94301, USA.

University Preparatory Academy, 2315 Canoas Garden Ave, San Jose, CA 95125, USA.

出版信息

Data Brief. 2024 Jul 9;55:110730. doi: 10.1016/j.dib.2024.110730. eCollection 2024 Aug.

DOI:10.1016/j.dib.2024.110730
PMID:39081494
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11286976/
Abstract

There are currently a limited number of Indian classical music datasets, especially those large enough and with useful annotations, particularly the subtler ones, such as the tonic, for training classification or prediction models. The dataset described in this paper is created with useful tonic annotations, to fill this gap. The tonic pitch, or base pitch, plays an important role in music, so much so that it is sometimes called the keynote. The vocalists and the accompanying instrumental ensemble are fine-tuned to this keynote to render the composition. The first and second authors of this paper, who are vocalists themselves, recorded songs in four different tonics: F#, G, G#, and A. Using the Python library pydub, each 3+ minute song was segmented into 20-second snippets, including the remainder as a separate snippet. The raw audio snippet data is available in folders separated by tonic, and a directory contains each snippet's file path and tonic. This dataset can be reused for tonic classification work in the future, as well as for training other automated systems targeting higher-level attributes of ICM, such as melodic framework, as a tonic can be the basis for them all.

摘要

目前,印度古典音乐数据集数量有限,尤其是那些规模足够大且带有有用注释的数据集,特别是那些更细微的注释,如主音,用于训练分类或预测模型。本文描述的数据集带有有用的主音注释,以填补这一空白。主音音高,即基础音高,在音乐中起着重要作用,以至于有时被称为主调。歌手和伴奏乐器组会根据这个主调进行微调以呈现乐曲。本文的第一作者和第二作者本身就是歌手,他们录制了四种不同主音的歌曲:升F、G、升G和A。使用Python库pydub,每首3分钟多的歌曲被分割成20秒的片段,包括剩余部分作为一个单独的片段。原始音频片段数据按主音保存在不同的文件夹中,一个目录包含每个片段的文件路径和主音。该数据集未来可用于主音分类工作,也可用于训练其他针对印度古典音乐更高级属性(如旋律框架)的自动化系统,因为主音可以是所有这些属性的基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0aa4/11286976/c5b07f6eadf5/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0aa4/11286976/c5b07f6eadf5/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0aa4/11286976/c5b07f6eadf5/gr1.jpg

相似文献

1
KritiSamhita: A machine learning dataset of South Indian classical music audio clips with tonic classification.《克里蒂萨米塔》:一个具有主音分类的南印度古典音乐音频片段的机器学习数据集。
Data Brief. 2024 Jul 9;55:110730. doi: 10.1016/j.dib.2024.110730. eCollection 2024 Aug.
2
Notation of Javanese dataset for traditional music applications.用于传统音乐应用的爪哇语数据集的标注
Data Brief. 2024 Feb 6;53:110116. doi: 10.1016/j.dib.2024.110116. eCollection 2024 Apr.
3
A dataset for multimodal music information retrieval of Sotho-Tswana musical videos.一个用于索托-茨瓦纳音乐视频多模态音乐信息检索的数据集。
Data Brief. 2024 Jun 26;55:110672. doi: 10.1016/j.dib.2024.110672. eCollection 2024 Aug.
4
Design of Semiautomatic Digital Creation System for Electronic Music Based on Recurrent Neural Network.基于循环神经网络的电子音乐半自动数字创作系统设计。
Comput Intell Neurosci. 2022 Jun 27;2022:5457376. doi: 10.1155/2022/5457376. eCollection 2022.
5
Kiñit classification in Ethiopian chants, Azmaris and modern music: A new dataset and CNN benchmark.基尼特分类在埃塞俄比亚圣歌、阿兹玛里斯和现代音乐中的应用:一个新数据集和 CNN 基准。
PLoS One. 2023 Apr 20;18(4):e0284560. doi: 10.1371/journal.pone.0284560. eCollection 2023.
6
A compact pitch and time representation for melodic contours in Indian art music.一种紧凑的音高和时间表示法,用于印度艺术音乐中的旋律轮廓。
J Acoust Soc Am. 2019 Jan;145(1):597. doi: 10.1121/1.5087277.
7
Tonal hierarchies in the music of north India.印度北部音乐中的音调层次结构。
J Exp Psychol Gen. 1984 Sep;113(3):394-412. doi: 10.1037//0096-3445.113.3.394.
8
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
9
METER2800: A novel dataset for music time signature detection.METER2800:一个用于音乐节拍检测的新型数据集。
Data Brief. 2023 Oct 26;51:109736. doi: 10.1016/j.dib.2023.109736. eCollection 2023 Dec.
10
A hierarchical approach for speech-instrumental-song classification.一种用于语音-器乐-歌曲分类的分层方法。
Springerplus. 2013 Oct 17;2(1):526. doi: 10.1186/2193-1801-2-526. eCollection 2013.