Konduri Samhita, Pendyala Kriti V, Pendyala Vishnu S
Palo Alto High School, 50 Embarcadero Road, Palo Alto, CA 94301, USA.
University Preparatory Academy, 2315 Canoas Garden Ave, San Jose, CA 95125, USA.
Data Brief. 2024 Jul 9;55:110730. doi: 10.1016/j.dib.2024.110730. eCollection 2024 Aug.
There are currently a limited number of Indian classical music datasets, especially those large enough and with useful annotations, particularly the subtler ones, such as the tonic, for training classification or prediction models. The dataset described in this paper is created with useful tonic annotations, to fill this gap. The tonic pitch, or base pitch, plays an important role in music, so much so that it is sometimes called the keynote. The vocalists and the accompanying instrumental ensemble are fine-tuned to this keynote to render the composition. The first and second authors of this paper, who are vocalists themselves, recorded songs in four different tonics: F#, G, G#, and A. Using the Python library pydub, each 3+ minute song was segmented into 20-second snippets, including the remainder as a separate snippet. The raw audio snippet data is available in folders separated by tonic, and a directory contains each snippet's file path and tonic. This dataset can be reused for tonic classification work in the future, as well as for training other automated systems targeting higher-level attributes of ICM, such as melodic framework, as a tonic can be the basis for them all.
目前,印度古典音乐数据集数量有限,尤其是那些规模足够大且带有有用注释的数据集,特别是那些更细微的注释,如主音,用于训练分类或预测模型。本文描述的数据集带有有用的主音注释,以填补这一空白。主音音高,即基础音高,在音乐中起着重要作用,以至于有时被称为主调。歌手和伴奏乐器组会根据这个主调进行微调以呈现乐曲。本文的第一作者和第二作者本身就是歌手,他们录制了四种不同主音的歌曲:升F、G、升G和A。使用Python库pydub,每首3分钟多的歌曲被分割成20秒的片段,包括剩余部分作为一个单独的片段。原始音频片段数据按主音保存在不同的文件夹中,一个目录包含每个片段的文件路径和主音。该数据集未来可用于主音分类工作,也可用于训练其他针对印度古典音乐更高级属性(如旋律框架)的自动化系统,因为主音可以是所有这些属性的基础。