Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA.
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
J R Soc Interface. 2024 Mar;21(212):20230647. doi: 10.1098/rsif.2023.0647. Epub 2024 Mar 20.
Cultural processes of change bear many resemblances to biological evolution. The underlying units of non-biological evolution have, however, remained elusive, especially in the domain of music. Here, we introduce a general framework to jointly identify underlying units and their associated evolutionary processes. We model musical styles and principles of organization in dimensions such as harmony and form as following an evolutionary process. Furthermore, we propose that such processes can be identified by extracting latent evolutionary signatures from musical corpora, analogously to identifying mutational signatures in genomics. These signatures provide a latent embedding for each song or musical piece. We develop a deep generative architecture for our model, which can be viewed as a type of variational autoencoder with an evolutionary prior constraining the latent space; specifically, the embeddings for each song are tied together via an energy-based prior, which encourages songs close in evolutionary space to share similar representations. As illustration, we analyse songs from the McGill Billboard dataset. We find frequent chord transitions and formal repetition schemes and identify latent evolutionary signatures related to these features. Finally, we show that the latent evolutionary representations learned by our model outperform non-evolutionary representations in such tasks as period and genre prediction.
文化变迁过程与生物进化有许多相似之处。然而,非生物进化的基本单位仍然难以捉摸,尤其是在音乐领域。在这里,我们引入了一个通用框架,以共同识别潜在的单位及其相关的进化过程。我们将音乐风格和组织原则建模为类似进化过程的和声和形式等维度。此外,我们提出可以通过从音乐语料库中提取潜在的进化特征来识别此类过程,类似于在基因组学中识别突变特征。这些特征为每首歌曲或音乐作品提供了潜在的嵌入。我们为我们的模型开发了一种深度生成架构,它可以被视为一种具有进化先验约束潜在空间的变分自动编码器;具体来说,通过基于能量的先验,将每首歌曲的嵌入联系在一起,鼓励进化空间中接近的歌曲具有相似的表示。作为说明,我们分析了 McGill Billboard 数据集的歌曲。我们发现了频繁的和弦转换和形式重复模式,并确定了与这些特征相关的潜在进化特征。最后,我们表明,我们的模型学习的潜在进化表示在诸如周期和类型预测等任务中优于非进化表示。