Suppr超能文献

持续乐器声音中的频谱包络位置和形状。

Spectral envelope position and shape in sustained musical instrument sounds.

机构信息

Department of Medical Physics and Acoustics, Carl von Ossietzky University of Oldenburg, 26129 Oldenburg, Germany.

Department of Musicology, University of Vienna, 1090 Vienna, Austria.

出版信息

J Acoust Soc Am. 2021 Jun;149(6):3715. doi: 10.1121/10.0005088.

Abstract

It has been argued that the relative position of spectral envelopes along the frequency axis serves as a cue for musical instrument size (e.g., violin vs viola) and that the shape of the spectral envelope encodes family identity (violin vs flute). It is further known that fundamental frequency (F0), F0-register for specific instruments, and dynamic level strongly affect spectral properties of acoustical instrument sounds. However, the associations between these factors have not been rigorously quantified for a representative set of musical instruments. Here, we analyzed 5640 sounds from 50 sustained orchestral instruments sampled across their entire range of F0s at three dynamic levels. Regression of spectral centroid (SC) values that index envelope position indicated that smaller instruments possessed higher SC values for a majority of instrument classes (families), but SC also correlated with F0 and was strongly and consistently affected by the dynamic level. Instrument classification using relatively low-dimensional cepstral audio descriptors allowed for discrimination between instrument classes with accuracies beyond 80%. Envelope shape became much less indicative of instrument class whenever the classification problem involved generalization to different dynamic levels or F0-registers. These analyses confirm that spectral envelopes encode information about instrument size and family identity and highlight their dependence on F0(-register) and dynamic level.

摘要

有人认为,频谱包络在频率轴上的相对位置可以作为乐器大小(例如小提琴与中提琴)的线索,而频谱包络的形状则编码了乐器家族的身份(小提琴与长笛)。此外,基频(F0)、特定乐器的 F0 音域以及动态水平强烈影响乐器声音的频谱特性。然而,这些因素之间的关联尚未针对一组具有代表性的乐器进行严格量化。在这里,我们分析了 50 种持续管弦乐器的 5640 个声音,这些声音在三个动态水平上跨越了它们的整个 F0 范围进行采样。对频谱质心(SC)值的回归分析表明,对于大多数乐器类别(家族),较小的乐器具有更高的 SC 值,但 SC 也与 F0 相关,并且强烈且一致地受到动态水平的影响。使用相对低维的倒谱音频描述符进行乐器分类,可以实现超过 80%的乐器类别的准确区分。无论分类问题涉及到不同的动态水平或 F0 音域的泛化,包络形状都变得不那么能指示乐器类别。这些分析证实,频谱包络编码了有关乐器大小和家族身份的信息,并突出了它们对 F0(-音域)和动态水平的依赖。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验