Suppr超能文献

声音纹理感知中的级联幅度调制

Cascaded Amplitude Modulations in Sound Texture Perception.

作者信息

McWalter Richard, Dau Torsten

机构信息

Hearing Systems Group, Technical University of DenmarkKongens Lyngby, Denmark.

出版信息

Front Neurosci. 2017 Sep 11;11:485. doi: 10.3389/fnins.2017.00485. eCollection 2017.

Abstract

Sound textures, such as crackling fire or chirping crickets, represent a broad class of sounds defined by their homogeneous temporal structure. It has been suggested that the perception of texture is mediated by time-averaged summary statistics measured from early auditory representations. In this study, we investigated the perception of sound textures that contain rhythmic structure, specifically second-order amplitude modulations that arise from the interaction of different modulation rates, previously described as "beating" in the envelope-frequency domain. We developed an auditory texture model that utilizes a cascade of modulation filterbanks that capture the structure of simple rhythmic patterns. The model was examined in a series of psychophysical listening experiments using synthetic sound textures-stimuli generated using time-averaged statistics measured from real-world textures. In a texture identification task, our results indicated that second-order amplitude modulation sensitivity enhanced recognition. Next, we examined the contribution of the second-order modulation analysis in a preference task, where the proposed auditory texture model was preferred over a range of model deviants that lacked second-order modulation rate sensitivity. Lastly, the discriminability of textures that included second-order amplitude modulations appeared to be perceived using a time-averaging process. Overall, our results demonstrate that the inclusion of second-order modulation analysis generates improvements in the perceived quality of synthetic textures compared to the first-order modulation analysis considered in previous approaches.

摘要

声音纹理,如噼里啪啦的火焰声或唧唧叫的蟋蟀声,代表了一类由其均匀的时间结构定义的广泛声音。有人提出,纹理的感知是由从早期听觉表征中测量的时间平均汇总统计量介导的。在本研究中,我们调查了包含节奏结构的声音纹理的感知,具体来说是由不同调制率的相互作用产生的二阶幅度调制,在包络频率域中先前被描述为“拍频”。我们开发了一种听觉纹理模型,该模型利用一系列调制滤波器组来捕捉简单节奏模式的结构。该模型在一系列心理物理学听力实验中进行了检验,使用的是合成声音纹理——由从真实世界纹理中测量的时间平均统计量生成的刺激。在纹理识别任务中,我们的结果表明二阶幅度调制敏感性增强了识别效果。接下来,我们在一个偏好任务中检验了二阶调制分析的贡献,在该任务中,与一系列缺乏二阶调制率敏感性的模型变体相比,所提出的听觉纹理模型更受青睐。最后,包含二阶幅度调制的纹理的可辨别性似乎是通过时间平均过程来感知的。总体而言,我们的结果表明,与先前方法中考虑的一阶调制分析相比,纳入二阶调制分析可提高合成纹理的感知质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/4aae82ee9ea5/fnins-11-00485-g0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验