声音纹理感知中的级联幅度调制

Cascaded Amplitude Modulations in Sound Texture Perception.

作者信息

McWalter Richard, Dau Torsten

机构信息

Hearing Systems Group, Technical University of DenmarkKongens Lyngby, Denmark.

出版信息

Front Neurosci. 2017 Sep 11;11:485. doi: 10.3389/fnins.2017.00485. eCollection 2017.

DOI:10.3389/fnins.2017.00485

PMID:28955191

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5601004/

Abstract

Sound textures, such as crackling fire or chirping crickets, represent a broad class of sounds defined by their homogeneous temporal structure. It has been suggested that the perception of texture is mediated by time-averaged summary statistics measured from early auditory representations. In this study, we investigated the perception of sound textures that contain rhythmic structure, specifically second-order amplitude modulations that arise from the interaction of different modulation rates, previously described as "beating" in the envelope-frequency domain. We developed an auditory texture model that utilizes a cascade of modulation filterbanks that capture the structure of simple rhythmic patterns. The model was examined in a series of psychophysical listening experiments using synthetic sound textures-stimuli generated using time-averaged statistics measured from real-world textures. In a texture identification task, our results indicated that second-order amplitude modulation sensitivity enhanced recognition. Next, we examined the contribution of the second-order modulation analysis in a preference task, where the proposed auditory texture model was preferred over a range of model deviants that lacked second-order modulation rate sensitivity. Lastly, the discriminability of textures that included second-order amplitude modulations appeared to be perceived using a time-averaging process. Overall, our results demonstrate that the inclusion of second-order modulation analysis generates improvements in the perceived quality of synthetic textures compared to the first-order modulation analysis considered in previous approaches.

摘要

声音纹理，如噼里啪啦的火焰声或唧唧叫的蟋蟀声，代表了一类由其均匀的时间结构定义的广泛声音。有人提出，纹理的感知是由从早期听觉表征中测量的时间平均汇总统计量介导的。在本研究中，我们调查了包含节奏结构的声音纹理的感知，具体来说是由不同调制率的相互作用产生的二阶幅度调制，在包络频率域中先前被描述为“拍频”。我们开发了一种听觉纹理模型，该模型利用一系列调制滤波器组来捕捉简单节奏模式的结构。该模型在一系列心理物理学听力实验中进行了检验，使用的是合成声音纹理——由从真实世界纹理中测量的时间平均统计量生成的刺激。在纹理识别任务中，我们的结果表明二阶幅度调制敏感性增强了识别效果。接下来，我们在一个偏好任务中检验了二阶调制分析的贡献，在该任务中，与一系列缺乏二阶调制率敏感性的模型变体相比，所提出的听觉纹理模型更受青睐。最后，包含二阶幅度调制的纹理的可辨别性似乎是通过时间平均过程来感知的。总体而言，我们的结果表明，与先前方法中考虑的一阶调制分析相比，纳入二阶调制分析可提高合成纹理的感知质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/4aae82ee9ea5/fnins-11-00485-g0001.jpg

相似文献

Cascaded Amplitude Modulations in Sound Texture Perception.

Front Neurosci. 2017 Sep 11;11:485. doi: 10.3389/fnins.2017.00485. eCollection 2017.

A two-stage spectral model for sound texture perception: Synthesis and psychophysics.

Iperception. 2023 Feb 22;14(1):20416695231157349. doi: 10.1177/20416695231157349. eCollection 2023 Jan-Feb.

Identification and Discrimination of Sound Textures in Hearing-Impaired and Older Listeners.

Trends Hear. 2021 Jan-Dec;25:23312165211065608. doi: 10.1177/23312165211065608.

Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis.

Neuron. 2011 Sep 8;71(5):926-40. doi: 10.1016/j.neuron.2011.06.032.

Distinct neural ensemble response statistics are associated with recognition and discrimination of natural sound textures.

Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31482-31493. doi: 10.1073/pnas.2005644117. Epub 2020 Nov 20.

Processing of fast amplitude modulations in bat auditory cortex matches communication call-specific sound features.

J Neurophysiol. 2019 Apr 1;121(4):1501-1512. doi: 10.1152/jn.00748.2018. Epub 2019 Feb 20.

Sensitivity of neural responses in the inferior colliculus to statistical features of sound textures.

Hear Res. 2021 Dec;412:108357. doi: 10.1016/j.heares.2021.108357. Epub 2021 Oct 14.

Auditory responsive cortex in the squirrel monkey: neural responses to amplitude-modulated sounds.

Exp Brain Res. 1996 Mar;108(2):273-84. doi: 10.1007/BF00228100.

Specificity of brain reactions to second-order visual stimuli.

Vis Neurosci. 2015 Jan;32:E011. doi: 10.1017/S0952523815000085.

Exploring the distribution of statistical feature parameters for natural sound textures.

PLoS One. 2021 Jun 23;16(6):e0238960. doi: 10.1371/journal.pone.0238960. eCollection 2021.

引用本文的文献

Developmental origins of natural sound perception.

Front Psychol. 2024 Dec 11;15:1474961. doi: 10.3389/fpsyg.2024.1474961. eCollection 2024.

Predicting tingling sensations induced by autonomous sensory meridian response (ASMR) videos based on sound texture statistics: a comparison to pleasant feelings.

Philos Trans R Soc Lond B Biol Sci. 2024 Aug 26;379(1908):20230254. doi: 10.1098/rstb.2023.0254. Epub 2024 Jul 15.

Human Auditory Ecology: Extending Hearing Research to the Perception of Natural Soundscapes by Humans in Rapidly Changing Environments.

Trends Hear. 2023 Jan-Dec;27:23312165231212032. doi: 10.1177/23312165231212032.

Distinct neural ensemble response statistics are associated with recognition and discrimination of natural sound textures.

Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31482-31493. doi: 10.1073/pnas.2005644117. Epub 2020 Nov 20.

Illusory sound texture reveals multi-second statistical completion in auditory scene analysis.

Nat Commun. 2019 Nov 8;10(1):5096. doi: 10.1038/s41467-019-12893-0.

Cascaded Tuning to Amplitude Modulation for Natural Sound Recognition.

J Neurosci. 2019 Jul 10;39(28):5517-5533. doi: 10.1523/JNEUROSCI.2914-18.2019. Epub 2019 May 15.

本文引用的文献

Learning Midlevel Auditory Codes from Natural Sound Statistics.

Neural Comput. 2018 Mar;30(3):631-669. doi: 10.1162/neco_a_01048. Epub 2017 Dec 8.

Representation of Maximally Regular Textures in Human Visual Cortex.

J Neurosci. 2016 Jan 20;36(3):714-29. doi: 10.1523/JNEUROSCI.2962-15.2016.

Brain responses in humans reveal ideal observer-like sensitivity to complex acoustic patterns.

Proc Natl Acad Sci U S A. 2016 Feb 2;113(5):E616-25. doi: 10.1073/pnas.1508523113. Epub 2016 Jan 19.

Modulation-frequency-specific adaptation in awake auditory cortex.

J Neurosci. 2015 Apr 15;35(15):5904-16. doi: 10.1523/JNEUROSCI.4833-14.2015.

Perceptual spaces: mathematical structures to neural mechanisms.

J Neurosci. 2013 Nov 6;33(45):17597-602. doi: 10.1523/JNEUROSCI.3343-13.2013.

Invariant scattering convolution networks.

IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1872-86. doi: 10.1109/TPAMI.2012.230.

Summary statistics in auditory perception.

Nat Neurosci. 2013 Apr;16(4):493-8. doi: 10.1038/nn.3347. Epub 2013 Feb 24.

Responses to second-order texture modulations undergo surround suppression.

Vision Res. 2012 Jun 1;62:192-200. doi: 10.1016/j.visres.2012.03.008.

Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis.

Neuron. 2011 Sep 8;71(5):926-40. doi: 10.1016/j.neuron.2011.06.032.

Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.

J Acoust Soc Am. 2011 Sep;130(3):1475-87. doi: 10.1121/1.3621502.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

声音纹理感知中的级联幅度调制

Cascaded Amplitude Modulations in Sound Texture Perception.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献