• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

声音纹理感知中的级联幅度调制

Cascaded Amplitude Modulations in Sound Texture Perception.

作者信息

McWalter Richard, Dau Torsten

机构信息

Hearing Systems Group, Technical University of DenmarkKongens Lyngby, Denmark.

出版信息

Front Neurosci. 2017 Sep 11;11:485. doi: 10.3389/fnins.2017.00485. eCollection 2017.

DOI:10.3389/fnins.2017.00485
PMID:28955191
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5601004/
Abstract

Sound textures, such as crackling fire or chirping crickets, represent a broad class of sounds defined by their homogeneous temporal structure. It has been suggested that the perception of texture is mediated by time-averaged summary statistics measured from early auditory representations. In this study, we investigated the perception of sound textures that contain rhythmic structure, specifically second-order amplitude modulations that arise from the interaction of different modulation rates, previously described as "beating" in the envelope-frequency domain. We developed an auditory texture model that utilizes a cascade of modulation filterbanks that capture the structure of simple rhythmic patterns. The model was examined in a series of psychophysical listening experiments using synthetic sound textures-stimuli generated using time-averaged statistics measured from real-world textures. In a texture identification task, our results indicated that second-order amplitude modulation sensitivity enhanced recognition. Next, we examined the contribution of the second-order modulation analysis in a preference task, where the proposed auditory texture model was preferred over a range of model deviants that lacked second-order modulation rate sensitivity. Lastly, the discriminability of textures that included second-order amplitude modulations appeared to be perceived using a time-averaging process. Overall, our results demonstrate that the inclusion of second-order modulation analysis generates improvements in the perceived quality of synthetic textures compared to the first-order modulation analysis considered in previous approaches.

摘要

声音纹理,如噼里啪啦的火焰声或唧唧叫的蟋蟀声,代表了一类由其均匀的时间结构定义的广泛声音。有人提出,纹理的感知是由从早期听觉表征中测量的时间平均汇总统计量介导的。在本研究中,我们调查了包含节奏结构的声音纹理的感知,具体来说是由不同调制率的相互作用产生的二阶幅度调制,在包络频率域中先前被描述为“拍频”。我们开发了一种听觉纹理模型,该模型利用一系列调制滤波器组来捕捉简单节奏模式的结构。该模型在一系列心理物理学听力实验中进行了检验,使用的是合成声音纹理——由从真实世界纹理中测量的时间平均统计量生成的刺激。在纹理识别任务中,我们的结果表明二阶幅度调制敏感性增强了识别效果。接下来,我们在一个偏好任务中检验了二阶调制分析的贡献,在该任务中,与一系列缺乏二阶调制率敏感性的模型变体相比,所提出的听觉纹理模型更受青睐。最后,包含二阶幅度调制的纹理的可辨别性似乎是通过时间平均过程来感知的。总体而言,我们的结果表明,与先前方法中考虑的一阶调制分析相比,纳入二阶调制分析可提高合成纹理的感知质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/e9eac97016fc/fnins-11-00485-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/4aae82ee9ea5/fnins-11-00485-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/9784a1e61b37/fnins-11-00485-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/0f33f9a50a69/fnins-11-00485-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/b513c682821e/fnins-11-00485-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/a6a0b28b76dc/fnins-11-00485-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/e4ed7cb59fbd/fnins-11-00485-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/e9eac97016fc/fnins-11-00485-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/4aae82ee9ea5/fnins-11-00485-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/9784a1e61b37/fnins-11-00485-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/0f33f9a50a69/fnins-11-00485-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/b513c682821e/fnins-11-00485-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/a6a0b28b76dc/fnins-11-00485-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/e4ed7cb59fbd/fnins-11-00485-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a62/5601004/e9eac97016fc/fnins-11-00485-g0007.jpg

相似文献

1
Cascaded Amplitude Modulations in Sound Texture Perception.声音纹理感知中的级联幅度调制
Front Neurosci. 2017 Sep 11;11:485. doi: 10.3389/fnins.2017.00485. eCollection 2017.
2
A two-stage spectral model for sound texture perception: Synthesis and psychophysics.一种用于声音纹理感知的两阶段频谱模型:合成与心理物理学。
Iperception. 2023 Feb 22;14(1):20416695231157349. doi: 10.1177/20416695231157349. eCollection 2023 Jan-Feb.
3
Identification and Discrimination of Sound Textures in Hearing-Impaired and Older Listeners.听力受损和老年听众声音纹理的识别与区分。
Trends Hear. 2021 Jan-Dec;25:23312165211065608. doi: 10.1177/23312165211065608.
4
Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis.通过听觉外围的统计信息感知声音纹理:来自声音合成的证据。
Neuron. 2011 Sep 8;71(5):926-40. doi: 10.1016/j.neuron.2011.06.032.
5
Distinct neural ensemble response statistics are associated with recognition and discrimination of natural sound textures.不同的神经集合反应统计数据与自然声音纹理的识别和区分有关。
Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31482-31493. doi: 10.1073/pnas.2005644117. Epub 2020 Nov 20.
6
Processing of fast amplitude modulations in bat auditory cortex matches communication call-specific sound features.蝙蝠听觉皮层对快速幅度调制的处理与通讯叫声特有的声音特征相匹配。
J Neurophysiol. 2019 Apr 1;121(4):1501-1512. doi: 10.1152/jn.00748.2018. Epub 2019 Feb 20.
7
Sensitivity of neural responses in the inferior colliculus to statistical features of sound textures.下丘脑中神经反应对声音纹理统计特征的敏感性。
Hear Res. 2021 Dec;412:108357. doi: 10.1016/j.heares.2021.108357. Epub 2021 Oct 14.
8
Auditory responsive cortex in the squirrel monkey: neural responses to amplitude-modulated sounds.松鼠猴的听觉反应皮层:对调幅声音的神经反应。
Exp Brain Res. 1996 Mar;108(2):273-84. doi: 10.1007/BF00228100.
9
Specificity of brain reactions to second-order visual stimuli.大脑对二阶视觉刺激反应的特异性。
Vis Neurosci. 2015 Jan;32:E011. doi: 10.1017/S0952523815000085.
10
Exploring the distribution of statistical feature parameters for natural sound textures.探索自然声音纹理的统计特征参数分布。
PLoS One. 2021 Jun 23;16(6):e0238960. doi: 10.1371/journal.pone.0238960. eCollection 2021.

引用本文的文献

1
Developmental origins of natural sound perception.自然声音感知的发育起源。
Front Psychol. 2024 Dec 11;15:1474961. doi: 10.3389/fpsyg.2024.1474961. eCollection 2024.
2
Predicting tingling sensations induced by autonomous sensory meridian response (ASMR) videos based on sound texture statistics: a comparison to pleasant feelings.基于声音纹理统计预测自主感觉经络反应(ASMR)视频引起的刺痛感:与愉悦感的比较。
Philos Trans R Soc Lond B Biol Sci. 2024 Aug 26;379(1908):20230254. doi: 10.1098/rstb.2023.0254. Epub 2024 Jul 15.
3
Human Auditory Ecology: Extending Hearing Research to the Perception of Natural Soundscapes by Humans in Rapidly Changing Environments.

本文引用的文献

1
Learning Midlevel Auditory Codes from Natural Sound Statistics.从自然声音统计中学习中级听觉编码。
Neural Comput. 2018 Mar;30(3):631-669. doi: 10.1162/neco_a_01048. Epub 2017 Dec 8.
2
Representation of Maximally Regular Textures in Human Visual Cortex.人类视觉皮层中最大规则纹理的表征。
J Neurosci. 2016 Jan 20;36(3):714-29. doi: 10.1523/JNEUROSCI.2962-15.2016.
3
Brain responses in humans reveal ideal observer-like sensitivity to complex acoustic patterns.人类的大脑反应显示出对复杂声学模式的类理想观察者敏感性。
人类听觉生态学:将听觉研究扩展到人类在快速变化的环境中对自然声音景观的感知。
Trends Hear. 2023 Jan-Dec;27:23312165231212032. doi: 10.1177/23312165231212032.
4
Distinct neural ensemble response statistics are associated with recognition and discrimination of natural sound textures.不同的神经集合反应统计数据与自然声音纹理的识别和区分有关。
Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31482-31493. doi: 10.1073/pnas.2005644117. Epub 2020 Nov 20.
5
Illusory sound texture reveals multi-second statistical completion in auditory scene analysis.幻觉声音纹理揭示了听觉场景分析中的多秒统计完成。
Nat Commun. 2019 Nov 8;10(1):5096. doi: 10.1038/s41467-019-12893-0.
6
Cascaded Tuning to Amplitude Modulation for Natural Sound Recognition.级联调幅对自然声音识别。
J Neurosci. 2019 Jul 10;39(28):5517-5533. doi: 10.1523/JNEUROSCI.2914-18.2019. Epub 2019 May 15.
Proc Natl Acad Sci U S A. 2016 Feb 2;113(5):E616-25. doi: 10.1073/pnas.1508523113. Epub 2016 Jan 19.
4
Modulation-frequency-specific adaptation in awake auditory cortex.清醒听觉皮层中的调制频率特异性适应
J Neurosci. 2015 Apr 15;35(15):5904-16. doi: 10.1523/JNEUROSCI.4833-14.2015.
5
Perceptual spaces: mathematical structures to neural mechanisms.知觉空间:从数学结构到神经机制。
J Neurosci. 2013 Nov 6;33(45):17597-602. doi: 10.1523/JNEUROSCI.3343-13.2013.
6
Invariant scattering convolution networks.不变散射卷积网络。
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1872-86. doi: 10.1109/TPAMI.2012.230.
7
Summary statistics in auditory perception.听觉感知中的汇总统计。
Nat Neurosci. 2013 Apr;16(4):493-8. doi: 10.1038/nn.3347. Epub 2013 Feb 24.
8
Responses to second-order texture modulations undergo surround suppression.对二阶纹理调制的反应会受到外周抑制。
Vision Res. 2012 Jun 1;62:192-200. doi: 10.1016/j.visres.2012.03.008.
9
Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis.通过听觉外围的统计信息感知声音纹理:来自声音合成的证据。
Neuron. 2011 Sep 8;71(5):926-40. doi: 10.1016/j.neuron.2011.06.032.
10
Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.基于调制频率选择性处理后的信噪比包络功率比预测语音可懂度。
J Acoust Soc Am. 2011 Sep;130(3):1475-87. doi: 10.1121/1.3621502.