• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过听觉外围的统计信息感知声音纹理:来自声音合成的证据。

Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis.

机构信息

Howard Hughes Medical Institute, New York University, New York, NY 10003, USA.

出版信息

Neuron. 2011 Sep 8;71(5):926-40. doi: 10.1016/j.neuron.2011.06.032.

DOI:10.1016/j.neuron.2011.06.032
PMID:21903084
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4143345/
Abstract

Rainstorms, insect swarms, and galloping horses produce "sound textures"--the collective result of many similar acoustic events. Sound textures are distinguished by temporal homogeneity, suggesting they could be recognized with time-averaged statistics. To test this hypothesis, we processed real-world textures with an auditory model containing filters tuned for sound frequencies and their modulations, and measured statistics of the resulting decomposition. We then assessed the realism and recognizability of novel sounds synthesized to have matching statistics. Statistics of individual frequency channels, capturing spectral power and sparsity, generally failed to produce compelling synthetic textures; however, combining them with correlations between channels produced identifiable and natural-sounding textures. Synthesis quality declined if statistics were computed from biologically implausible auditory models. The results suggest that sound texture perception is mediated by relatively simple statistics of early auditory representations, presumably computed by downstream neural populations. The synthesis methodology offers a powerful tool for their further investigation.

摘要

暴雨、虫群和奔腾的马群会产生“声音纹理”——许多相似声学事件的综合结果。声音纹理的时间均一性特征明显,这表明它们可以通过时间平均统计数据来识别。为了验证这一假设,我们使用包含针对声音频率及其调制进行调谐的滤波器的听觉模型来处理真实世界的纹理,并测量由此产生的分解的统计数据。然后,我们评估了为匹配统计数据而合成的新颖声音的逼真度和可识别度。单个频率通道的统计数据,捕捉频谱功率和稀疏性,通常无法产生引人入胜的合成纹理;然而,将它们与通道之间的相关性相结合,会产生可识别且听起来自然的纹理。如果统计数据是从生物上不可信的听觉模型中计算出来的,则合成质量会下降。结果表明,声音纹理感知是由早期听觉表示的相对简单的统计数据介导的,这些统计数据可能是由下游神经群体计算出来的。该合成方法为进一步研究提供了一个强大的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/908ab0fea51f/nihms-325406-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/62b292df8b52/nihms-325406-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/63b968c4eaae/nihms-325406-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/9defc360a717/nihms-325406-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/05976dbc698b/nihms-325406-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/645656ca65ac/nihms-325406-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/0e2fd8f281a0/nihms-325406-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/40c8b7dbb11a/nihms-325406-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/908ab0fea51f/nihms-325406-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/62b292df8b52/nihms-325406-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/63b968c4eaae/nihms-325406-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/9defc360a717/nihms-325406-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/05976dbc698b/nihms-325406-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/645656ca65ac/nihms-325406-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/0e2fd8f281a0/nihms-325406-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/40c8b7dbb11a/nihms-325406-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a47b/4143345/908ab0fea51f/nihms-325406-f0009.jpg

相似文献

1
Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis.通过听觉外围的统计信息感知声音纹理:来自声音合成的证据。
Neuron. 2011 Sep 8;71(5):926-40. doi: 10.1016/j.neuron.2011.06.032.
2
Adaptive and Selective Time Averaging of Auditory Scenes.听觉场景的自适应和选择性时间平均化。
Curr Biol. 2018 May 7;28(9):1405-1418.e10. doi: 10.1016/j.cub.2018.03.049. Epub 2018 Apr 19.
3
Exploring the distribution of statistical feature parameters for natural sound textures.探索自然声音纹理的统计特征参数分布。
PLoS One. 2021 Jun 23;16(6):e0238960. doi: 10.1371/journal.pone.0238960. eCollection 2021.
4
Distinct neural ensemble response statistics are associated with recognition and discrimination of natural sound textures.不同的神经集合反应统计数据与自然声音纹理的识别和区分有关。
Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31482-31493. doi: 10.1073/pnas.2005644117. Epub 2020 Nov 20.
5
Dissociable Roles of the Auditory Midbrain and Cortex in Processing the Statistical Features of Natural Sound Textures.听觉中脑和皮层在处理自然声音纹理统计特征中的分离作用。
J Neurosci. 2024 Mar 6;44(10):e1115232023. doi: 10.1523/JNEUROSCI.1115-23.2023.
6
Hemispheric asymmetries in the auditory cortex reflect discriminative responses to temporal details or summary statistics of stationary sounds.听觉皮层中的半球不对称性反映了对稳态声音的时间细节或统计概要的辨别反应。
Cortex. 2025 Mar;184:79-95. doi: 10.1016/j.cortex.2024.09.020. Epub 2025 Jan 7.
7
Tuning for spectro-temporal modulations as a mechanism for auditory discrimination of natural sounds.调谐至频谱-时间调制作为对自然声音进行听觉辨别的一种机制。
Nat Neurosci. 2005 Oct;8(10):1371-9. doi: 10.1038/nn1536. Epub 2005 Sep 4.
8
Cortical representation of natural complex sounds: effects of acoustic features and auditory object category.自然复杂声音的皮质代表:声音特征和听觉对象类别的影响。
J Neurosci. 2010 Jun 2;30(22):7604-12. doi: 10.1523/JNEUROSCI.0296-10.2010.
9
Cascaded Amplitude Modulations in Sound Texture Perception.声音纹理感知中的级联幅度调制
Front Neurosci. 2017 Sep 11;11:485. doi: 10.3389/fnins.2017.00485. eCollection 2017.
10
Neural modulation tuning characteristics scale to efficiently encode natural sound statistics.神经调节调谐特性可按比例有效地对自然声音统计进行编码。
J Neurosci. 2010 Nov 24;30(47):15969-80. doi: 10.1523/JNEUROSCI.0966-10.2010.

引用本文的文献

1
Natural sounds can be reconstructed from human neuroimaging data using deep neural network representation.利用深度神经网络表示,可以从人类神经成像数据中重建自然声音。
PLoS Biol. 2025 Jul 23;23(7):e3003293. doi: 10.1371/journal.pbio.3003293. eCollection 2025 Jul.
2
Multiple timescales of context influence perceptual sensitivity to common pairings of musical pitch and timbre.上下文的多个时间尺度会影响对音高和音色常见组合的感知敏感性。
PLoS One. 2025 Jul 18;20(7):e0328490. doi: 10.1371/journal.pone.0328490. eCollection 2025.
3
Perceptual clustering in auditory streaming.

本文引用的文献

1
Orthogonal representation of sound dimensions in the primate midbrain.灵长类动物中脑声音维度的正交表示。
Nat Neurosci. 2011 Apr;14(4):423-5. doi: 10.1038/nn.2771. Epub 2011 Mar 6.
2
Recovering sound sources from embedded repetition.从嵌入重复中恢复声源。
Proc Natl Acad Sci U S A. 2011 Jan 18;108(3):1188-93. doi: 10.1073/pnas.1004765108. Epub 2011 Jan 3.
3
Neural modulation tuning characteristics scale to efficiently encode natural sound statistics.神经调节调谐特性可按比例有效地对自然声音统计进行编码。
听觉流中的感知聚类
PLoS Comput Biol. 2025 Jul 11;21(7):e1013189. doi: 10.1371/journal.pcbi.1013189. eCollection 2025 Jul.
4
Interference of mid-level speech and noise statistics underlies human speech recognition sensitivity in natural environmental noise.中级语音和噪声统计特性的干扰是自然环境噪声中人类语音识别敏感性的基础。
J Neurosci. 2025 Jul 8. doi: 10.1523/JNEUROSCI.1751-24.2025.
5
Optimized feature gains explain and predict successes and failures of human selective listening.优化后的特征增益能够解释并预测人类选择性听力的成败。
bioRxiv. 2025 May 28:2025.05.28.656682. doi: 10.1101/2025.05.28.656682.
6
A complex acoustical environment is necessary for maintenance and development in the zebra finch auditory pallium.复杂的声学环境对于斑胸草雀听觉皮层的维持和发育是必要的。
bioRxiv. 2025 May 23:2025.05.22.655494. doi: 10.1101/2025.05.22.655494.
7
Enhanced neural speech tracking through noise indicates stochastic resonance in humans.通过噪声增强的神经语音跟踪表明人类存在随机共振。
Elife. 2025 Mar 18;13:RP100830. doi: 10.7554/eLife.100830.
8
Impaired Prosodic Processing but Not Hearing Function Is Associated with an Age-Related Reduction in AI Speech Recognition.韵律加工受损而非听力功能与年龄相关的人工耳蜗语音识别能力下降有关。
Audiol Res. 2025 Feb 8;15(1):14. doi: 10.3390/audiolres15010014.
9
Measuring self-similarity in empirical signals to understand musical beat perception.测量经验信号中的自相似性以理解音乐节拍感知。
Eur J Neurosci. 2025 Jan;61(2):e16637. doi: 10.1111/ejn.16637.
10
Reduced Neural Responses to Natural Foreground versus Background Sounds in the Auditory Cortex.听觉皮层对自然前景声音与背景声音的神经反应减弱。
J Neurosci. 2025 Mar 5;45(10):e0121242024. doi: 10.1523/JNEUROSCI.0121-24.2024.
J Neurosci. 2010 Nov 24;30(47):15969-80. doi: 10.1523/JNEUROSCI.0966-10.2010.
4
The cocktail party problem.鸡尾酒会问题。
Curr Biol. 2009 Dec 1;19(22):R1024-7. doi: 10.1016/j.cub.2009.09.005.
5
Influence of context and behavior on stimulus reconstruction from neural activity in primary auditory cortex.初级听觉皮层神经活动中刺激重构的上下文和行为影响。
J Neurophysiol. 2009 Dec;102(6):3329-39. doi: 10.1152/jn.91128.2008. Epub 2009 Sep 16.
6
Spectro-temporal modulation transfer function of single voxels in the human auditory cortex measured with high-resolution fMRI.利用高分辨率功能磁共振成像测量人脑听觉皮层单像素的光谱-时间调制传递函数。
Proc Natl Acad Sci U S A. 2009 Aug 25;106(34):14611-6. doi: 10.1073/pnas.0907682106. Epub 2009 Aug 10.
7
Spatial ensemble statistics are efficient codes that can be represented with reduced attention.空间整体统计是可以用较少注意力来表示的高效编码。
Proc Natl Acad Sci U S A. 2009 May 5;106(18):7345-50. doi: 10.1073/pnas.0808981106. Epub 2009 Apr 20.
8
A cocktail party with a cortical twist: how cortical mechanisms contribute to sound segregation.一场带有皮层转折的鸡尾酒会:皮层机制如何促进声音分离。
J Acoust Soc Am. 2008 Dec;124(6):3751-71. doi: 10.1121/1.3001672.
9
Encoding of spectral correlation over time in auditory cortex.听觉皮层中随时间变化的频谱相关性编码。
J Neurosci. 2008 Dec 3;28(49):13268-73. doi: 10.1523/JNEUROSCI.4596-08.2008.
10
The cocktail party problem: what is it? How can it be solved? And why should animal behaviorists study it?鸡尾酒会问题:它是什么?如何解决?动物行为学家为何要研究它?
J Comp Psychol. 2008 Aug;122(3):235-51. doi: 10.1037/0735-7036.122.3.235.