• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自然声音的调制频谱与听觉处理的行为学理论

Modulation spectra of natural sounds and ethological theories of auditory processing.

作者信息

Singh Nandini C, Theunissen Frédéric E

机构信息

Department of Psychology and Neuroscience Institute, University of California, Berkeley, 3210 Tolman Hall, Berkeley, California 94720-1650, USA.

出版信息

J Acoust Soc Am. 2003 Dec;114(6 Pt 1):3394-411. doi: 10.1121/1.1624067.

DOI:10.1121/1.1624067
PMID:14714819
Abstract

The modulation statistics of natural sound ensembles were analyzed by calculating the probability distributions of the amplitude envelope of the sounds and their time-frequency correlations given by the modulation spectra. These modulation spectra were obtained by calculating the two-dimensional Fourier transform of the autocorrelation matrix of the sound stimulus in its spectrographic representation. Since temporal bandwidth and spectral bandwidth are conjugate variables, it is shown that the joint modulation spectrum of sound occupies a restricted space: sounds cannot have rapid temporal and spectral modulations simultaneously. Within this restricted space, it is shown that natural sounds have a characteristic signature. Natural sounds, in general, are low-passed, showing most of their modulation energy for low temporal and spectral modulations. Animal vocalizations and human speech are further characterized by the fact that most of the spectral modulation power is found only for low temporal modulation. Similarly, the distribution of the amplitude envelopes also exhibits characteristic shapes for natural sounds, reflecting the high probability of epochs with no sound, systematic differences across frequencies, and a relatively uniform distribution for the log of the amplitudes for vocalizations. It is postulated that the auditory system as well as engineering applications may exploit these statistical properties to obtain an efficient representation of behaviorally relevant sounds. To test such a hypothesis we show how to create synthetic sounds with first and second order envelope statistics identical to those found in natural sounds.

摘要

通过计算声音幅度包络的概率分布及其由调制谱给出的时频相关性,分析了自然声音集合的调制统计特性。这些调制谱是通过计算声音刺激在其频谱表示中的自相关矩阵的二维傅里叶变换得到的。由于时间带宽和频谱带宽是共轭变量,研究表明声音的联合调制谱占据一个受限空间:声音不能同时具有快速的时间和频谱调制。在这个受限空间内,研究表明自然声音具有特征性标志。一般来说,自然声音是低通的,其大部分调制能量集中在低时间和频谱调制上。动物发声和人类语音的进一步特征在于,大部分频谱调制功率仅在低时间调制时出现。同样,幅度包络的分布对于自然声音也呈现出特征形状,反映了无声时段的高概率、不同频率间的系统性差异以及发声幅度对数的相对均匀分布。据推测,听觉系统以及工程应用可能会利用这些统计特性来获得与行为相关声音的有效表示。为了验证这一假设,我们展示了如何创建具有与自然声音中发现的一阶和二阶包络统计特性相同的合成声音。

相似文献

1
Modulation spectra of natural sounds and ethological theories of auditory processing.自然声音的调制频谱与听觉处理的行为学理论
J Acoust Soc Am. 2003 Dec;114(6 Pt 1):3394-411. doi: 10.1121/1.1624067.
2
Human cortical organization for processing vocalizations indicates representation of harmonic structure as a signal attribute.用于处理发声的人类皮质组织表明,谐波结构的表征是一种信号属性。
J Neurosci. 2009 Feb 18;29(7):2283-96. doi: 10.1523/JNEUROSCI.4145-08.2009.
3
Neural encoding of single-formant stimuli in the cat. I. Responses of auditory nerve fibers.猫对单共振峰刺激的神经编码。I. 听神经纤维的反应
J Neurophysiol. 1993 Sep;70(3):1054-75. doi: 10.1152/jn.1993.70.3.1054.
4
Mapping unpleasantness of sounds to their auditory representation.将声音的不悦感映射到其听觉表征上。
J Acoust Soc Am. 2008 Dec;124(6):3810-7. doi: 10.1121/1.3006380.
5
Acoustic variability and distinguishability among mouse ultrasound vocalizations.小鼠超声发声之间的声学变异性和可区分性。
J Acoust Soc Am. 2003 Dec;114(6 Pt 1):3412-22. doi: 10.1121/1.1623787.
6
Two stages of bandwidth scaling drives efficient neural coding of natural sounds.两个带宽扩展阶段促进了自然声音的高效神经编码。
PLoS Comput Biol. 2023 Feb 14;19(2):e1010862. doi: 10.1371/journal.pcbi.1010862. eCollection 2023 Feb.
7
Processing of spectral and amplitude envelope of animal vocalizations in the human auditory cortex.人类听觉皮层中动物发声的光谱和幅度包络处理。
Neuropsychologia. 2010 Aug;48(10):2824-32. doi: 10.1016/j.neuropsychologia.2010.05.024. Epub 2010 May 21.
8
Sparse codes of harmonic natural sounds and their modulatory interactions.谐波自然声音的稀疏编码及其调制相互作用。
Network. 2009;20(4):253-67. doi: 10.3109/09548980903447751.
9
Dynamics of frequency and amplitude modulations in vocalizations produced by eastern towhees, Pipilo erythrophthalmus.东唧鹀(Pipilo erythrophthalmus)发声中频率和振幅调制的动态变化
J Acoust Soc Am. 2004 Mar;115(3):1333-44. doi: 10.1121/1.1648976.
10
The phonochrome: a coherent spectro-temporal representation of sound.音色素:声音的一种连贯的频谱-时间表征。
Hear Res. 1981 Nov;5(2-3):123-45. doi: 10.1016/0378-5955(81)90042-3.

引用本文的文献

1
High-resolution fMRI reveals a dorsal brain pathway selective for conspecific vocalizations in macaques.高分辨率功能磁共振成像揭示了猕猴大脑中一条对同种发声有选择性的背侧脑通路。
Imaging Neurosci (Camb). 2025 Aug 13;3. doi: 10.1162/IMAG.a.108. eCollection 2025.
2
Precise spike-timing information in the brainstem is well aligned with the needs of communication and the perception of environmental sounds.脑干中精确的峰电位时间信息与交流需求及环境声音感知高度契合。
PLoS Biol. 2025 Jun 16;23(6):e3003213. doi: 10.1371/journal.pbio.3003213. eCollection 2025 Jun.
3
Acoustic estimation of voice roughness.
嗓音粗糙度的声学评估。
Atten Percept Psychophys. 2025 Apr 28. doi: 10.3758/s13414-025-03060-3.
4
A hierarchy of processing complexity and timescales for natural sounds in the human auditory cortex.人类听觉皮层中自然声音的处理复杂性和时间尺度层次结构。
Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2412243122. doi: 10.1073/pnas.2412243122. Epub 2025 Apr 28.
5
Rough is salient: a conserved vocal niche to hijack the brain's salience system.粗糙音显著:一个用于劫持大脑显著性系统的保守发声生态位。
Philos Trans R Soc Lond B Biol Sci. 2025 Apr 3;380(1923):20240020. doi: 10.1098/rstb.2024.0020.
6
How to analyse and manipulate nonlinear phenomena in voice recordings.如何分析和处理语音记录中的非线性现象。
Philos Trans R Soc Lond B Biol Sci. 2025 Apr 3;380(1923):20240003. doi: 10.1098/rstb.2024.0003.
7
Spectral Weighting of Monaural Cues for Auditory Localization in Sagittal Planes.矢状面听觉定位中双耳线索的频谱加权
Trends Hear. 2025 Jan-Dec;29:23312165251317027. doi: 10.1177/23312165251317027. Epub 2025 Mar 18.
8
Microprism-based two-photon imaging of the mouse inferior colliculus reveals novel organizational principles of the auditory midbrain.基于微棱镜的小鼠下丘双光子成像揭示了听觉中脑的新组织原则。
Elife. 2025 Mar 14;12:RP93063. doi: 10.7554/eLife.93063.
9
Expectation-driven sensory adaptations support enhanced acuity during categorical perception.期望驱动的感官适应有助于在分类感知过程中提高敏锐度。
Nat Neurosci. 2025 Apr;28(4):861-872. doi: 10.1038/s41593-025-01899-1. Epub 2025 Mar 13.
10
Reduced Neural Responses to Natural Foreground versus Background Sounds in the Auditory Cortex.听觉皮层对自然前景声音与背景声音的神经反应减弱。
J Neurosci. 2025 Mar 5;45(10):e0121242024. doi: 10.1523/JNEUROSCI.0121-24.2024.