一种用于声音纹理感知的两阶段频谱模型：合成与心理物理学。

A two-stage spectral model for sound texture perception: Synthesis and psychophysics.

作者信息

Maruyama Hironori, Okada Kosuke, Motoyoshi Isamu

机构信息

Department of Life Sciences, The University of Tokyo, Japan.

出版信息

Iperception. 2023 Feb 22;14(1):20416695231157349. doi: 10.1177/20416695231157349. eCollection 2023 Jan-Feb.

DOI:10.1177/20416695231157349

PMID:36845027

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9950610/

Abstract

The natural environment is filled with a variety of auditory events such as wind blowing, water flowing, and fire crackling. It has been suggested that the perception of such textural sounds is based on the statistics of the natural auditory events. Inspired by a recent spectral model for visual texture perception, we propose a model that can describe the perceived sound texture only with the linear spectrum and the energy spectrum. We tested the validity of the model by using synthetic noise sounds that preserve the two-stage amplitude spectra of the original sound. Psychophysical experiment showed that our synthetic noises were perceived as like the original sounds for 120 real-world auditory events. The performance was comparable with the synthetic sounds produced by McDermott-Simoncelli's model which considers various classes of auditory statistics. The results support the notion that the perception of natural sound textures is predictable by the two-stage spectral signals.

摘要

自然环境中充满了各种听觉事件，如风吹、水流和火的噼啪声。有人提出，对这种纹理声音的感知是基于自然听觉事件的统计数据。受最近视觉纹理感知光谱模型的启发，我们提出了一种仅用线性谱和能谱来描述感知到的声音纹理的模型。我们通过使用保留原始声音两阶段振幅谱的合成噪声声音来测试该模型的有效性。心理物理学实验表明，对于120个真实世界的听觉事件，我们的合成噪声被感知为与原始声音相似。该性能与考虑各种听觉统计类别的麦克德莫特 - 西蒙切利模型产生的合成声音相当。结果支持了自然声音纹理的感知可由两阶段光谱信号预测的观点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff32/9950610/4b790245e86c/10.1177_20416695231157349-fig1.jpg

相似文献

A two-stage spectral model for sound texture perception: Synthesis and psychophysics.

Iperception. 2023 Feb 22;14(1):20416695231157349. doi: 10.1177/20416695231157349. eCollection 2023 Jan-Feb.

Cascaded Amplitude Modulations in Sound Texture Perception.

Front Neurosci. 2017 Sep 11;11:485. doi: 10.3389/fnins.2017.00485. eCollection 2017.

Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis.

Neuron. 2011 Sep 8;71(5):926-40. doi: 10.1016/j.neuron.2011.06.032.

Texture synthesis and perception: using computational models to study texture representations in the human visual system.

Vision Res. 2006 Feb;46(3):299-309. doi: 10.1016/j.visres.2005.04.013. Epub 2005 Jun 16.

Identification and Discrimination of Sound Textures in Hearing-Impaired and Older Listeners.

Trends Hear. 2021 Jan-Dec;25:23312165211065608. doi: 10.1177/23312165211065608.

Auditory perception of self-similarity in water sounds.

Front Integr Neurosci. 2011 May 11;5:15. doi: 10.3389/fnint.2011.00015. eCollection 2011.

Distinct neural ensemble response statistics are associated with recognition and discrimination of natural sound textures.

Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31482-31493. doi: 10.1073/pnas.2005644117. Epub 2020 Nov 20.

[Neural Representation of Sound Texture in the Auditory Cortex].

Brain Nerve. 2015 Jun;67(6):679-90. doi: 10.11477/mf.1416200202.

Invariant texture perception is harder with synthetic textures: Implications for models of texture processing.

Vision Res. 2015 Oct;115(Pt B):271-9. doi: 10.1016/j.visres.2015.01.022. Epub 2015 Feb 7.

Exploring the distribution of statistical feature parameters for natural sound textures.

PLoS One. 2021 Jun 23;16(6):e0238960. doi: 10.1371/journal.pone.0238960. eCollection 2021.

引用本文的文献

Spectral-temporal processing of naturalistic sounds in monkeys and humans.

J Neurophysiol. 2024 Jan 1;131(1):38-63. doi: 10.1152/jn.00129.2023. Epub 2023 Nov 15.

本文引用的文献

Textures vs Non-Textures: A Simple Computational Method for Classifying Perceived 'Texturality' in Natural Images.

Iperception. 2021 Oct 28;12(5):20416695211054540. doi: 10.1177/20416695211054540. eCollection 2021 Sep-Oct.

Neural responses to natural and model-matched stimuli reveal distinct computations in primary and nonprimary auditory cortex.

PLoS Biol. 2018 Dec 3;16(12):e2005127. doi: 10.1371/journal.pbio.2005127. eCollection 2018 Dec.

Adaptive and Selective Time Averaging of Auditory Scenes.

Curr Biol. 2018 May 7;28(9):1405-1418.e10. doi: 10.1016/j.cub.2018.03.049. Epub 2018 Apr 19.

Selectivity and tolerance for visual texture in macaque V2.

Proc Natl Acad Sci U S A. 2016 May 31;113(22):E3140-9. doi: 10.1073/pnas.1510847113. Epub 2016 May 12.

Deep learning.

Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.

Invariant scattering convolution networks.

IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1872-86. doi: 10.1109/TPAMI.2012.230.

Summary statistics in auditory perception.

Nat Neurosci. 2013 Apr;16(4):493-8. doi: 10.1038/nn.3347. Epub 2013 Feb 24.

Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis.

Neuron. 2011 Sep 8;71(5):926-40. doi: 10.1016/j.neuron.2011.06.032.

Metamers of the ventral stream.

Nat Neurosci. 2011 Aug 14;14(9):1195-201. doi: 10.1038/nn.2889.

Orthogonal representation of sound dimensions in the primate midbrain.

Nat Neurosci. 2011 Apr;14(4):423-5. doi: 10.1038/nn.2771. Epub 2011 Mar 6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于声音纹理感知的两阶段频谱模型：合成与心理物理学。

A two-stage spectral model for sound texture perception: Synthesis and psychophysics.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献