Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA 15213, USA.
Department of Neurobiology, University of Pittsburgh, Pittsburgh, PA 15213, USA.
Nat Commun. 2019 Mar 21;10(1):1302. doi: 10.1038/s41467-019-09115-y.
Humans and vocal animals use vocalizations to communicate with members of their species. A necessary function of auditory perception is to generalize across the high variability inherent in vocalization production and classify vocalizations into behaviorally distinct categories ('words' or 'call types'). Here, we demonstrate that detecting mid-level features in calls achieves production-invariant classification. Starting from randomly chosen marmoset call features, we use a greedy search algorithm to determine the most informative and least redundant features necessary for call classification. High classification performance is achieved using only 10-20 features per call type. Predictions of the tuning properties of putative feature-selective neurons accurately match some observed auditory cortical responses. This feature-based approach also succeeds for call categorization in other species, and for other complex classification tasks such as caller identification. Our results suggest that high-level neural representations of sounds are based on task-dependent features optimized for specific computational goals.
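To illustrate the kind of greedy search described in the abstract, the sketch below implements a generic relevance-minus-redundancy (mRMR-style) selection loop: at each step it adds the candidate feature most informative about the call-type label while penalizing redundancy with already-selected features. This is a minimal illustration only, not the authors' actual algorithm or feature set; the synthetic data, the function `greedy_feature_selection`, and the discretization choices are assumptions made for the example.

```python
import numpy as np
from sklearn.feature_selection import mutual_info_classif
from sklearn.metrics import mutual_info_score


def greedy_feature_selection(X, y, n_features=15):
    """Greedily select features informative about the label (relevance)
    while penalizing overlap with already-selected features (redundancy)."""

    def discretize(col, n_bins=8):
        # Bin a continuous feature so mutual information can be estimated.
        edges = np.quantile(col, np.linspace(0, 1, n_bins)[1:-1])
        return np.digitize(col, edges)

    # Relevance: mutual information between each candidate feature and the label.
    relevance = mutual_info_classif(X, y, random_state=0)

    selected = [int(np.argmax(relevance))]          # seed with the most informative feature
    remaining = [j for j in range(X.shape[1]) if j not in selected]

    while len(selected) < n_features and remaining:
        best_score, best_idx = -np.inf, None
        for j in remaining:
            # Redundancy: mean mutual information with the features already chosen.
            redundancy = np.mean([
                mutual_info_score(discretize(X[:, j]), discretize(X[:, s]))
                for s in selected
            ])
            score = relevance[j] - redundancy       # mRMR-style difference criterion
            if score > best_score:
                best_score, best_idx = score, j
        selected.append(best_idx)
        remaining.remove(best_idx)
    return selected


# Synthetic stand-in for spectrotemporal call features (200 calls x 50 candidates),
# with a binary label: target call type vs. all others.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 50))
y = rng.integers(0, 2, size=200)
print(greedy_feature_selection(X, y, n_features=10))
```

In this formulation, stopping after 10-20 selected features per call type mirrors the small feature budget reported in the abstract; the scoring criterion and binning scheme here are generic choices, not those of the paper.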