Optimized feature gains explain and predict successes and failures of human selective listening.

Author information

Griffith Ian M, Hess R Preston, McDermott Josh H

Affiliations

Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, USA.

McGovern Institute for Brain Research, MIT, Cambridge, MA, USA.

Publication information

bioRxiv. 2025 May 28:2025.05.28.656682. doi: 10.1101/2025.05.28.656682.

DOI:10.1101/2025.05.28.656682

PMID:40501687

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12154610/

Abstract

Attention facilitates communication by enabling selective listening to sound sources of interest. However, little is known about why attentional selection succeeds in some conditions but fails in others. While neurophysiology implicates multiplicative feature gains in selective attention, it is unclear whether such gains can explain real-world attention-driven behavior. To investigate these issues, we optimized an artificial neural network with stimulus-computable, feature-based gains to recognize a cued talker's speech from binaural audio in "cocktail party" scenarios. Though not trained to mimic humans, the model matched human performance across diverse real-world conditions, exhibiting selection based on both voice qualities and spatial location. It also predicted novel attentional effects that we confirmed in human experiments, and exhibited signatures of "late selection" like those seen in human auditory cortex. The results suggest that human-like attentional strategies naturally arise from optimization of feature gains for selective listening, offering a normative account of the mechanisms, and the limitations, of auditory attention.
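
The abstract does not say how the gains are computed or where in the network they act. As a rough illustration of what a multiplicative, feature-based gain means in general, the Python sketch below scales each feature channel of a mixture by a gain derived from a cue. Everything in it is an assumption made for illustration and is not taken from the paper: the function names `cue_to_gains` and `apply_gains`, the sigmoid mapping, and the `sharpness` parameter are all hypothetical.

```python
import math


def cue_to_gains(cue_features, sharpness=1.0):
    """Map per-channel cue activations to multiplicative gains in (0, 2).

    Hypothetical mapping, assumed for illustration only: channels that are
    more active than the cue's mean get a gain above 1, less active
    channels get a gain below 1.
    """
    mean = sum(cue_features) / len(cue_features)
    return [2.0 / (1.0 + math.exp(-sharpness * (c - mean)))
            for c in cue_features]


def apply_gains(mixture_features, gains):
    """Scale every channel of every time frame by that channel's gain.

    mixture_features is a list of frames; each frame is a list with one
    activation per channel.
    """
    return [[g * x for g, x in zip(gains, frame)]
            for frame in mixture_features]


# Toy cue: the cued talker mostly drives channel 0.
gains = cue_to_gains([1.0, 0.0, 0.0])

# Toy mixture: two time frames with equal activation in all channels.
attended = apply_gains([[1.0, 1.0, 1.0], [2.0, 2.0, 2.0]], gains)
```

In this toy case channel 0 is amplified and the other two channels are attenuated, so features that resemble the cue dominate whatever is computed downstream. In a trained model the gains would presumably be learned jointly with the rest of the network rather than fixed by a hand-written formula like this one.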
