Representations of fricatives in subcortical model responses: Comparisons with human consonant perception.

Affiliations

Department of Biomedical Engineering, University of Rochester, Rochester, New York 14627, USA.

Department of Electrical and Computer Engineering, University of Rochester, Rochester, New York 14627, USA.

Publication information

J Acoust Soc Am. 2023 Aug 1;154(2):602-618. doi: 10.1121/10.0020536.

DOI:10.1121/10.0020536

PMID:37535429

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10550336/

Abstract

Fricatives are obstruent sound contrasts made by airflow constrictions in the vocal tract that produce turbulence across the constriction or at a site downstream from the constriction. Fricatives exhibit significant intra/intersubject and contextual variability. Yet, fricatives are perceived with high accuracy. The current study investigated modeled neural responses to fricatives in the auditory nerve (AN) and inferior colliculus (IC) with the hypothesis that response profiles across populations of neurons provide robust correlates to consonant perception. Stimuli were 270 intervocalic fricatives (10 speakers × 9 fricatives × 3 utterances). Computational model response profiles had characteristic frequencies that were log-spaced from 125 Hz to 8 or 20 kHz to explore the impact of high-frequency responses. Confusion matrices generated by k-nearest-neighbor subspace classifiers were based on the profiles of average rates across characteristic frequencies as feature vectors. Model confusion matrices were compared with published behavioral data. The modeled AN and IC neural responses provided better predictions of behavioral accuracy than the stimulus spectra, and IC showed better accuracy than AN. Behavioral fricative accuracy was explained by modeled neural response profiles, whereas confusions were only partially explained. Extended frequencies improved accuracy based on the model IC, corroborating the importance of extended high frequencies in speech perception.
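The classification pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the rate profiles here are synthetic stand-ins for model AN/IC responses, and the channel count (`N_CF`), the placeholder labels `F1`..`F9`, the value of `k`, the subspace sizes, and the train/test split are all assumptions chosen for illustration. Only the 125 Hz lower edge, the 8 or 20 kHz upper edges, and the 10 × 9 × 3 stimulus design come from the abstract.

```python
import math
import random
from collections import Counter

N_CF = 30  # assumed number of model channels (not stated in the abstract)
CLASSES = [f"F{i + 1}" for i in range(9)]  # placeholder labels for 9 fricatives


def log_spaced_cfs(f_lo, f_hi, n):
    """Return n characteristic frequencies (Hz), log-spaced from f_lo to f_hi."""
    ratio = (f_hi / f_lo) ** (1.0 / (n - 1))
    return [f_lo * ratio ** i for i in range(n)]


def synthetic_profile(class_idx, rng):
    """Toy average-rate profile: a class-specific bump plus Gaussian noise."""
    centre = 3 + 3 * class_idx
    return [50 + 100 * math.exp(-((ch - centre) / 3) ** 2) + rng.gauss(0, 10)
            for ch in range(N_CF)]


def knn_predict(train_x, train_y, x, dims, k):
    """Majority vote of the k nearest training profiles, using only channels in dims."""
    dists = sorted(
        (sum((row[d] - x[d]) ** 2 for d in dims), label)
        for row, label in zip(train_x, train_y)
    )
    return Counter(label for _, label in dists[:k]).most_common(1)[0][0]


def subspace_knn_predict(train_x, train_y, x, subspaces, k):
    """Random-subspace ensemble: one k-NN vote per channel subset."""
    votes = Counter(knn_predict(train_x, train_y, x, dims, k) for dims in subspaces)
    return votes.most_common(1)[0][0]


def confusion_matrix(true_labels, predicted_labels, classes):
    """Rows are true classes, columns are predicted classes."""
    index = {c: i for i, c in enumerate(classes)}
    matrix = [[0] * len(classes) for _ in classes]
    for t, p in zip(true_labels, predicted_labels):
        matrix[index[t]][index[p]] += 1
    return matrix


def run_demo(seed=0, k=3, n_subspaces=8, subspace_size=10):
    rng = random.Random(seed)
    train_x, train_y, test_x, test_y = [], [], [], []
    for c, label in enumerate(CLASSES):
        for _speaker in range(10):          # 10 speakers
            for utterance in range(3):      # 3 utterances each
                profile = synthetic_profile(c, rng)
                if utterance < 2:           # assumed split: hold out utterance 3
                    train_x.append(profile)
                    train_y.append(label)
                else:
                    test_x.append(profile)
                    test_y.append(label)
    # Each ensemble member sees a random subset of CF channels.
    subspaces = [rng.sample(range(N_CF), subspace_size) for _ in range(n_subspaces)]
    predicted = [subspace_knn_predict(train_x, train_y, x, subspaces, k)
                 for x in test_x]
    matrix = confusion_matrix(test_y, predicted, CLASSES)
    accuracy = sum(matrix[i][i] for i in range(len(CLASSES))) / len(test_y)
    return matrix, accuracy


# The two CF ranges compared in the study: standard and extended high frequency.
cfs_8k = log_spaced_cfs(125.0, 8000.0, N_CF)
cfs_20k = log_spaced_cfs(125.0, 20000.0, N_CF)

matrix, accuracy = run_demo()
print(f"held-out accuracy on synthetic profiles: {accuracy:.2f}")
for row in matrix:
    print(row)
```

In a real analysis, each feature vector would be the average-rate profile of model AN or IC responses across the chosen CF grid, and the resulting confusion matrix would be compared with behavioral confusions rather than inspected on its own.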
