Spectrotemporal modulation sensitivity as a predictor of speech intelligibility for hearing-impaired listeners.

Authors

Bernstein Joshua G W, Mehraei Golbarg, Shamma Shihab, Gallun Frederick J, Theodoroff Sarah M, Leek Marjorie R

Affiliation

Audiology and Speech Center, Scientific and Clinical Studies Section, Walter Reed National Military Medical Center, Bethesda, MD, USA.

Publication information

J Am Acad Audiol. 2013 Apr;24(4):293-306. doi: 10.3766/jaaa.24.4.5.

PMID: 23636210

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3973426/

Abstract

BACKGROUND

A model that can accurately predict speech intelligibility for a given hearing-impaired (HI) listener would be an important tool for hearing-aid fitting or hearing-aid algorithm development. Existing speech-intelligibility models do not incorporate variability in suprathreshold deficits that are not well predicted by classical audiometric measures. One possible approach to the incorporation of such deficits is to base intelligibility predictions on sensitivity to simultaneously spectrally and temporally modulated signals.

PURPOSE

The likelihood of success of this approach was evaluated by comparing estimates of spectrotemporal modulation (STM) sensitivity to speech intelligibility and to psychoacoustic estimates of frequency selectivity and temporal fine-structure (TFS) sensitivity across a group of HI listeners.

RESEARCH DESIGN

The minimum modulation depth required to detect STM applied to an 86 dB SPL four-octave noise carrier was measured for combinations of temporal modulation rate (4, 12, or 32 Hz) and spectral modulation density (0.5, 1, 2, or 4 cycles/octave). STM sensitivity estimates for individual HI listeners were compared to estimates of frequency selectivity (measured using the notched-noise method at 500, 1000, 2000, and 4000 Hz), TFS processing ability (2 Hz frequency-modulation detection thresholds for 500, 1000, 2000, and 4000 Hz carriers) and sentence intelligibility in noise (at a 0 dB signal-to-noise ratio) that were measured for the same listeners in a separate study.
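As a rough illustration of the stimulus described above, the sketch below builds a spectrotemporally modulated signal for the 3 x 4 grid of rates and densities. It assumes the common "moving ripple" form, in which the envelope of each carrier component varies sinusoidally in time and in log frequency. The random-phase tone-complex carrier, the 400 Hz lower edge, the sample rate, the dB convention for modulation depth, and all function names are illustrative assumptions and are not details taken from the study.

```python
import math
import random
from itertools import product

# Conditions stated in the abstract.
RATES_HZ = (4, 12, 32)            # temporal modulation rates
DENSITIES_CPO = (0.5, 1, 2, 4)    # spectral densities, cycles/octave
CONDITIONS = list(product(RATES_HZ, DENSITIES_CPO))


def stm_envelope(t, x_oct, rate_hz, density_cpo, depth, phase=0.0):
    """Ripple envelope at time t (s) and position x_oct (octaves above
    the lower carrier edge). depth is the linear modulation depth, 0..1."""
    arg = 2.0 * math.pi * (rate_hz * t + density_cpo * x_oct) + phase
    return 1.0 + depth * math.sin(arg)


def depth_to_db(depth):
    """Modulation depth in dB re full modulation (assumed convention)."""
    return 20.0 * math.log10(depth)


def stm_waveform(rate_hz, density_cpo, depth, dur_s=0.05, fs=16000,
                 f_low=400.0, n_octaves=4, tones_per_octave=20, seed=0):
    """Sum of log-spaced, random-phase tones spanning n_octaves, each
    scaled by the ripple envelope. The carrier parameters are hypothetical."""
    rng = random.Random(seed)
    n_tones = n_octaves * tones_per_octave
    positions = [i / tones_per_octave for i in range(n_tones)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in positions]
    samples = []
    for n in range(round(dur_s * fs)):
        t = n / fs
        total = 0.0
        for x, ph in zip(positions, phases):
            freq = f_low * 2.0 ** x
            env = stm_envelope(t, x, rate_hz, density_cpo, depth)
            total += env * math.sin(2.0 * math.pi * freq * t + ph)
        samples.append(total / n_tones)
    return samples
```

An adaptive detection task would lower `depth` until the modulated signal can no longer be told apart from the unmodulated carrier (`depth = 0`). The tracking procedure itself is not sketched here.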

STUDY SAMPLE

Eight normal-hearing (NH) listeners and 12 listeners with a diagnosis of bilateral sensorineural hearing loss participated.

DATA COLLECTION AND ANALYSIS

STM sensitivity was compared between NH and HI listener groups using a repeated-measures analysis of variance. A stepwise regression analysis compared STM sensitivity for individual HI listeners to audiometric thresholds, age, and measures of frequency selectivity and TFS processing ability. A second stepwise regression analysis compared speech intelligibility to STM sensitivity and the audiogram-based Speech Intelligibility Index.

RESULTS

STM detection thresholds were elevated for the HI listeners, but only for low rates and high densities. STM sensitivity for individual HI listeners was well predicted by a combination of estimates of frequency selectivity at 4000 Hz and TFS sensitivity at 500 Hz but was unrelated to audiometric thresholds. STM sensitivity accounted for an additional 40% of the variance in speech intelligibility beyond the 40% accounted for by the audibility-based Speech Intelligibility Index.
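The "additional variance" statement above can be read as an increment in R² when a second predictor joins the regression. The sketch below computes that increment for a two-predictor model using the standard closed form based on pairwise correlations. It is a simplified stand-in for the stepwise procedure named in the abstract, not the authors' analysis, and the four data points are toy values constructed so that each predictor explains 40% of the variance. They are not study data, and all variable names are hypothetical.

```python
import math


def pearson_r(a, b):
    """Pearson correlation between two equal-length sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def r2_one(y, x):
    """R^2 of a simple linear regression of y on x."""
    return pearson_r(y, x) ** 2


def r2_two(y, x1, x2):
    """R^2 of a linear regression of y on x1 and x2 (closed form)."""
    ry1 = pearson_r(y, x1)
    ry2 = pearson_r(y, x2)
    r12 = pearson_r(x1, x2)
    return (ry1 ** 2 + ry2 ** 2 - 2.0 * ry1 * ry2 * r12) / (1.0 - r12 ** 2)


# Toy, mutually orthogonal predictors (NOT data from the study).
sii = [1.0, -1.0, 1.0, -1.0]    # stands in for the audibility-based index
stm = [1.0, 1.0, -1.0, -1.0]    # stands in for STM sensitivity
resid = [1.0, -1.0, -1.0, 1.0]  # unexplained component
scale = math.sqrt(0.5)          # chosen so each predictor explains 40%
speech = [a + b + scale * e for a, b, e in zip(sii, stm, resid)]

r2_sii = r2_one(speech, sii)          # about 0.40
r2_full = r2_two(speech, sii, stm)    # about 0.80
delta_r2 = r2_full - r2_sii           # about 0.40 added by STM
```

With correlated predictors the increment depends on the order of entry, which is why the abstract specifies that the STM contribution is measured beyond the variance already accounted for by the Speech Intelligibility Index.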

CONCLUSIONS

Impaired STM sensitivity likely results from a combination of a reduced ability to resolve spectral peaks and a reduced ability to use TFS information to follow spectral-peak movements. Combining STM sensitivity estimates with audiometric threshold measures for individual HI listeners provided a more accurate prediction of speech intelligibility than audiometric measures alone. These results suggest a significant likelihood of success for an STM-based model of speech intelligibility for HI listeners.
