Bandyopadhyay Sharba, Young Eric D
Center for Hearing Sciences and Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21205, USA.
J Neurosci. 2004 Jan 14;24(2):531-41. doi: 10.1523/JNEUROSCI.4234-03.2004.
Previous studies of the neural representation of speech assumed some form of neural code, usually discharge rate or phase locking, for the representation. In the present study, responses to five synthesized CVC_CV (e.g., /dad_da/) utterances have been examined using information-theoretic distance measures [or Kullback-Leibler (KL) distance] that are independent of a priori assumptions about the neural code. The consonants in the stimuli fall along a continuum from /b/ to /d/ and include both formant-frequency (F1, F2, and F3) transitions and onset (release) bursts. Differences in responses to pairs of stimuli, based on single-fiber auditory nerve responses at 70 and 50 dB sound pressure level, have been quantified, based on KL and KL-like distances, to show how each portion of the response contributes to information coding and the fidelity of the encoding. Distances were large at best frequencies, in which the formants differ but were largest for fibers encoding the high-frequency release bursts. Distances computed at differing time resolutions show significant information in the temporal pattern of spiking, beyond that encoded by rate, at time resolutions from 1-40 msec. Single-fiber just noticeable differences (JNDs) for F2 and F3 were computed from the data. These results show that F2 is coded with greater fidelity than F3, even among fibers tuned to F3, and that JNDs are larger in the syllable final consonant than in the releases.
以往关于语音神经表征的研究假定了某种形式的神经编码,通常是放电率或锁相,用于表征。在本研究中,使用信息论距离度量(或库尔贝克-莱布勒(KL)距离)来检验对五种合成的CVC_CV(例如,/dad_da/)话语的反应,这些度量独立于关于神经编码的先验假设。刺激中的辅音沿着从/b/到/d/的连续体分布,并且包括共振峰频率(F1、F2和F3)转换以及起始(释放)突发音。基于70和50分贝声压级下的单纤维听神经反应,已根据KL和类KL距离对成对刺激的反应差异进行了量化,以显示反应的每个部分如何对信息编码和编码保真度做出贡献。在最佳频率下距离较大,其中共振峰不同,但对于编码高频释放突发音的纤维距离最大。在不同时间分辨率下计算的距离显示,在1至40毫秒的时间分辨率下,放电的时间模式中存在显著信息,超出了由放电率编码的信息。从数据中计算出F2和F3的单纤维恰可分辨差异(JND)。这些结果表明,即使在调谐到F3的纤维中,F2的编码保真度也高于F3,并且音节末尾辅音中的JND比释放音中的更大。