Klinke R, Hartmann R
Arch Otorhinolaryngol. 1979;223(1):77-137. doi: 10.1007/BF00455077.
For the attempt to develop a cochlear prosthesis, which allows some understanding of speech, it seems--at least for the first attempts--to be appropriate to mimic natural conditions as far as possible. The auditory nerve contains about 30,000 afferent fibres. Qualitatively, their behavior is similar but quantitative measures show considerably differences (2.3). Nothing certain can be said at present however about the spiral fibres originating from the outer hair cells. The quantitative differences between single afferents concern tuning, frequency selectivity, thresholds, intensity functions and--of particular interest for electrical stimulation--differences in timing of the activity pattern, brought about by differences in travelling time along the cochlear duct (2.3). The time differences seen in the activity pattern of different fibres are in the order of several ms (2.3.6;2.3.7). Actionpotentials elicited by natural acoustic stimuli show probabilistic behavior, that is they are not strictly determined. It is obvious that with artificial electrical stimulation not every surviving single fibre can be selectively stimulated. An electrode will always stimulate a group of fibres simultaneously. With any conceivable electrical stimulation all fibres in the suprathreshold region of the electrode will be synchronously activated (3.2); a fundamental difference to the natural situation. To estimate the number of channels, necessary to stimulate the auditory nerve with sufficient accuracy to allow speech perception we consider some psychoacoustic data. These have shown that the auditory system possesses the ability to differentiate a great number of different pitches, but on the other hand it is capable of integrating different frequency areas to a so called critical bandwidth. Sound energy falling into one critical bandwidth is integrated to a uniform auditory sensation. If one is to integrate various fibres of the auditory nerve to one channel of stimulation it seems to be adequate to use the critical bandwidth as a measure (3.1). According to this criterion 15 channels would have to be introduced into the speech region of the cochlea. This would allow 1.2 mm of cochlear length for each channel. Perfect electrical separation of the channels is required. Considering the severe distortions in neuronal activity pattern, introduced by electrical stimulation in comparison to the natural conditions it is not clear even whether the number given would be sufficient. On the other hand, current spreading would appear to prohibit any higher electrode density. As far as coding of sound parameters within one channel is concerned it is proposed that full use should be made of frequency analysis according to the place principle. In respect to coding of periodicity and loudness it is proposed to approach natural conditions as far as possible (3.3). Here delay times between the individual channels and a probabilistic character of the stimuli should be introduced to avoid dominance of periodicity pitch...
对于开发一种能让人听懂一些言语的人工耳蜗的尝试而言,至少在最初的尝试中,尽可能模仿自然条件似乎是合适的。听神经包含约30,000条传入纤维。从定性角度看,它们的行为相似,但定量测量显示出相当大的差异(2.3)。然而,目前对于源自外毛细胞的螺旋纤维还无法确定任何确切情况。单个传入纤维之间的定量差异涉及调谐、频率选择性、阈值、强度函数,以及——对于电刺激特别重要的——活动模式的时间差异,这种差异是由沿耳蜗管传播时间的不同造成的(2.3)。不同纤维活动模式中观察到的时间差异在几毫秒的量级(2.3.6;2.3.7)。由自然声刺激引发的动作电位表现出概率性行为,也就是说它们不是严格确定的。显然,在人工电刺激下,并非每一根存活的单纤维都能被选择性地刺激。一个电极总是会同时刺激一组纤维。在任何可想象的电刺激下,电极阈上区域内的所有纤维都会被同步激活(3.2);这与自然情况有根本区别。为了估计以足够精度刺激听神经以实现言语感知所需的通道数量,我们考虑一些心理声学数据。这些数据表明,听觉系统有能力区分大量不同的音调,但另一方面,它也能够将不同的频率区域整合到一个所谓的临界带宽中。落入一个临界带宽的声能会被整合为一种统一的听觉感受。如果要将听神经的各种纤维整合到一个刺激通道中,使用临界带宽作为一种度量似乎是合适的(3.1)。根据这个标准,必须在耳蜗的言语区域引入15个通道。这将为每个通道留出1.2毫米的耳蜗长度。需要通道之间实现完美的电隔离。考虑到与自然条件相比,电刺激会在神经元活动模式中引入严重的失真,甚至不清楚所给出的数量是否足够。另一方面,电流扩散似乎会限制更高的电极密度。就一个通道内声音参数的编码而言,建议充分利用基于位置原理的频率分析。在周期性和响度的编码方面,建议尽可能接近自然条件(3.3)。这里应该引入各个通道之间的延迟时间以及刺激的概率性特征,以避免周期性音调占主导……