Aix Marseille Univ, CNRS, UMR7061 PRISM, UMR7020 LIS, Marseille, France.
Institute of Language, Communication, and the Brain (ILCB), Marseille, France.
Commun Biol. 2023 Jun 24;6(1):671. doi: 10.1038/s42003-023-05040-5.
The human auditory system is designed to capture and encode sounds from our surroundings and conspecifics. However, the precise mechanisms by which it adaptively extracts the most important spectro-temporal information from sounds are still not fully understood. Previous auditory models have explained sound encoding at the cochlear level using static filter banks, but this vision is incompatible with the nonlinear and adaptive properties of the auditory system. Here we propose an approach that considers the cochlear processes as envelope interpolations inspired by cochlear physiology. It unifies linear and nonlinear adaptive behaviors into a single comprehensive framework that provides a data-driven understanding of auditory coding. It allows simulating a broad range of psychophysical phenomena from virtual pitches and combination tones to consonance and dissonance of harmonic sounds. It further predicts the properties of the cochlear filters such as frequency selectivity. Here we propose a possible link between the parameters of the model and the density of hair cells on the basilar membrane. Cascaded Envelope Interpolation may lead to improvements in sound processing for hearing aids by providing a non-linear, data-driven, way to preprocessing of acoustic signals consistent with peripheral processes.
人类听觉系统旨在捕捉和编码来自周围环境和同种生物的声音。然而,其自适应地从声音中提取最重要的谱时信息的确切机制仍不完全清楚。以前的听觉模型使用静态滤波器组来解释耳蜗水平的声音编码,但这种观点与听觉系统的非线性和适应性特性不兼容。在这里,我们提出了一种方法,该方法将耳蜗过程视为受耳蜗生理学启发的包络插值。它将线性和非线性自适应行为统一到一个单一的综合框架中,为听觉编码提供了数据驱动的理解。它允许模拟从虚拟音高和组合音到谐波声音的和谐和不和谐等广泛的心理物理现象。它进一步预测了耳蜗滤波器的特性,如频率选择性。在这里,我们提出了模型参数和基底膜上毛细胞密度之间的可能联系。级联包络插值可以通过提供与外围过程一致的非线性、数据驱动的声信号预处理方法,为助听器的声音处理带来改进。