Bittar Alexandre, Garner Philip N
Idiap Research Institute, Audio Inference, Martigny, Switzerland.
École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
Front Neurosci. 2024 Sep 25;18:1449181. doi: 10.3389/fnins.2024.1449181. eCollection 2024.
Understanding cognitive processes in the brain demands sophisticated models capable of replicating neural dynamics at large scales. We present a physiologically inspired speech recognition architecture, compatible and scalable with deep learning frameworks, and demonstrate that end-to-end gradient descent training leads to the emergence of neural oscillations in the central spiking neural network. Significant cross-frequency couplings, indicative of these oscillations, are measured within and across network layers during speech processing, whereas no such interactions are observed when handling background noise inputs. Furthermore, our findings highlight the crucial inhibitory role of feedback mechanisms, such as spike frequency adaptation and recurrent connections, in regulating and synchronizing neural activity to improve recognition performance. Overall, on top of developing our understanding of synchronization phenomena notably observed in the human auditory pathway, our architecture exhibits dynamic and efficient information processing, with relevance to neuromorphic technology.
理解大脑中的认知过程需要能够在大规模上复制神经动力学的复杂模型。我们提出了一种受生理启发的语音识别架构,它与深度学习框架兼容且可扩展,并证明端到端梯度下降训练会导致中央脉冲神经网络中出现神经振荡。在语音处理过程中,在网络层内和跨网络层测量到了指示这些振荡的显著交叉频率耦合,而在处理背景噪声输入时未观察到此类相互作用。此外,我们的研究结果强调了反馈机制(如脉冲频率适应和递归连接)在调节和同步神经活动以提高识别性能方面的关键抑制作用。总体而言,除了增进我们对在人类听觉通路中显著观察到的同步现象的理解之外,我们的架构还展现出动态且高效的信息处理能力,与神经形态技术相关。