基于超维脑启发学习的大规模下丘神经元活动的语音识别

Hyperdimensional Brain-Inspired Learning for Phoneme Recognition With Large-Scale Inferior Colliculus Neural Activities.

出版信息

IEEE Trans Biomed Eng. 2024 Nov;71(11):3098-3110. doi: 10.1109/TBME.2024.3408279. Epub 2024 Oct 25.

DOI:10.1109/TBME.2024.3408279

Abstract

OBJECTIVE

Develop a novel and highly efficient framework that decodes Inferior Colliculus (IC) neural activities for phoneme recognition.

METHODS

We propose using Hyperdimensional Computing (HDC) to support an efficient phoneme recognition algorithm, in contrast to widely applied Deep Neural Networks (DNN). The high-dimensional representation and operations in HDC are rooted in human brain functionalities and naturally parallelizable, showing the potential for efficient neural activity analysis. Our proposed method includes a spatial and temporal-aware HDC encoder that effectively captures global and local patterns. As part of our framework, we deploy the lightweight HDC-based algorithm on a highly customizable and flexible hardware platform, i.e., Field Programmable Gate Arrays (FPGA), for optimal algorithm speedup. To evaluate our method, we record IC neural activities on gerbils while playing the sound of different phonemes.

RESULTS

We compare our proposed method with multiple baseline machine learning algorithms in recognition quality and learning efficiency, across different hardware platforms. The results show that our method generally achieves better classification quality than the best-performing baseline. Compared to the Deep Residual Neural Network (i.e., ResNet), our method shows a speedup up to 74×, 67×, 210× on CPU, GPU, and FPGA respectively. We achieve up to 15% (10%) higher accuracy in consonant (vowel) classification than ResNet.

CONCLUSION

By leveraging brain-inspired HDC for IC neural activity encoding and phoneme classification, we achieve orders of magnitude runtime speedup while improving accuracy in various challenging task settings.

SIGNIFICANCE

Decoding IC neural activities is an important step to enhance understanding about human auditory system. However, these responses from the central auditory system are noisy and contain high variance, demanding large-scale datasets and iterative model fine-tuning. The proposed HDC-based framework is more scalable and viable for future real-world deployment thanks to its fast training and overall better quality.

摘要

目的

开发一种新颖且高效的框架，用于对下丘（IC）神经活动进行解码以实现音素识别。

方法

我们提出使用超维计算（HDC）来支持高效的音素识别算法，与广泛应用的深度神经网络（DNN）形成对比。HDC 中的高维表示和操作基于人类大脑功能，并且自然地可并行化，显示出有效分析神经活动的潜力。我们提出的方法包括一个具有空间和时间意识的 HDC 编码器，可有效地捕获全局和局部模式。作为我们框架的一部分，我们将基于轻量级 HDC 的算法部署在高度可定制和灵活的硬件平台（即现场可编程门阵列（FPGA））上，以实现最佳的算法加速。为了评估我们的方法，我们在播放不同音素的声音时，在沙鼠上记录 IC 神经活动。

结果

我们在不同的硬件平台上，通过识别质量和学习效率，将我们提出的方法与多种基线机器学习算法进行了比较。结果表明，我们的方法通常比表现最好的基线实现了更好的分类质量。与深度残差神经网络（即 ResNet）相比，我们的方法在 CPU、GPU 和 FPGA 上的速度分别提高了 74 倍、67 倍和 210 倍。在辅音（元音）分类方面，我们的准确率比 ResNet 提高了 15%（10%）。

结论

通过利用大脑启发的 HDC 对 IC 神经活动进行编码和音素分类，我们在提高准确性的同时实现了数量级的运行时加速，在各种具有挑战性的任务设置中都有出色的表现。

意义

对 IC 神经活动进行解码是增强对人类听觉系统理解的重要步骤。然而，这些来自中枢听觉系统的反应是嘈杂的，并且包含很高的方差，需要大规模数据集和迭代模型的微调。由于其快速的训练和整体更高的质量，所提出的基于 HDC 的框架更具可扩展性和适用于未来的实际部署。

相似文献

Hyperdimensional Brain-Inspired Learning for Phoneme Recognition With Large-Scale Inferior Colliculus Neural Activities.基于超维脑启发学习的大规模下丘神经元活动的语音识别

IEEE Trans Biomed Eng. 2024 Nov;71(11):3098-3110. doi: 10.1109/TBME.2024.3408279. Epub 2024 Oct 25.

Hyperdimensional computing with holographic and adaptive encoder.采用全息与自适应编码器的超维计算

Front Artif Intell. 2024 Apr 9;7:1371988. doi: 10.3389/frai.2024.1371988. eCollection 2024.

An encoding framework for binarized images using hyperdimensional computing.一种使用超维计算的二值化图像编码框架。

Front Big Data. 2024 Jun 14;7:1371518. doi: 10.3389/fdata.2024.1371518. eCollection 2024.

HDBind: encoding of molecular structure with hyperdimensional binary representations.HDBind：采用超维二进制表示法对分子结构进行编码。

Sci Rep. 2024 Nov 23;14(1):29025. doi: 10.1038/s41598-024-80009-w.

Supervised Contrastive Learning Framework and Hardware Implementation of Learned ResNet for Real-Time Respiratory Sound Classification.用于实时呼吸音分类的监督对比学习框架及学习型ResNet的硬件实现

IEEE Trans Biomed Circuits Syst. 2025 Feb;19(1):185-195. doi: 10.1109/TBCAS.2024.3409584. Epub 2025 Feb 11.

Enhanced Noise-Resilient Pressure Mat System Based on Hyperdimensional Computing.基于超高维度计算的抗噪压力垫系统。

Sensors (Basel). 2024 Feb 4;24(3):1014. doi: 10.3390/s24031014.

Energy-Efficient Sleep Apnea Detection Using a Hyperdimensional Computing Framework Based on Wearable Bracelet Photoplethysmography.基于可穿戴手环光电容积脉搏波描记术的超维计算框架实现的节能型睡眠呼吸暂停检测

IEEE Trans Biomed Eng. 2024 Aug;71(8):2483-2494. doi: 10.1109/TBME.2024.3377270. Epub 2024 Jul 18.

Memory-inspired spiking hyperdimensional network for robust online learning.受记忆启发的尖峰超维网络，用于稳健的在线学习。

Sci Rep. 2022 May 10;12(1):7641. doi: 10.1038/s41598-022-11073-3.

Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像（MRI）中进行脑肿瘤分割与检测

Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.

Dual Stream Long Short-Term Memory Feature Fusion Classifier for Surface Electromyography Gesture Recognition.双通道长短时记忆特征融合分类器用于表面肌电手势识别。

Sensors (Basel). 2024 Jun 4;24(11):3631. doi: 10.3390/s24113631.

基于超维脑启发学习的大规模下丘神经元活动的语音识别

Hyperdimensional Brain-Inspired Learning for Phoneme Recognition With Large-Scale Inferior Colliculus Neural Activities.

出版信息

IEEE Trans Biomed Eng. 2024 Nov;71(11):3098-3110. doi: 10.1109/TBME.2024.3408279. Epub 2024 Oct 25.

DOI:10.1109/TBME.2024.3408279

PMID:39008389

Abstract

OBJECTIVE

Develop a novel and highly efficient framework that decodes Inferior Colliculus (IC) neural activities for phoneme recognition.

METHODS

RESULTS

CONCLUSION

SIGNIFICANCE

摘要

目的

开发一种新颖且高效的框架，用于对下丘（IC）神经活动进行解码以实现音素识别。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于超维脑启发学习的大规模下丘神经元活动的语音识别

Hyperdimensional Brain-Inspired Learning for Phoneme Recognition With Large-Scale Inferior Colliculus Neural Activities.

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

SIGNIFICANCE

目的

方法

结果

结论

意义

相似文献

基于超维脑启发学习的大规模下丘神经元活动的语音识别

Hyperdimensional Brain-Inspired Learning for Phoneme Recognition With Large-Scale Inferior Colliculus Neural Activities.

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

SIGNIFICANCE

目的

方法

结果

结论

意义

相似文献