Suppr超能文献

基于物理计算的模拟语音识别。

Analogue speech recognition based on physical computing.

作者信息

Zolfagharinejad Mohamadreza, Büchel Julian, Cassola Lorenzo, Kinge Sachin, Syed Ghazi Sarwat, Sebastian Abu, van der Wiel Wilfred G

机构信息

NanoElectronics Group, MESA+ Institute and BRAINS Center for Brain-Inspired Computing, University of Twente, Enschede, the Netherlands.

IBM Research Europe, Rüschlikon, Switzerland.

出版信息

Nature. 2025 Sep 17. doi: 10.1038/s41586-025-09501-1.

Abstract

With the rise of decentralized computing, such as in the Internet of Things, autonomous driving and personalized healthcare, it is increasingly important to process time-dependent signals 'at the edge' efficiently: right at the place where the temporal data are collected, avoiding time-consuming, insecure and costly communication with a centralized computing facility (or 'cloud'). However, modern-day processors often cannot meet the restrained power and time budgets of edge systems because of intrinsic limitations imposed by their architecture (von Neumann bottleneck) or domain conversions (analogue to digital and time to frequency). Here we propose an edge temporal-signal processor based on two in-materia computing systems for both feature extraction and classification, reaching near-software accuracy for the TI-46-Word and Google Speech Commands datasets. First, a nonlinear, room-temperature reconfigurable-nonlinear-processing-unit layer realizes analogue, time-domain feature extraction from the raw audio signals, similar to the human cochlea. Second, an analogue in-memory computing chip, consisting of memristive crossbar arrays, implements a compact neural network trained on the extracted features for classification. With submillisecond latency, reconfigurable-nonlinear-processing-unit-based feature extraction consuming roughly 300 nJ per inference, and the analogue in-memory computing-based classifier using around 78 µJ (with potential for roughly 10 µJ), our findings offer a promising avenue for advancing the compactness, efficiency and performance of heterogeneous smart edge processors through in materia computing hardware.

摘要

随着去中心化计算的兴起,如在物联网、自动驾驶和个性化医疗保健领域,在“边缘”高效处理时间相关信号变得越来越重要:就在收集时间数据的地方,避免与集中式计算设施(或“云”)进行耗时、不安全且成本高昂的通信。然而,由于其架构(冯·诺依曼瓶颈)或域转换(模拟到数字以及时间到频率)带来的固有局限性,现代处理器往往无法满足边缘系统严格的功率和时间预算。在此,我们提出一种基于两个材料内计算系统的边缘时间信号处理器,用于特征提取和分类,在TI - 46 - Word和谷歌语音命令数据集上达到了接近软件的准确率。首先,一个非线性的室温可重构非线性处理单元层实现了从原始音频信号中进行模拟时域特征提取,类似于人类耳蜗。其次,一个由忆阻交叉阵列组成的模拟内存计算芯片,对提取的特征进行训练以实现分类的紧凑型神经网络。我们的研究结果表明,基于可重构非线性处理单元的特征提取每推理一次的延迟为亚毫秒级,功耗约为300纳焦,基于模拟内存计算的分类器功耗约为78微焦(潜在功耗约为10微焦),这为通过材料内计算硬件提高异构智能边缘处理器的紧凑性、效率和性能提供了一条有前景的途径。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验