基于物理计算的模拟语音识别。

Analogue speech recognition based on physical computing.

作者信息

Zolfagharinejad Mohamadreza, Büchel Julian, Cassola Lorenzo, Kinge Sachin, Syed Ghazi Sarwat, Sebastian Abu, van der Wiel Wilfred G

机构信息

NanoElectronics Group, MESA+ Institute and BRAINS Center for Brain-Inspired Computing, University of Twente, Enschede, the Netherlands.

IBM Research Europe, Rüschlikon, Switzerland.

出版信息

Nature. 2025 Sep 17. doi: 10.1038/s41586-025-09501-1.

DOI:10.1038/s41586-025-09501-1

PMID:40963022

Abstract

With the rise of decentralized computing, such as in the Internet of Things, autonomous driving and personalized healthcare, it is increasingly important to process time-dependent signals 'at the edge' efficiently: right at the place where the temporal data are collected, avoiding time-consuming, insecure and costly communication with a centralized computing facility (or 'cloud'). However, modern-day processors often cannot meet the restrained power and time budgets of edge systems because of intrinsic limitations imposed by their architecture (von Neumann bottleneck) or domain conversions (analogue to digital and time to frequency). Here we propose an edge temporal-signal processor based on two in-materia computing systems for both feature extraction and classification, reaching near-software accuracy for the TI-46-Word and Google Speech Commands datasets. First, a nonlinear, room-temperature reconfigurable-nonlinear-processing-unit layer realizes analogue, time-domain feature extraction from the raw audio signals, similar to the human cochlea. Second, an analogue in-memory computing chip, consisting of memristive crossbar arrays, implements a compact neural network trained on the extracted features for classification. With submillisecond latency, reconfigurable-nonlinear-processing-unit-based feature extraction consuming roughly 300 nJ per inference, and the analogue in-memory computing-based classifier using around 78 µJ (with potential for roughly 10 µJ), our findings offer a promising avenue for advancing the compactness, efficiency and performance of heterogeneous smart edge processors through in materia computing hardware.

摘要

随着去中心化计算的兴起，如在物联网、自动驾驶和个性化医疗保健领域，在“边缘”高效处理时间相关信号变得越来越重要：就在收集时间数据的地方，避免与集中式计算设施（或“云”）进行耗时、不安全且成本高昂的通信。然而，由于其架构（冯·诺依曼瓶颈）或域转换（模拟到数字以及时间到频率）带来的固有局限性，现代处理器往往无法满足边缘系统严格的功率和时间预算。在此，我们提出一种基于两个材料内计算系统的边缘时间信号处理器，用于特征提取和分类，在TI - 46 - Word和谷歌语音命令数据集上达到了接近软件的准确率。首先，一个非线性的室温可重构非线性处理单元层实现了从原始音频信号中进行模拟时域特征提取，类似于人类耳蜗。其次，一个由忆阻交叉阵列组成的模拟内存计算芯片，对提取的特征进行训练以实现分类的紧凑型神经网络。我们的研究结果表明，基于可重构非线性处理单元的特征提取每推理一次的延迟为亚毫秒级，功耗约为300纳焦，基于模拟内存计算的分类器功耗约为78微焦（潜在功耗约为10微焦），这为通过材料内计算硬件提高异构智能边缘处理器的紧凑性、效率和性能提供了一条有前景的途径。

相似文献

Analogue speech recognition based on physical computing.基于物理计算的模拟语音识别。

Nature. 2025 Sep 17. doi: 10.1038/s41586-025-09501-1.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Short-Term Memory Impairment短期记忆障碍

Leveraging multithreading on edge computing for smart healthcare based on intelligent multimodal classification approach.基于智能多模态分类方法，在边缘计算上利用多线程实现智能医疗保健。

Comput Med Imaging Graph. 2025 Jul 1;124:102594. doi: 10.1016/j.compmedimag.2025.102594.

Post-pandemic planning for maternity care for local, regional, and national maternity systems across the four nations: a mixed-methods study.针对四个地区的地方、区域和国家孕产妇保健系统的疫情后规划：一项混合方法研究。

Health Soc Care Deliv Res. 2025 Sep;13(35):1-25. doi: 10.3310/HHTE6611.

Sexual Harassment and Prevention Training性骚扰与预防培训

A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。

Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.

Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义

APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂（GLP-1 RAs）减肥效果的网状Meta分析的数量、质量及结果：一项范围综述

Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.

本文引用的文献

Phase-Change Memory for In-Memory Computing.用于内存计算的相变存储器

Chem Rev. 2025 Jun 11;125(11):5163-5194. doi: 10.1021/acs.chemrev.4c00670. Epub 2025 May 22.

Demonstration of 4-quadrant analog in-memory matrix multiplication in a single modulation.在单次调制中演示四象限模拟内存矩阵乘法。

Npj Unconv Comput. 2024;1(1):11. doi: 10.1038/s44335-024-00010-4. Epub 2024 Oct 3.

Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators.使用基于内存计算的加速器对大规模多样的深度学习推理工作负载进行硬件感知训练。

Nat Commun. 2023 Aug 30;14(1):5282. doi: 10.1038/s41467-023-40770-4.

An analog-AI chip for energy-efficient speech recognition and transcription.一种用于节能语音识别和转录的模拟人工智能芯片。

Nature. 2023 Aug;620(7975):768-775. doi: 10.1038/s41586-023-06337-5. Epub 2023 Aug 23.

Hopf physical reservoir computer for reconfigurable sound recognition.Hopf 物理存储计算机可用于可重构声音识别。

Sci Rep. 2023 May 30;13(1):8719. doi: 10.1038/s41598-023-35760-x.

Thousands of conductance levels in memristors integrated on CMOS.在 CMOS 上集成的数千个电导水平的忆阻器。

Nature. 2023 Mar;615(7954):823-829. doi: 10.1038/s41586-023-05759-5. Epub 2023 Mar 29.

In-Materio Reservoir Computing in a Sulfonated Polyaniline Network.磺化聚苯胺网络中的原位储层计算

Adv Mater. 2021 Dec;33(48):e2102688. doi: 10.1002/adma.202102688. Epub 2021 Sep 17.

A deep-learning approach to realizing functionality in nanoelectronic devices.深度学习在纳米电子器件功能实现中的应用。

Nat Nanotechnol. 2020 Dec;15(12):992-998. doi: 10.1038/s41565-020-00779-y. Epub 2020 Oct 19.

Fully hardware-implemented memristor convolutional neural network.全硬件实现的忆阻器卷积神经网络。

Nature. 2020 Jan;577(7792):641-646. doi: 10.1038/s41586-020-1942-4. Epub 2020 Jan 29.

Classification with a disordered dopant-atom network in silicon.硅中具有无序掺杂原子网络的分类。

Nature. 2020 Jan;577(7790):341-345. doi: 10.1038/s41586-019-1901-0. Epub 2020 Jan 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于物理计算的模拟语音识别。

Analogue speech recognition based on physical computing.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献