IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.
IBM Research-Yorktown Heights, 1101 Kitchawan Road, Yorktown Heights, NY, USA.
Nat Commun. 2022 Jun 30;13(1):3765. doi: 10.1038/s41467-022-31405-1.
Analogue memory-based deep neural networks provide energy-efficiency and per-area throughput gains relative to state-of-the-art digital counterparts such as graphics processing units. Recent advances focus largely on hardware-aware algorithmic training and improvements to circuits, architectures, and memory devices. Optimal translation of software-trained weights into analogue hardware weights, given the plethora of complex memory non-idealities, represents an equally important task. We report a generalised computational framework that automates the crafting of complex weight programming strategies to minimise accuracy degradations during inference, particularly over time. The framework is agnostic to network structure and generalises well across recurrent, convolutional, and transformer neural networks. As a highly flexible numerical heuristic, the approach accommodates arbitrary device-level complexity, making it potentially relevant for a variety of analogue memories. By quantifying the limit of achievable inference accuracy, it also enables analogue memory-based deep neural network accelerators to reach their full inference potential.
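To make concrete why weight programming is non-trivial, the following Python sketch (hypothetical, not the paper's actual framework; the differential conductance encoding, the power-law drift model, and all parameter values are assumptions chosen for illustration) maps a software-trained weight matrix onto conductance pairs, perturbs them with programming noise, applies conductance drift, and reports how the effective weights seen at inference time degrade:

    import numpy as np

    rng = np.random.default_rng(0)

    G_MAX = 25.0          # assumed maximum device conductance (microsiemens)
    PROG_NOISE_STD = 0.5  # assumed programming-noise std (microsiemens)
    DRIFT_NU = 0.05       # assumed drift exponent for G(t) = G(t0) * (t/t0)**-nu
    T0 = 1.0              # reference read time after programming (seconds)

    def program_weights(W, t_eval):
        """Map weights in [-1, 1] to a differential conductance pair,
        add programming noise, apply power-law drift, and read back the
        effective weights at time t_eval."""
        w_scaled = np.clip(W, -1.0, 1.0)
        # Differential encoding: positive weights on G_plus, negative on G_minus.
        g_plus = np.where(w_scaled > 0, w_scaled, 0.0) * G_MAX
        g_minus = np.where(w_scaled < 0, -w_scaled, 0.0) * G_MAX
        # Programming noise: each device lands near, not exactly at, its target.
        g_plus = np.clip(g_plus + rng.normal(0, PROG_NOISE_STD, W.shape), 0, G_MAX)
        g_minus = np.clip(g_minus + rng.normal(0, PROG_NOISE_STD, W.shape), 0, G_MAX)
        # Conductance drift: device conductances decay over time with a power law.
        decay = (t_eval / T0) ** (-DRIFT_NU)
        return (g_plus - g_minus) * decay / G_MAX

    W = rng.uniform(-1, 1, size=(64, 64))    # stand-in for trained weights
    for t in (1.0, 3600.0, 86400.0):         # 1 s, 1 h, 1 day after programming
        err = np.abs(program_weights(W, t) - W).mean()
        print(f"t = {t:>8.0f} s  mean |weight error| = {err:.4f}")

Under these assumed device models, the mean weight error grows with time since programming; this is precisely the time-dependent accuracy degradation that the weight programming strategies described in the abstract are designed to minimise.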