Algorithm for Training Neural Networks on Resistive Device Arrays.

Author Information

Tayfun Gokmen, Wilfried Haensch

Affiliations

IBM Research AI, Yorktown Heights, NY, United States.

Publication Information

Front Neurosci. 2020 Feb 26;14:103. doi: 10.3389/fnins.2020.00103. eCollection 2020.

Abstract

Hardware architectures composed of resistive cross-point device arrays can provide significant power and speed benefits for deep neural network training workloads using the stochastic gradient descent (SGD) and backpropagation (BP) algorithms. The training accuracy on this imminent analog hardware, however, strongly depends on the switching characteristics of the cross-point elements. One of the key requirements is that these resistive devices must change conductance in a symmetrical fashion when subjected to positive or negative pulse stimuli. Here, we present a new training algorithm, the so-called "Tiki-Taka" algorithm, that eliminates this stringent symmetry requirement. We show that device asymmetry introduces an unintentional implicit cost term into the SGD algorithm, whereas in the "Tiki-Taka" algorithm a coupled dynamical system simultaneously minimizes the original objective function of the neural network and the unintentional cost term due to device asymmetry in a self-consistent fashion. We tested the validity of this new algorithm on a range of network architectures, such as fully connected, convolutional, and LSTM networks. Simulation results on these various networks show that the accuracy achieved using the conventional SGD algorithm with symmetric (ideal) device switching characteristics is matched by the accuracy achieved using the "Tiki-Taka" algorithm with non-symmetric (non-ideal) device switching characteristics. Moreover, all the operations performed on the arrays remain parallel, so the implementation cost of this new algorithm on array architectures is minimal and it maintains the aforementioned power and speed benefits. These algorithmic improvements are crucial to relax the material specifications and to realize technologically viable resistive crossbar arrays that outperform digital accelerators on similar training tasks.
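The coupled-dynamics idea in the abstract can be illustrated with a toy simulation. This is a minimal sketch, not the paper's actual implementation: it assumes a simple "soft-bounds" asymmetric device model, a noisy scalar-per-weight gradient in place of a real network, and illustrative hyperparameters (`gamma`, `eta_t`, `ns`) chosen for this toy problem only. The key structure follows the abstract: gradient updates land on a fast array A whose asymmetry biases it toward its symmetry point, while a slow array C periodically absorbs transfers from A, and the effective weight is W = γ·A + C.

```python
import numpy as np

WMAX = 1.0  # conductance bound of the model device

def device_update(w, dw, beta=0.9):
    # Soft-bounds asymmetric device model (an assumption, not the paper's
    # measured device): for dw > 0 the step is dw * (1 - beta*w/WMAX), for
    # dw < 0 it is dw * (1 + beta*w/WMAX), i.e. up/down pulses of equal ideal
    # size are unequal everywhere except the symmetry point w = 0.
    step = dw - beta * w * np.abs(dw) / WMAX
    return np.clip(w + step, -WMAX, WMAX)

def run(tiki_taka, steps=20000, lr=0.01, gamma=0.1, eta_t=0.05, ns=2, seed=0):
    rng = np.random.default_rng(seed)
    target = np.array([0.6, -0.4, 0.3, -0.6])  # weights SGD should converge to
    A = np.zeros(4)  # fast array: receives every (noisy) gradient update
    C = np.zeros(4)  # slow array: receives periodic transfers from A
    for k in range(steps):
        W = gamma * A + C if tiki_taka else A
        # Noisy gradient of a quadratic loss, standing in for minibatch SGD.
        grad = 2.0 * (W - target) + rng.normal(0.0, 1.0, size=4)
        A = device_update(A, -lr * grad)
        if tiki_taka and k % ns == 0:
            # Transfer step: push a fraction of A into C (also asymmetric).
            C = device_update(C, eta_t * A)
    W = gamma * A + C if tiki_taka else A
    return np.abs(W - target).mean()

err_sgd = run(tiki_taka=False)   # plain SGD on asymmetric devices
err_tiki = run(tiki_taka=True)   # coupled A/C dynamics on the same devices
print(f"mean |W - target|: SGD {err_sgd:.3f}, Tiki-Taka {err_tiki:.3f}")
```

In this sketch the asymmetry adds a drift toward w = 0 whenever the gradient is noisy, which is the "implicit cost term" of the abstract: plain SGD settles visibly short of the target. In the coupled system, C holds the converged weight while A hovers near its symmetry point, where the asymmetry penalty vanishes, so the residual error is much smaller.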

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4782/7054461/07153f501504/fnins-14-00103-g001.jpg
