Cost-effective stochastic MAC circuits for deep neural networks.

Affiliation

School of Electrical and Computer Engineering, UNIST, 50, UNIST-gil, Ulsan 44919, Republic of Korea.

Publication information

Neural Netw. 2019 Sep;117:152-162. doi: 10.1016/j.neunet.2019.04.017. Epub 2019 May 20.

DOI:10.1016/j.neunet.2019.04.017

Abstract

Stochastic computing (SC) is a promising computing paradigm that can help address both the uncertainties of future process technology and the challenges of efficient hardware realization for deep neural networks (DNNs). However, the imprecision and long latency of SC have rendered previous SC-based DNN architectures less competitive against optimized fixed-point digital implementations, unless inference accuracy is significantly sacrificed. In this paper, we propose a new SC-MAC (multiply-and-accumulate) algorithm, a key building block for SC-based DNNs, that is orders of magnitude more efficient and accurate than previous SC-MACs. We also show how our new SC-MAC can be extended to a vector version and used to accelerate both convolution and fully-connected layers of convolutional neural networks (CNNs) using the same hardware. Our experimental results using CNNs designed for the MNIST and CIFAR-10 datasets demonstrate that not only are our SC-based CNNs more accurate and 40× to 490× more energy-efficient for convolution layers than conventional SC-based ones, but they can also achieve a lower area-delay product and lower energy than precision-optimized fixed-point implementations without sacrificing accuracy. We also demonstrate the feasibility of our SC-based CNNs through FPGA prototypes.
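
As background on the imprecision and latency the abstract mentions, here is a minimal Python sketch of textbook unipolar SC multiplication (a bitwise AND of two random bitstreams) followed by counter-based accumulation. It is not the SC-MAC algorithm proposed in the paper, whose details the abstract does not give. The function names `to_bitstream`, `sc_multiply`, and `sc_mac`, the seed, and the example values are all hypothetical and chosen only for illustration.

```python
import random

def to_bitstream(p, length, rng):
    """Encode p in [0, 1] as a bitstream whose fraction of 1s approximates p."""
    return [1 if rng.random() < p else 0 for _ in range(length)]

def sc_multiply(x_bits, w_bits):
    """Unipolar SC multiplication: bitwise AND of two independent bitstreams."""
    return [a & b for a, b in zip(x_bits, w_bits)]

def sc_mac(xs, ws, length, seed=0):
    """Estimate sum(x * w) by counting 1s in the AND-ed streams.

    Assumption: accumulation is done with a plain counter; this is an
    illustrative baseline, not the method proposed in the paper.
    """
    rng = random.Random(seed)
    total_ones = 0
    for x, w in zip(xs, ws):
        product_bits = sc_multiply(to_bitstream(x, length, rng),
                                   to_bitstream(w, length, rng))
        total_ones += sum(product_bits)
    return total_ones / length

if __name__ == "__main__":
    xs = [0.5, 0.25, 0.75]   # hypothetical activations in [0, 1]
    ws = [0.5, 0.8, 0.4]     # hypothetical weights in [0, 1]
    exact = sum(x * w for x, w in zip(xs, ws))
    for n in (16, 256, 4096):
        print(n, sc_mac(xs, ws, n), exact)
```

In this sketch the estimation error shrinks only roughly as the inverse square root of the bitstream length, so higher precision requires much longer streams, which is the latency and accuracy trade-off the abstract refers to.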

Similar articles

Cost-effective stochastic MAC circuits for deep neural networks.

Neural Netw. 2019 Sep;117:152-162. doi: 10.1016/j.neunet.2019.04.017. Epub 2019 May 20.

Bitstream-Based Neural Network for Scalable, Efficient, and Accurate Deep Learning Hardware.

Front Neurosci. 2020 Dec 23;14:543472. doi: 10.3389/fnins.2020.543472. eCollection 2020.

A Survey of Stochastic Computing Neural Networks for Machine Learning Applications.

IEEE Trans Neural Netw Learn Syst. 2021 Jul;32(7):2809-2824. doi: 10.1109/TNNLS.2020.3009047. Epub 2021 Jul 6.

Fully Parallel Stochastic Computing Hardware Implementation of Convolutional Neural Networks for Edge Computing Applications.

IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10408-10418. doi: 10.1109/TNNLS.2022.3166799. Epub 2023 Nov 30.

Hardware-Efficient Stochastic Binary CNN Architectures for Near-Sensor Computing.

Front Neurosci. 2022 Jan 5;15:781786. doi: 10.3389/fnins.2021.781786. eCollection 2021.

Stochastic Computing Convolutional Neural Network Architecture Reinvented for Highly Efficient Artificial Intelligence Workload on Field-Programmable Gate Array.

Research (Wash D C). 2024 Mar 4;7:0307. doi: 10.34133/research.0307. eCollection 2024.

Design of Fully Spectral CNNs for Efficient FPGA-Based Acceleration.

IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):8111-8123. doi: 10.1109/TNNLS.2022.3224779. Epub 2024 Jun 3.

Deep Convolutional Neural Networks for large-scale speech tasks.

Neural Netw. 2015 Apr;64:39-48. doi: 10.1016/j.neunet.2014.08.005. Epub 2014 Sep 16.

Towards dropout training for convolutional neural networks.

Neural Netw. 2015 Nov;71:1-10. doi: 10.1016/j.neunet.2015.07.007. Epub 2015 Jul 29.

Training Deep Convolutional Neural Networks with Resistive Cross-Point Devices.

Front Neurosci. 2017 Oct 10;11:538. doi: 10.3389/fnins.2017.00538. eCollection 2017.

Cited by

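Stochastic Computing Convolutional Neural Network Architecture Reinvented for Highly Efficient Artificial Intelligence Workload on Field-Programmable Gate Array.

Research (Wash D C). 2024 Mar 4;7:0307. doi: 10.34133/research.0307. eCollection 2024.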

Prognosis Prediction of Uveal Melanoma After Plaque Brachytherapy Based on Ultrasound With Machine Learning.

Front Med (Lausanne). 2022 Jan 21;8:777142. doi: 10.3389/fmed.2021.777142. eCollection 2021.

Bitstream-Based Neural Network for Scalable, Efficient, and Accurate Deep Learning Hardware.

Front Neurosci. 2020 Dec 23;14:543472. doi: 10.3389/fnins.2020.543472. eCollection 2020.
