• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 GPU 的多个前馈神经网络评估。

An evaluation of multiple feed-forward networks on GPUs.

机构信息

CISUC, Department of Informatics Engineering, University of Coimbra, Portugal.

出版信息

Int J Neural Syst. 2011 Feb;21(1):31-47. doi: 10.1142/S0129065711002638.

DOI:10.1142/S0129065711002638
PMID:21243729
Abstract

The Graphics Processing Unit (GPU) originally designed for rendering graphics and which is difficult to program for other tasks, has since evolved into a device suitable for general-purpose computations. As a result graphics hardware has become progressively more attractive yielding unprecedented performance at a relatively low cost. Thus, it is the ideal candidate to accelerate a wide variety of data parallel tasks in many fields such as in Machine Learning (ML). As problems become more and more demanding, parallel implementations of learning algorithms are crucial for a useful application. In particular, the implementation of Neural Networks (NNs) in GPUs can significantly reduce the long training times during the learning process. In this paper we present a GPU parallel implementation of the Back-Propagation (BP) and Multiple Back-Propagation (MBP) algorithms, and describe the GPU kernels needed for this task. The results obtained on well-known benchmarks show faster training times and improved performances as compared to the implementation in traditional hardware, due to maximized floating-point throughput and memory bandwidth. Moreover, a preliminary GPU based Autonomous Training System (ATS) is developed which aims at automatically finding high-quality NNs-based solutions for a given problem.

摘要

图形处理单元(GPU)最初是为渲染图形而设计的,很难用于其他任务的编程,但后来演变成了一种适用于通用计算的设备。因此,图形硬件变得越来越有吸引力,以相对较低的成本实现了前所未有的性能。因此,它是加速许多领域(如机器学习(ML))中各种数据并行任务的理想候选者。随着问题变得越来越复杂,学习算法的并行实现对于有用的应用至关重要。特别是,神经网络(NN)在 GPU 上的实现可以显著减少学习过程中的长时间训练。在本文中,我们提出了一种 GPU 并行实现反向传播(BP)和多次反向传播(MBP)算法的方法,并描述了实现此任务所需的 GPU 内核。在著名的基准测试上获得的结果表明,与传统硬件的实现相比,训练时间更快,性能得到了提高,这是由于最大化了浮点吞吐量和内存带宽。此外,还开发了一个初步的基于 GPU 的自主训练系统(ATS),旨在为给定问题自动找到基于高质量神经网络的解决方案。

相似文献

1
An evaluation of multiple feed-forward networks on GPUs.基于 GPU 的多个前馈神经网络评估。
Int J Neural Syst. 2011 Feb;21(1):31-47. doi: 10.1142/S0129065711002638.
2
Comparison of GPU- and CPU-implementations of mean-firing rate neural networks on parallel hardware.比较在并行硬件上基于 GPU 和 CPU 的平均发放率神经网络的实现。
Network. 2012;23(4):212-36. doi: 10.3109/0954898X.2012.739292. Epub 2012 Nov 9.
3
Radial basis function networks GPU-based implementation.
IEEE Trans Neural Netw. 2008 Dec;19(12):2150-4. doi: 10.1109/TNN.2008.2003284.
4
NMF-mGPU: non-negative matrix factorization on multi-GPU systems.NMF-mGPU:多GPU系统上的非负矩阵分解
BMC Bioinformatics. 2015 Feb 13;16:43. doi: 10.1186/s12859-015-0485-4.
5
Medical image processing on the GPU - past, present and future.GPU 上的医学图像处理——过去、现在和未来。
Med Image Anal. 2013 Dec;17(8):1073-94. doi: 10.1016/j.media.2013.05.008. Epub 2013 Jun 5.
6
Performance evaluation of image processing algorithms on the GPU.图像处理算法在图形处理器上的性能评估。
J Struct Biol. 2008 Oct;164(1):153-60. doi: 10.1016/j.jsb.2008.07.006. Epub 2008 Jul 24.
7
Evaluation of accelerated iterative x-ray CT image reconstruction using floating point graphics hardware.使用浮点图形硬件对加速迭代X射线CT图像重建进行评估。
Phys Med Biol. 2006 Feb 21;51(4):875-89. doi: 10.1088/0031-9155/51/4/008. Epub 2006 Jan 25.
8
Accelerating reaction-diffusion simulations with general-purpose graphics processing units.使用通用图形处理单元加速反应-扩散模拟。
Bioinformatics. 2011 Jan 15;27(2):288-90. doi: 10.1093/bioinformatics/btq622. Epub 2010 Nov 8.
9
Parallel Implementation of MAFFT on CUDA-Enabled Graphics Hardware.MAFFT在支持CUDA的图形硬件上的并行实现。
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):205-18. doi: 10.1109/TCBB.2014.2351801.
10
GPU accelerated dynamic functional connectivity analysis for functional MRI data.GPU 加速的功能磁共振成像数据动态功能连接分析。
Comput Med Imaging Graph. 2015 Jul;43:53-63. doi: 10.1016/j.compmedimag.2015.02.009. Epub 2015 Mar 7.