• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GPU 上 3D 卷积神经网络的高性能实现。

High Performance Implementation of 3D Convolutional Neural Networks on a GPU.

机构信息

College of Computer, National University of Defense Technology, Changsha 410073, China.

National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China.

出版信息

Comput Intell Neurosci. 2017;2017:8348671. doi: 10.1155/2017/8348671. Epub 2017 Nov 8.

DOI:10.1155/2017/8348671
PMID:29250109
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5698830/
Abstract

Convolutional neural networks have proven to be highly successful in applications such as image classification, object tracking, and many other tasks based on 2D inputs. Recently, researchers have started to apply convolutional neural networks to video classification, which constitutes a 3D input and requires far larger amounts of memory and much more computation. FFT based methods can reduce the amount of computation, but this generally comes at the cost of an increased memory requirement. On the other hand, the Winograd Minimal Filtering Algorithm (WMFA) can reduce the number of operations required and thus can speed up the computation, without increasing the required memory. This strategy was shown to be successful for 2D neural networks. We implement the algorithm for 3D convolutional neural networks and apply it to a popular 3D convolutional neural network which is used to classify videos and compare it to cuDNN. For our highly optimized implementation of the algorithm, we observe a twofold speedup for most of the 3D convolution layers of our test network compared to the cuDNN version.

摘要

卷积神经网络在图像分类、目标跟踪等基于二维输入的许多任务中已经被证明非常成功。最近,研究人员开始将卷积神经网络应用于视频分类,这构成了三维输入,需要大量的内存和更多的计算。基于 FFT 的方法可以减少计算量,但这通常是以增加内存需求为代价的。另一方面,Winograd Minimal Filtering Algorithm (WMFA) 可以减少所需的操作数量,从而加速计算,而不增加所需的内存。这种策略在二维神经网络中被证明是成功的。我们为三维卷积神经网络实现了该算法,并将其应用于一种流行的用于视频分类的三维卷积神经网络,将其与 cuDNN 进行比较。对于我们对算法的高度优化实现,与 cuDNN 版本相比,我们测试网络的大多数三维卷积层的速度都提高了两倍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/a49f73512364/CIN2017-8348671.alg.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/7b5320db7814/CIN2017-8348671.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/4eeb8787edd4/CIN2017-8348671.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/759290d803e0/CIN2017-8348671.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/a49de6472776/CIN2017-8348671.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/95b294b64a12/CIN2017-8348671.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/faab2d2c7e58/CIN2017-8348671.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/a49f73512364/CIN2017-8348671.alg.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/7b5320db7814/CIN2017-8348671.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/4eeb8787edd4/CIN2017-8348671.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/759290d803e0/CIN2017-8348671.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/a49de6472776/CIN2017-8348671.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/95b294b64a12/CIN2017-8348671.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/faab2d2c7e58/CIN2017-8348671.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aa3/5698830/a49f73512364/CIN2017-8348671.alg.002.jpg

相似文献

1
High Performance Implementation of 3D Convolutional Neural Networks on a GPU.GPU 上 3D 卷积神经网络的高性能实现。
Comput Intell Neurosci. 2017;2017:8348671. doi: 10.1155/2017/8348671. Epub 2017 Nov 8.
2
Differential convolutional neural network.差异卷积神经网络。
Neural Netw. 2019 Aug;116:279-287. doi: 10.1016/j.neunet.2019.04.025. Epub 2019 May 10.
3
Attention-aware fully convolutional neural network with convolutional long short-term memory network for ultrasound-based motion tracking.基于注意力感知的全卷积神经网络与卷积长短期记忆网络的超声运动跟踪
Med Phys. 2019 May;46(5):2275-2285. doi: 10.1002/mp.13510. Epub 2019 Apr 22.
4
Segmentation of organs-at-risks in head and neck CT images using convolutional neural networks.使用卷积神经网络对头颈部CT图像中的危险器官进行分割。
Med Phys. 2017 Feb;44(2):547-557. doi: 10.1002/mp.12045.
5
Fast vision through frameless event-based sensing and convolutional processing: application to texture recognition.通过无框架事件驱动传感和卷积处理实现快速视觉:应用于纹理识别
IEEE Trans Neural Netw. 2010 Apr;21(4):609-20. doi: 10.1109/TNN.2009.2039943. Epub 2010 Feb 22.
6
Deep convolutional neural networks for automatic classification of gastric carcinoma using whole slide images in digital histopathology.基于数字病理切片全场图像的深度学习卷积神经网络用于胃癌的自动分类。
Comput Med Imaging Graph. 2017 Nov;61:2-13. doi: 10.1016/j.compmedimag.2017.06.001. Epub 2017 Jun 16.
7
[Using Parallel Convolutional Neural Networks for Treatment Position Recognition in X-ray Images].[使用并行卷积神经网络进行X射线图像中的治疗位置识别]
Zhongguo Yi Liao Qi Xie Za Zhi. 2018 Feb 8;42(2):92-94. doi: 10.3969/j.issn.1671-7104.2018.02.004.
8
MR-based synthetic CT generation using a deep convolutional neural network method.基于磁共振成像利用深度卷积神经网络方法生成合成CT图像
Med Phys. 2017 Apr;44(4):1408-1419. doi: 10.1002/mp.12155. Epub 2017 Mar 21.
9
Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks.使用增强卷积和递归神经网络监测手术视频中的工具使用情况。
Med Image Anal. 2018 Jul;47:203-218. doi: 10.1016/j.media.2018.05.001. Epub 2018 May 9.
10
Efficient training of convolutional deep belief networks in the frequency domain for application to high-resolution 2D and 3D images.卷积深度置信网络在频域中的高效训练及其在高分辨率二维和三维图像中的应用
Neural Comput. 2015 Jan;27(1):211-27. doi: 10.1162/NECO_a_00682.

引用本文的文献

1
Machine learning in healthcare citizen science: A scoping review.医疗保健公民科学中的机器学习:一项范围综述。
Int J Med Inform. 2025 Mar;195:105766. doi: 10.1016/j.ijmedinf.2024.105766. Epub 2024 Dec 19.
2
Potential use of deep learning techniques for postmortem imaging.深度学习技术在死后成像中的潜在应用。
Forensic Sci Med Pathol. 2020 Dec;16(4):671-679. doi: 10.1007/s12024-020-00307-3. Epub 2020 Sep 29.

本文引用的文献

1
3D convolutional neural networks for human action recognition.三维卷积神经网络的人体动作识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31. doi: 10.1109/TPAMI.2012.59.
2
Human tracking using convolutional neural networks.使用卷积神经网络进行人体跟踪。
IEEE Trans Neural Netw. 2010 Oct;21(10):1610-23. doi: 10.1109/TNN.2010.2066286. Epub 2010 Aug 30.
3
Face recognition: a convolutional neural-network approach.人脸识别:一种卷积神经网络方法。
IEEE Trans Neural Netw. 1997;8(1):98-113. doi: 10.1109/72.554195.