Suppr超能文献

神经网络结构对使用GPU/TPU的卷积神经网络在图像分析中加速性能和提高准确性的影响。

Effect of neural network structure in accelerating performance and accuracy of a convolutional neural network with GPU/TPU for image analytics.

作者信息

Ravikumar Aswathy, Sriraman Harini, Sai Saketh P Maruthi, Lokesh Saddikuti, Karanam Abhiram

机构信息

School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, Tamil Nadu, India.

出版信息

PeerJ Comput Sci. 2022 Mar 3;8:e909. doi: 10.7717/peerj-cs.909. eCollection 2022.

Abstract

BACKGROUND

In deep learning the most significant breakthrough in the field of image recognition, object detection language processing was done by Convolutional Neural Network (CNN). Rapid growth in data and neural networks the performance of the DNN algorithms depends on the computation power and the storage capacity of the devices.

METHODS

In this paper, the convolutional neural network used for various image applications was studied and its acceleration in the various platforms like CPU, GPU, TPU was done. The neural network structure and the computing power and characteristics of the GPU, TPU was analyzed and summarized, the effect of these on accelerating the tasks is also explained. Cross-platform comparison of the CNN was done using three image applications the face mask detection (object detection/Computer Vision), Virus Detection in Plants (Image Classification: agriculture sector), and Pneumonia detection from X-ray Images (Image Classification/medical field).

RESULTS

The CNN implementation was done and a comprehensive comparison was done on the platforms to identify the performance, throughput, bottlenecks, and training time. The CNN layer-wise execution in GPU and TPU is explained with layer-wise analysis. The impact of the fully connected layer and convolutional layer on the network is analyzed. The challenges faced during the acceleration process were discussed and future works are identified.

摘要

背景

在深度学习这一图像识别、目标检测和语言处理领域最重大的突破是由卷积神经网络(CNN)实现的。随着数据和神经网络的快速增长,深度神经网络(DNN)算法的性能取决于设备的计算能力和存储容量。

方法

本文研究了用于各种图像应用的卷积神经网络,并在CPU、GPU、TPU等各种平台上对其进行了加速。分析并总结了神经网络结构以及GPU、TPU的计算能力和特性,并解释了它们对加速任务的影响。使用面部口罩检测(目标检测/计算机视觉)、植物病毒检测(图像分类:农业领域)和X光图像肺炎检测(图像分类/医学领域)这三个图像应用对CNN进行了跨平台比较。

结果

完成了CNN的实现,并在各平台上进行了全面比较,以确定性能、吞吐量、瓶颈和训练时间。通过逐层分析解释了GPU和TPU中CNN的逐层执行情况。分析了全连接层和卷积层对网络的影响。讨论了加速过程中面临的挑战,并确定了未来的工作。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d9/9044238/f67061242a1d/peerj-cs-08-909-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验