Suppr超能文献

大数据:一种基于MapReduce的并行粒子群优化-反向传播神经网络算法

Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce.

作者信息

Cao Jianfang, Cui Hongyan, Shi Hao, Jiao Lijuan

机构信息

Computer Science and Technology Department, Xinzhou Teachers University, Xinzhou, China.

College of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan, China.

出版信息

PLoS One. 2016 Jun 15;11(6):e0157551. doi: 10.1371/journal.pone.0157551. eCollection 2016.

Abstract

A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decreases as well. To improve the classification accuracy and runtime efficiency of the BP neural network algorithm, we proposed a parallel design and realization method for a particle swarm optimization (PSO)-optimized BP neural network based on MapReduce on the Hadoop platform using both the PSO algorithm and a parallel design. The PSO algorithm was used to optimize the BP neural network's initial weights and thresholds and improve the accuracy of the classification algorithm. The MapReduce parallel programming model was utilized to achieve parallel processing of the BP algorithm, thereby solving the problems of hardware and communication overhead when the BP neural network addresses big data. Datasets on 5 different scales were constructed using the scene image library from the SUN Database. The classification accuracy of the parallel PSO-BP neural network algorithm is approximately 92%, and the system efficiency is approximately 0.85, which presents obvious advantages when processing big data. The algorithm proposed in this study demonstrated both higher classification accuracy and improved time efficiency, which represents a significant improvement obtained from applying parallel processing to an intelligent algorithm on big data.

摘要

反向传播(BP)神经网络能够解决复杂的随机非线性映射问题,因此可应用于广泛的问题。然而,随着样本数量的增加,训练BP神经网络所需的时间会变长。此外,分类准确率也会降低。为提高BP神经网络算法的分类准确率和运行时效率,我们基于Hadoop平台上的MapReduce提出了一种并行设计与实现方法,用于对粒子群优化(PSO)的BP神经网络进行优化,同时使用PSO算法和并行设计。PSO算法用于优化BP神经网络的初始权重和阈值,提高分类算法的准确率。利用MapReduce并行编程模型实现BP算法的并行处理,从而解决BP神经网络处理大数据时的硬件和通信开销问题。使用来自SUN数据库的场景图像库构建了5种不同规模的数据集。并行PSO-BP神经网络算法的分类准确率约为92%,系统效率约为0.85,在处理大数据时具有明显优势。本研究提出的算法展现出更高的分类准确率和提升的时间效率,这代表了将并行处理应用于大数据智能算法所取得的显著进步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1827/4909218/44feb8bc7e32/pone.0157551.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验