• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于极端学习机的大数据并行多分类算法。

A Parallel Multiclassification Algorithm for Big Data Using an Extreme Learning Machine.

出版信息

IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2337-2351. doi: 10.1109/TNNLS.2017.2654357. Epub 2017 Apr 20.

DOI:10.1109/TNNLS.2017.2654357
PMID:28436893
Abstract

As data sets become larger and more complicated, an extreme learning machine (ELM) that runs in a traditional serial environment cannot realize its ability to be fast and effective. Although a parallel ELM (PELM) based on MapReduce to process large-scale data shows more efficient learning speed than identical ELM algorithms in a serial environment, some operations, such as intermediate results stored on disks and multiple copies for each task, are indispensable, and these operations create a large amount of extra overhead and degrade the learning speed and efficiency of the PELMs. In this paper, an efficient ELM based on the Spark framework (SELM), which includes three parallel subalgorithms, is proposed for big data classification. By partitioning the corresponding data sets reasonably, the hidden layer output matrix calculation algorithm, matrix decomposition algorithm, and matrix decomposition algorithm perform most of the computations locally. At the same time, they retain the intermediate results in distributed memory and cache the diagonal matrix as broadcast variables instead of several copies for each task to reduce a large amount of the costs, and these actions strengthen the learning ability of the SELM. Finally, we implement our SELM algorithm to classify large data sets. Extensive experiments have been conducted to validate the effectiveness of the proposed algorithms. As shown, our SELM achieves an speedup on a cluster with ten nodes, and reaches a speedup with 15 nodes, an speedup with 20 nodes, a speedup with 25 nodes, a speedup with 30 nodes, and a speedup with 35 nodes.

摘要

随着数据集变得越来越大且越来越复杂,在传统串行环境中运行的极限学习机(ELM)无法实现其快速有效的能力。虽然基于 MapReduce 的并行 ELM(PELM)可用于处理大规模数据,其学习速度比在串行环境中相同的 ELM 算法更快,但一些操作(如存储在磁盘上的中间结果和每个任务的多个副本)是必不可少的,这些操作会产生大量额外开销,并降低 PELM 的学习速度和效率。在本文中,我们提出了一种基于 Spark 框架的高效 ELM(SELM),它包括三个并行子算法,用于大数据分类。通过合理划分相应的数据集,隐藏层输出矩阵计算算法、矩阵分解算法和矩阵分解算法在本地执行大部分计算。同时,它们保留分布式内存中的中间结果,并将对角矩阵作为广播变量缓存,而不是为每个任务复制多个副本,以减少大量开销,这些操作增强了 SELM 的学习能力。最后,我们实现了我们的 SELM 算法来对大数据集进行分类。通过大量实验验证了所提出算法的有效性。结果表明,我们的 SELM 在具有十个节点的集群上实现了加速,在具有十五个节点、二十个节点、二十五个节点、三十个节点和三十五个节点的集群上也实现了加速。

相似文献

1
A Parallel Multiclassification Algorithm for Big Data Using an Extreme Learning Machine.基于极端学习机的大数据并行多分类算法。
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2337-2351. doi: 10.1109/TNNLS.2017.2654357. Epub 2017 Apr 20.
2
A Fast SVD-Hidden-nodes based Extreme Learning Machine for Large-Scale Data Analytics.一种基于快速奇异值分解-隐藏节点的极端学习机用于大规模数据分析
Neural Netw. 2016 May;77:14-28. doi: 10.1016/j.neunet.2015.09.003. Epub 2015 Oct 21.
3
Extreme Learning Machine for Multilayer Perceptron.极限学习机用于多层感知机。
IEEE Trans Neural Netw Learn Syst. 2016 Apr;27(4):809-21. doi: 10.1109/TNNLS.2015.2424995. Epub 2015 May 7.
4
Tuning extreme learning machine by an improved electromagnetism-like mechanism algorithm for classification problem.基于改进的电磁机制算法的极限学习机在分类问题中的调优。
Math Biosci Eng. 2019 May 23;16(5):4692-4707. doi: 10.3934/mbe.2019235.
5
SELM: Siamese extreme learning machine with application to face biometrics.连体极端学习机及其在面部生物识别中的应用
Neural Comput Appl. 2022;34(14):12143-12157. doi: 10.1007/s00521-022-07100-z. Epub 2022 Mar 15.
6
A novel multiple instance learning method based on extreme learning machine.一种基于极限学习机的新型多示例学习方法。
Comput Intell Neurosci. 2015;2015:405890. doi: 10.1155/2015/405890. Epub 2015 Feb 3.
7
Distributed semi-supervised learning algorithm based on extreme learning machine over networks using event-triggered communication scheme.基于事件触发通信方案的网络极端学习机分布式半监督学习算法。
Neural Netw. 2019 Nov;119:261-272. doi: 10.1016/j.neunet.2019.08.013. Epub 2019 Aug 17.
8
Bidirectional extreme learning machine for regression problem and its learning effectiveness.双向极端学习机在回归问题中的应用及其学习有效性。
IEEE Trans Neural Netw Learn Syst. 2012 Sep;23(9):1498-505. doi: 10.1109/TNNLS.2012.2202289.
9
Efficient DV-HOP Localization for Wireless Cyber-Physical Social Sensing System: A Correntropy-Based Neural Network Learning Scheme.用于无线信息物理社会感知系统的高效DV-HOP定位:一种基于核相关度的神经网络学习方案。
Sensors (Basel). 2017 Jan 12;17(1):135. doi: 10.3390/s17010135.
10
A hierarchical semi-supervised extreme learning machine method for EEG recognition.一种用于 EEG 识别的分层半监督极限学习机方法。
Med Biol Eng Comput. 2019 Jan;57(1):147-157. doi: 10.1007/s11517-018-1875-3. Epub 2018 Jul 28.

引用本文的文献

1
Skin Lesion Synthesis and Classification Using an Improved DCGAN Classifier.使用改进的深度卷积生成对抗网络分类器进行皮肤病变合成与分类
Diagnostics (Basel). 2023 Aug 9;13(16):2635. doi: 10.3390/diagnostics13162635.
2
A New Artificial Intelligence Approach Using Extreme Learning Machine as the Potentially Effective Model to Predict and Analyze the Diagnosis of Anemia.一种使用极限学习机作为潜在有效模型来预测和分析贫血诊断的新型人工智能方法。
Healthcare (Basel). 2023 Feb 26;11(5):697. doi: 10.3390/healthcare11050697.
3
Word2vec Word Embedding-Based Artificial Intelligence Model in the Triage of Patients with Suspected Diagnosis of Major Ischemic Stroke: A Feasibility Study.
基于 Word2vec 词嵌入的人工智能模型在疑似大血管闭塞性缺血性脑卒中患者分诊中的可行性研究。
Int J Environ Res Public Health. 2022 Nov 19;19(22):15295. doi: 10.3390/ijerph192215295.
4
Artificial Intelligence in Spinal Imaging: Current Status and Future Directions.人工智能在脊柱成像中的应用:现状与未来方向。
Int J Environ Res Public Health. 2022 Sep 16;19(18):11708. doi: 10.3390/ijerph191811708.
5
Multi-Class Skin Problem Classification Using Deep Generative Adversarial Network (DGAN).基于深度生成对抗网络(DGAN)的多类别皮肤问题分类。
Comput Intell Neurosci. 2022 Mar 23;2022:1797471. doi: 10.1155/2022/1797471. eCollection 2022.
6
A Neural Network Approach for Chinese Sports Tourism Demand Based on Knowledge Discovery.基于知识发现的中文体育旅游需求神经网络方法。
Comput Intell Neurosci. 2022 Apr 4;2022:9400742. doi: 10.1155/2022/9400742. eCollection 2022.
7
Real-Time Tracking of Object Melting Based on Enhanced DeepLab 3+ Network.基于增强型 DeepLab 3+ 网络的物体熔化实时跟踪。
Comput Intell Neurosci. 2022 Mar 30;2022:2309317. doi: 10.1155/2022/2309317. eCollection 2022.
8
Using Big Data-Based Neural Network Parallel Optimization Algorithm in Sports Fatigue Warning.基于大数据的神经网络并行优化算法在运动疲劳预警中的应用。
Comput Intell Neurosci. 2021 Jul 14;2021:2747940. doi: 10.1155/2021/2747940. eCollection 2021.
9
Few-shot pulse wave contour classification based on multi-scale feature extraction.基于多尺度特征提取的少脉冲波轮廓分类。
Sci Rep. 2021 Feb 12;11(1):3762. doi: 10.1038/s41598-021-83134-y.
10
LiDAR Point Cloud Recognition and Visualization with Deep Learning for Overhead Contact Inspection.基于深度学习的激光雷达点云识别与可视化用于架空接触网检测
Sensors (Basel). 2020 Nov 9;20(21):6387. doi: 10.3390/s20216387.