Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok, Thailand.
PLoS One. 2019 Sep 9;14(9):e0220624. doi: 10.1371/journal.pone.0220624. eCollection 2019.
Due to the speed at which advanced equipment generates and collects data, the volume of data easily exceeds the available memory space, making it difficult to achieve high learning accuracy. Several methods based on the discard-after-learn concept have been proposed. Some were designed to cope with a single incoming datum, while others were designed for a chunk of incoming data. Although the results of these approaches are rather impressive, most of them learn new incoming data by temporarily adding more neurons without any neuron-merging process, which clearly increases the computational time and space complexities. Only the online versatile elliptic basis function (VEBF) introduced neuron merging to reduce the space-time complexity, and only for learning a single incoming datum. This paper proposes a method that further enhances the discard-after-learn concept for a streaming data-chunk environment in terms of low computational time and neural space complexities. A set of recursive functions for computing the relevant parameters of a new neuron, based on a statistical confidence interval, is introduced. The newly proposed method, named streaming chunk incremental learning (SCIL), increases the plasticity and adaptability of the network structure according to the distribution of incoming data and their classes. When compared with other incremental-like methods on 11 benchmark data sets of 150 to 581,012 samples, with attributes ranging from 4 to 1,558, formed as streaming data, the proposed SCIL gave better accuracy and running time on most data sets.
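The abstract's "set of recursive functions for computing the relevant parameters of a new neuron" can be illustrated with a minimal sketch: folding each incoming chunk into a neuron's running count, mean, and covariance and then discarding the chunk, in the spirit of discard-after-learn. The function name `merge_chunk` and the use of the standard pooled-moments recursion are assumptions for illustration; the paper's exact recursions and its confidence-interval criterion are not reproduced here.

```python
import numpy as np

def merge_chunk(n, mu, cov, X):
    """Recursively fold a new data chunk X (m x d array) into the running
    statistics of a neuron: sample count n, mean mu (d,), and biased
    covariance cov (d x d). The chunk can be discarded afterwards, since
    no raw samples are retained. Hypothetical sketch using the standard
    pooled-moments update, not the paper's exact formulas."""
    m = X.shape[0]
    mu_x = X.mean(axis=0)
    # Scatter matrix of the chunk about its own mean.
    S_x = (X - mu_x).T @ (X - mu_x)
    # Recover the neuron's scatter from its biased covariance.
    S_old = cov * n
    delta = mu_x - mu
    n_new = n + m
    # Recursive mean update: weighted shift toward the chunk mean.
    mu_new = mu + (m / n_new) * delta
    # Recursive scatter update: old scatter + chunk scatter + cross term.
    S_new = S_old + S_x + (n * m / n_new) * np.outer(delta, delta)
    return n_new, mu_new, S_new / n_new
```

Because the update is exact, merging chunks one at a time yields the same mean and covariance as a batch computation over all samples, while keeping memory usage constant in the number of chunks seen.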