TECNALIA, 48160 Derio (Bizkaia), Spain.
TECNALIA, 48160 Derio (Bizkaia), Spain.
Neural Netw. 2020 Mar;123:118-133. doi: 10.1016/j.neunet.2019.11.021. Epub 2019 Dec 6.
Stream data processing has lately gained momentum with the arrival of new Big Data scenarios and applications dealing with continuously produced information flows. Unfortunately, traditional machine learning algorithms are not prepared to tackle the specific challenges imposed by data stream processing, such as the need for learning incrementally, limited memory and processing time requirements, and adaptation to non-stationary data, among others. To face these paradigms, Spiking Neural Networks have emerged as one of the most promising stream learning techniques, with variants such as Evolving Spiking Neural Networks capable of efficiently addressing many of these challenges. Interestingly, these networks resort to a particular population encoding scheme - Gaussian Receptive Fields - to transform the incoming stimuli into temporal spikes. The study presented in this manuscript sheds light on the predictive potential of this encoding scheme, focusing on how it can be applied as a computationally lightweight, model-agnostic preprocessing step for data stream learning. We provide informed intuition to unveil under which circumstances the aforementioned population encoding method yields effective prediction gains in data stream classification with respect to the case where no preprocessing is performed. Results obtained for a variety of stream learning models and both synthetic and real stream datasets are discussed to empirically buttress the capability of Gaussian Receptive Fields to boost the predictive performance of stream learning methods, spanning further research towards extrapolating our findings to other machine learning problems.
数据流处理最近随着新的大数据场景和应用的出现而得到了迅猛发展,这些应用处理的是连续产生的信息流。然而,传统的机器学习算法还没有准备好应对数据流处理带来的特定挑战,例如需要增量学习、有限的内存和处理时间要求以及对非平稳数据的适应能力等。为了应对这些挑战,脉冲神经网络已成为最有前途的流学习技术之一,其中演变脉冲神经网络等变体能够有效地解决许多这些挑战。有趣的是,这些网络采用了一种特殊的群体编码方案——高斯感受野,将输入的刺激转换为时间脉冲。本文档中介绍的研究揭示了这种编码方案的预测潜力,重点研究了它如何作为一种计算轻量级、与模型无关的预处理步骤,应用于数据流学习中。我们提供了有根据的直觉,揭示了在何种情况下,上述群体编码方法在数据流分类中相对于不进行预处理的情况能够有效地提高预测增益。讨论了针对各种流学习模型以及合成和真实流数据集的结果,以经验性地支持高斯感受野提高流学习方法预测性能的能力,进一步研究将我们的发现推广到其他机器学习问题。