University of Belgrade - Faculty of Mechanical Engineering, Innovation Center, Kraljice Marije 16; 11120 Belgrade 35, Serbia.
University of Belgrade - Faculty of Mechanical Engineering, Production Engineering Department, Kraljice Marije 16; 11120 Belgrade 35, Serbia.
Neural Netw. 2015 Mar;63:31-47. doi: 10.1016/j.neunet.2014.11.001. Epub 2014 Nov 15.
Feedforward neural networks (FFNN) are among the most used neural networks for modeling of various nonlinear problems in engineering. In sequential and especially real time processing all neural networks models fail when faced with outliers. Outliers are found across a wide range of engineering problems. Recent research results in the field have shown that to avoid overfitting or divergence of the model, new approach is needed especially if FFNN is to run sequentially or in real time. To accommodate limitations of FFNN when training data contains a certain number of outliers, this paper presents new learning algorithm based on improvement of conventional extended Kalman filter (EKF). Extended Kalman filter robust to outliers (EKF-OR) is probabilistic generative model in which measurement noise covariance is not constant; the sequence of noise measurement covariance is modeled as stochastic process over the set of symmetric positive-definite matrices in which prior is modeled as inverse Wishart distribution. In each iteration EKF-OR simultaneously estimates noise estimates and current best estimate of FFNN parameters. Bayesian framework enables one to mathematically derive expressions, while analytical intractability of the Bayes' update step is solved by using structured variational approximation. All mathematical expressions in the paper are derived using the first principles. Extensive experimental study shows that FFNN trained with developed learning algorithm, achieves low prediction error and good generalization quality regardless of outliers' presence in training data.
前馈神经网络 (FFNN) 是用于工程中各种非线性问题建模的最常用神经网络之一。在顺序处理和实时处理中,当遇到异常值时,所有神经网络模型都会失败。异常值在广泛的工程问题中都有发现。该领域的最新研究结果表明,为了避免模型过度拟合或发散,特别是如果 FFNN 要顺序或实时运行,则需要新的方法。为了在训练数据包含一定数量的异常值时适应 FFNN 的局限性,本文提出了一种基于改进传统扩展卡尔曼滤波器 (EKF) 的新学习算法。抗异常值的扩展卡尔曼滤波器 (EKF-OR) 是一种概率生成模型,其中测量噪声协方差不是常数;噪声测量协方差的序列被建模为对称正定矩阵集合上的随机过程,其中先验被建模为逆 Wishart 分布。在每次迭代中,EKF-OR 同时估计噪声估计值和 FFNN 参数的当前最佳估计值。贝叶斯框架使我们能够从数学上推导出表达式,同时通过使用结构化变分逼近来解决贝叶斯更新步骤的分析不可行性。本文中的所有数学表达式都是从第一性原理推导出的。广泛的实验研究表明,无论训练数据中是否存在异常值,使用开发的学习算法训练的 FFNN 都能实现低预测误差和良好的泛化质量。