• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于时间序列的非线性门控专家:发现模式并避免过拟合。

Nonlinear gated experts for time series: discovering regimes and avoiding overfitting.

作者信息

Weigend A S, Mangeas M, Srivastava A N

机构信息

Department of Computer Science, University of Colorado, Boulder, 80309-0430, USA.

出版信息

Int J Neural Syst. 1995 Dec;6(4):373-99. doi: 10.1142/s0129065795000251.

DOI:10.1142/s0129065795000251
PMID:8963468
Abstract

In the analysis and prediction of real-world systems, two of the key problems are nonstationarity (often in the form of switching between regimes) and overfitting (particularly serious for noisy processes). This article addresses these problems using gated experts, consisting of a (nonlinear) gating network, and several (also nonlinear) competing experts. Each expert learns to predict the conditional mean, and each expert adapts its width to match the noise level in its regime. The gating network learns to predict the probability of each expert, given the input. This article focuses on the case where the gating network bases its decision on information from the inputs. This can be contrasted to hidden Markov models where the decision is based on the previous state(s) (i.e. on the output of the gating network at the previous time step), as well as to averaging over several predictors. In contrast, gated experts soft-partition the input space, only learning to model their region. This article discusses the underlying statistical assumptions, derives the weight update rules, and compares the performance of gated experts to standard methods on three time series: (1) a computer-generated series, obtained by randomly switching between two nonlinear processes; (2) a time series from the Santa Fe Time Series Competition (the light intensity of a laser in chaotic state); and (3) the daily electricity demand of France, a real-world multivariate problem with structure on several time scales. The main results are: (1) the gating network correctly discovers the different regimes of the process; (2) the widths associated with each expert are important for the segmentation task (and they can be used to characterize the sub-processes); and (3) there is less overfitting compared to single networks (homogeneous multilayer perceptrons), since the experts learn to match their variances to the (local) noise levels. This can be viewed as matching the local complexity of the model to the local complexity of the data.

摘要

在对现实世界系统进行分析和预测时,两个关键问题是非平稳性(通常表现为不同状态之间的切换)和过拟合(对于有噪声的过程尤为严重)。本文使用门控专家来解决这些问题,门控专家由一个(非线性)门控网络和几个(同样是非线性)竞争专家组成。每个专家学习预测条件均值,并且每个专家调整其宽度以匹配其所处状态下的噪声水平。门控网络学习根据输入预测每个专家的概率。本文重点关注门控网络基于输入信息进行决策的情况。这与隐马尔可夫模型形成对比,在隐马尔可夫模型中决策基于先前状态(即前一时间步上门控网络的输出),也与对多个预测器进行平均的情况形成对比。相比之下,门控专家对输入空间进行软划分,只学习对其区域进行建模。本文讨论了潜在的统计假设,推导了权重更新规则,并在三个时间序列上比较了门控专家与标准方法的性能:(1)一个通过在两个非线性过程之间随机切换获得的计算机生成序列;(2)圣达菲时间序列竞赛中的一个时间序列(混沌状态下激光的光强);(3)法国的每日电力需求,这是一个具有多个时间尺度结构的现实世界多变量问题。主要结果如下:(1)门控网络正确地发现了过程的不同状态;(2)与每个专家相关联的宽度对于分割任务很重要(并且它们可用于表征子过程);(3)与单个网络(均匀多层感知器)相比,过拟合较少,因为专家们学习使它们的方差与(局部)噪声水平相匹配。这可以看作是使模型的局部复杂度与数据的局部复杂度相匹配。

相似文献

1
Nonlinear gated experts for time series: discovering regimes and avoiding overfitting.用于时间序列的非线性门控专家:发现模式并避免过拟合。
Int J Neural Syst. 1995 Dec;6(4):373-99. doi: 10.1142/s0129065795000251.
2
Developing a local least-squares support vector machines-based neuro-fuzzy model for nonlinear and chaotic time series prediction.开发基于局部最小二乘支持向量机的神经模糊模型,用于非线性和混沌时间序列预测。
IEEE Trans Neural Netw Learn Syst. 2013 Feb;24(2):207-18. doi: 10.1109/TNNLS.2012.2227148.
3
Predicting conditional probability distributions: a connectionist approach.预测条件概率分布:一种联结主义方法。
Int J Neural Syst. 1995 Jun;6(2):109-18. doi: 10.1142/s0129065795000093.
4
Learning chaotic attractors by neural networks.
Neural Comput. 2000 Oct;12(10):2355-83. doi: 10.1162/089976600300014971.
5
Learning to imitate stochastic time series in a compositional way by chaos.通过混沌来学习以组合方式模仿随机时间序列。
Neural Netw. 2010 Jun;23(5):625-38. doi: 10.1016/j.neunet.2009.12.006. Epub 2009 Dec 23.
6
On structure-exploiting trust-region regularized nonlinear least squares algorithms for neural-network learning.关于用于神经网络学习的基于结构利用的信赖域正则化非线性最小二乘算法
Neural Netw. 2003 Jun-Jul;16(5-6):745-53. doi: 10.1016/S0893-6080(03)00085-6.
7
Forecasting the evolution of nonlinear and nonstationary systems using recurrence-based local Gaussian process models.使用基于递归的局部高斯过程模型预测非线性和非平稳系统的演变。
Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Nov;82(5 Pt 2):056206. doi: 10.1103/PhysRevE.82.056206. Epub 2010 Nov 15.
8
Nonlinear time series analysis by neural networks: a case study.
Int J Neural Syst. 1996 May;7(2):195-201. doi: 10.1142/s0129065796000166.
9
Modelling and prediction for chaotic fir laser attractor using rational function neural network.
Int J Neural Syst. 2001 Feb;11(1):89-99. doi: 10.1142/S0129065701000527.
10
Stochastic nonlinear time series forecasting using time-delay reservoir computers: performance and universality.基于时滞reservoir 计算机的随机非线性时间序列预测:性能与泛化能力。
Neural Netw. 2014 Jul;55:59-71. doi: 10.1016/j.neunet.2014.03.004. Epub 2014 Mar 21.

引用本文的文献

1
Preventing Forklift Front-End Failures: Predicting the Weight Centers of Heavy Objects, Remaining Useful Life Prediction under Abnormal Conditions, and Failure Diagnosis Based on Alarm Rules.预防叉车前端故障:预测重物重心、异常工况下剩余使用寿命预测以及基于报警规则的故障诊断
Sensors (Basel). 2023 Sep 6;23(18):7706. doi: 10.3390/s23187706.
2
A Generalized Mixture Framework for Multi-label Classification.一种用于多标签分类的广义混合框架。
Proc SIAM Int Conf Data Min. 2015;2015:712-720. doi: 10.1137/1.9781611974010.80.
3
Computational approaches to motor learning by imitation.
通过模仿进行运动学习的计算方法。
Philos Trans R Soc Lond B Biol Sci. 2003 Mar 29;358(1431):537-47. doi: 10.1098/rstb.2002.1258.