Corzo Gerald, Solomatine Dimitri
Department of Hydroinformatics and Knowledge Management, UNESCO-IHE Institute for Water Education, Delft, The Netherlands.
Neural Netw. 2007 May;20(4):528-36. doi: 10.1016/j.neunet.2007.04.019. Epub 2007 May 6.
Natural phenomena are multistationary and are composed of a number of interacting processes, so one single model handling all processes often suffers from inaccuracies. A solution is to partition data in relation to such processes using the available domain knowledge or expert judgment, to train separate models for each of the processes, and to merge them in a modular model (committee). In this paper a problem of water flow forecast in watershed hydrology is considered where the flow process can be presented as consisting of two subprocesses -- base flow and excess flow, so that these two processes can be separated. Several approaches to data separation techniques are studied. Two case studies with different forecast horizons are considered. Parameters of the algorithms responsible for data partitioning are optimized using genetic algorithms and global pattern search. It was found that modularization of ANN models using domain knowledge makes models more accurate, if compared with a global model trained on the whole data set, especially when forecast horizon (and hence the complexity of the modelled processes) is increased.
自然现象是多稳态的,由许多相互作用的过程组成,因此处理所有过程的单一模型往往存在不准确的问题。一种解决方案是利用可用的领域知识或专家判断,根据这些过程对数据进行划分,为每个过程训练单独的模型,并将它们合并到一个模块化模型(委员会)中。本文考虑了流域水文学中的水流预测问题,其中水流过程可以表示为由两个子过程——基流和超量径流组成,这样这两个过程就可以分离。研究了几种数据分离技术的方法。考虑了两个具有不同预测期的案例研究。使用遗传算法和全局模式搜索对负责数据划分的算法参数进行了优化。结果发现,与在整个数据集上训练的全局模型相比,利用领域知识对人工神经网络模型进行模块化处理可以使模型更加准确,尤其是当预测期(以及因此建模过程的复杂性)增加时。