Micheli Alessio
Dipartimento di Informatica, Università di Pisa, 56127 Pisa, Italy.
IEEE Trans Neural Netw. 2009 Mar;20(3):498-511. doi: 10.1109/TNN.2008.2010350. Epub 2009 Feb 3.
This paper presents a new approach for learning in structured domains (SDs) using a constructive neural network for graphs (NN4G). The new model allows the extension of the input domain for supervised neural networks to a general class of graphs including both acyclic/cyclic, directed/undirected labeled graphs. In particular, the model can realize adaptive contextual transductions, learning the mapping from graphs for both classification and regression tasks. In contrast to previous neural networks for structures that had a recursive dynamics, NN4G is based on a constructive feedforward architecture with state variables that uses neurons with no feedback connections. The neurons are applied to the input graphs by a general traversal process that relaxes the constraints of previous approaches derived by the causality assumption over hierarchical input data. Moreover, the incremental approach eliminates the need to introduce cyclic dependencies in the definition of the system state variables. In the traversal process, the NN4G units exploit (local) contextual information of the graphs vertices. In spite of the simplicity of the approach, we show that, through the compositionality of the contextual information developed by the learning, the model can deal with contextual information that is incrementally extended according to the graphs topology. The effectiveness and the generality of the new approach are investigated by analyzing its theoretical properties and providing experimental results.
本文提出了一种在结构化领域(SDs)中使用图构造神经网络(NN4G)进行学习的新方法。新模型允许将监督神经网络的输入域扩展到一类通用的图,包括无环/有环、有向/无向带标签图。特别地,该模型可以实现自适应上下文转导,学习用于分类和回归任务的从图到图的映射。与先前具有递归动态的结构神经网络不同,NN4G基于具有状态变量的构造性前馈架构,使用无反馈连接的神经元。通过一个通用的遍历过程将神经元应用于输入图,该过程放宽了先前由因果假设对分层输入数据推导的方法的约束。此外,增量方法消除了在系统状态变量定义中引入循环依赖的需要。在遍历过程中,NN4G单元利用图顶点的(局部)上下文信息。尽管该方法简单,但我们表明,通过学习所开发的上下文信息的组合性,该模型可以处理根据图拓扑结构增量扩展的上下文信息。通过分析其理论特性并提供实验结果,研究了新方法的有效性和通用性。