Vahed A, Omlin C W
Department of Computer Science, University of the Western Cape, Bellville, South Africa.
Neural Comput. 2004 Jan;16(1):59-71. doi: 10.1162/08997660460733994.
Neural networks do not readily provide an explanation of the knowledge stored in their weights as part of their information processing; until recently, they were considered black boxes whose stored knowledge was not readily accessible. Since then, research has produced a number of algorithms for extracting knowledge in symbolic form from trained neural networks. This article addresses the extraction of symbolic knowledge from recurrent neural networks trained to behave like deterministic finite-state automata (DFAs). To date, methods for extracting knowledge from such networks have relied on the hypothesis that network states tend to cluster and that these clusters correspond to DFA states. The computational complexity of such cluster analysis has led to heuristics that either limit the number of clusters that may form during training or limit the exploration of the space of hidden recurrent state neurons. These limitations, while necessary, may reduce fidelity: the extracted knowledge may fail to model the true behavior of the trained network, perhaps not even on the training set. The method proposed here uses a polynomial-time symbolic learning algorithm to infer DFAs solely from observation of a trained network's input-output behavior, and thus has the potential to increase the fidelity of the extracted knowledge.
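The abstract does not name the symbolic learner used; a classic polynomial-time choice for inferring a DFA purely from input-output (membership) queries is an Angluin-style L* observation-table algorithm, sketched below. All names here (`member`, `infer_dfa`, `ALPHABET`) are illustrative, the toy oracle (even number of a's) merely stands in for a trained network, and the equivalence query is approximated by exhaustive testing up to a bounded length; this is a sketch of the general query-based approach, not the authors' implementation.

```python
from itertools import product

ALPHABET = "ab"

def member(w):
    # Toy black-box oracle standing in for the trained recurrent network:
    # accepts strings over {a, b} containing an even number of a's.
    return w.count("a") % 2 == 0

def infer_dfa(member, alphabet, max_len=6):
    """Infer a DFA from membership queries alone (Angluin-style L*).

    Equivalence queries are approximated by exhaustively comparing the
    hypothesis DFA against the oracle on all strings up to max_len.
    """
    prefixes, suffixes = [""], [""]
    while True:
        def row(p):
            # A state's "signature": oracle answers over all suffixes.
            return tuple(member(p + s) for s in suffixes)

        # Close the observation table: every one-letter extension of a
        # known prefix must produce a row we have already seen.
        changed = True
        while changed:
            changed = False
            rows = {row(p) for p in prefixes}
            for p in list(prefixes):
                for a in alphabet:
                    r = row(p + a)
                    if r not in rows:
                        prefixes.append(p + a)
                        rows.add(r)
                        changed = True

        # Build the hypothesis DFA: one state per distinct row.
        state_of, reps = {}, []
        for p in prefixes:
            r = row(p)
            if r not in state_of:
                state_of[r] = len(reps)
                reps.append(p)
        start = state_of[row("")]
        accept = {state_of[row(p)] for p in reps if member(p)}
        trans = {(state_of[row(p)], a): state_of[row(p + a)]
                 for p in reps for a in alphabet}

        def run(w):
            q = start
            for c in w:
                q = trans[(q, c)]
            return q in accept

        # Approximate equivalence query: search for a counterexample.
        cex = next(("".join(t) for n in range(max_len + 1)
                    for t in product(alphabet, repeat=n)
                    if run("".join(t)) != member("".join(t))), None)
        if cex is None:
            return start, trans, accept

        # Add every suffix of the counterexample as a distinguishing suffix.
        for k in range(len(cex) + 1):
            if cex[k:] not in suffixes:
                suffixes.append(cex[k:])

start, trans, accept = infer_dfa(member, ALPHABET)
```

For the parity language above, the learner converges to the minimal two-state DFA; because it only queries the oracle, fidelity is limited solely by the equivalence check rather than by how the network's hidden states happen to cluster.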