Learning deterministic finite automata with a smart state labeling evolutionary algorithm.

Author information

Lucas Simon M, Reynolds T Jeff

Affiliation

Department of Computer Science, University of Essex, Wivenhoe Park, Colchester, Essex CO4 3SQ, UK.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1063-74. doi: 10.1109/TPAMI.2005.143.

DOI:10.1109/TPAMI.2005.143

PMID:16013754

Abstract

Learning a Deterministic Finite Automaton (DFA) from a training set of labeled strings is a hard task that has been much studied within the machine learning community. It is equivalent to learning a regular language by example and has applications in language modeling. In this paper, we describe a novel evolutionary method for learning DFA that evolves only the transition matrix and uses a simple deterministic procedure to optimally assign state labels. We compare its performance with the Evidence Driven State Merging (EDSM) algorithm, one of the most powerful known DFA learning algorithms. We present results on random DFA induction problems of varying target size and training set density. We also study the effects of noisy training data on the evolutionary approach and on EDSM. On noise-free data, we find that our evolutionary method outperforms EDSM on small sparse data sets. In the case of noisy training data, we find that our evolutionary method consistently outperforms EDSM, as well as other significant methods submitted to two recent competitions.
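
The state-labeling step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that "optimally assign state labels" means: for a fixed transition matrix, run every training string, count how many accepted and rejected strings end in each state, and give each state the majority label. The function names, the data layout, and the tie-breaking rule are hypothetical choices made for this illustration and are not taken from the paper; the evolutionary search over transition matrices is omitted.

```python
from typing import List, Sequence, Tuple

# Assumed layout: transitions[state][symbol] -> next state, symbols are '0'/'1'.
TransitionMatrix = Sequence[Sequence[int]]
Sample = Tuple[str, bool]  # (string over {'0', '1'}, True if accepted)


def final_state(transitions: TransitionMatrix, string: str, start: int = 0) -> int:
    """Follow the transitions from the start state and return the end state."""
    state = start
    for ch in string:
        state = transitions[state][int(ch)]
    return state


def label_states(
    transitions: TransitionMatrix, samples: Sequence[Sample], start: int = 0
) -> Tuple[List[bool], int]:
    """Assign each state its majority label; return labels and the number
    of training strings classified correctly under those labels."""
    n = len(transitions)
    accept = [0] * n
    reject = [0] * n
    for string, is_accepted in samples:
        s = final_state(transitions, string, start)
        if is_accepted:
            accept[s] += 1
        else:
            reject[s] += 1
    # Ties (including unvisited states) default to "reject": an arbitrary choice.
    labels = [accept[s] > reject[s] for s in range(n)]
    correct = sum(max(a, r) for a, r in zip(accept, reject))
    return labels, correct


# Two-state transition matrix that tracks the parity of '1' symbols.
parity = [[0, 1], [1, 0]]
clean = [("", True), ("1", False), ("11", True),
         ("101", True), ("10", False), ("0", True)]
labels, correct = label_states(parity, clean)

# One mislabeled string is outvoted by the other strings reaching the same state.
noisy = clean + [("1", True)]
noisy_labels, noisy_correct = label_states(parity, noisy)
```

Under this assumption, the number of correctly classified strings can serve as the fitness of a candidate transition matrix, so the search only has to explore transitions and never has to guess labels.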

Similar articles

Learning deterministic finite automata with a smart state labeling evolutionary algorithm.

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1063-74. doi: 10.1109/TPAMI.2005.143.

Probabilistic finite-state machines--part I.

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1013-25. doi: 10.1109/TPAMI.2005.147.

Probabilistic finite-state machines--part II.

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1026-39. doi: 10.1109/TPAMI.2005.148.

Parsing with probabilistic strictly locally testable tree languages.

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1040-50. doi: 10.1109/TPAMI.2005.144.

Grammatical inference in bioinformatics.

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1051-62. doi: 10.1109/TPAMI.2005.140.

Structural semantic interconnections: a knowledge-based approach to word sense disambiguation.

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1075-86. doi: 10.1109/TPAMI.2005.149.

Online clustering algorithms for radar emitter classification.

IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1185-96. doi: 10.1109/TPAMI.2005.166.

Learning weighted metrics to minimize nearest-neighbor classification error.

IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1100-10. doi: 10.1109/TPAMI.2006.145.

Convergence and application of online active sampling using orthogonal pillar vectors.

IEEE Trans Pattern Anal Mach Intell. 2004 Sep;26(9):1197-207. doi: 10.1109/TPAMI.2004.61.

A scale space approach for automatically segmenting words from historical handwritten documents.

IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1212-25. doi: 10.1109/TPAMI.2005.150.

Cited by

Inferring test models from user bug reports using multi-objective search.

Empir Softw Eng. 2023;28(4):95. doi: 10.1007/s10664-023-10333-8. Epub 2023 Jun 20.
