Suppr超能文献

一种基于内存高效确定性有限自动机的位分割字符串匹配方案,用于深度包检测中的模式唯一性。

A memory-efficient deterministic finite automaton-based bit-split string matching scheme using pattern uniqueness in deep packet inspection.

作者信息

Kim HyunJin, Choi Kang-Il, Choi Sang-Il

机构信息

School of Electronics and Electrical Engineering, Dankook University, Yongin-si, Republic of Korea.

Advanced Communications Research Laboratory, Electronics and Telecommunications Research Institute, Daejeon, Republic of Korea.

出版信息

PLoS One. 2015 May 4;10(5):e0126517. doi: 10.1371/journal.pone.0126517. eCollection 2015.

Abstract

This paper proposes a memory-efficient bit-split string matching scheme for deep packet inspection (DPI). When the number of target patterns becomes large, the memory requirements of the string matching engine become a critical issue. The proposed string matching scheme reduces the memory requirements using the uniqueness of the target patterns in the deterministic finite automaton (DFA)-based bit-split string matching. The pattern grouping extracts a set of unique patterns from the target patterns. In the set of unique patterns, a pattern is not the suffix of any other patterns. Therefore, in the DFA constructed with the set of unique patterns, when only one pattern can be matched in an output state. In the bit-split string matching, multiple finite-state machine (FSM) tiles with several input bit groups are adopted in order to reduce the number of stored state transitions. However, the memory requirements for storing the matching vectors can be large because each bit in the matching vector is used to identify whether its own pattern is matched or not. In our research, the proposed pattern grouping is applied to the multiple FSM tiles in the bit-split string matching. For the set of unique patterns, the memory-based bit-split string matching engine stores only the pattern match index for each state to indicate the match with its own unique pattern. Therefore, the memory requirements are significantly decreased by not storing the matching vectors in the string matchers for the set of unique patterns. The experimental results show that the proposed string matching scheme can reduce the storage cost significantly compared to the previous bit-split string matching methods.

摘要

本文提出了一种用于深度包检测(DPI)的内存高效位分割字符串匹配方案。当目标模式的数量变得很大时,字符串匹配引擎的内存需求就成为一个关键问题。所提出的字符串匹配方案在基于确定性有限自动机(DFA)的位分割字符串匹配中利用目标模式的唯一性来降低内存需求。模式分组从目标模式中提取一组唯一模式。在这组唯一模式中,一个模式不是任何其他模式的后缀。因此,在用这组唯一模式构建的DFA中,在一个输出状态下只能匹配一个模式。在位分割字符串匹配中,采用具有多个输入位组的多个有限状态机(FSM)切片来减少存储状态转换的数量。然而,由于匹配向量中的每一位都用于识别其自身模式是否匹配,所以存储匹配向量的内存需求可能很大。在我们的研究中,所提出的模式分组应用于位分割字符串匹配中的多个FSM切片。对于这组唯一模式,基于内存的位分割字符串匹配引擎仅为每个状态存储模式匹配索引,以指示与其自身唯一模式的匹配情况。因此,通过不在唯一模式集的字符串匹配器中存储匹配向量,内存需求显著降低。实验结果表明,与先前的位分割字符串匹配方法相比,所提出的字符串匹配方案可以显著降低存储成本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8f3/4418843/5410d0f6e746/pone.0126517.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验