Brendel V, Busse H G
Nucleic Acids Res. 1984 Mar 12;12(5):2561-8. doi: 10.1093/nar/12.5.2561.
Nucleic acid sequences may be looked upon as words over the alphabet of nucleotides. Naturally occurring DNAs and RNAs form subsets of the set of all possible words. The use of formal languages is proposed to describe the structure of these subsets. Regular languages defined by finite automata are introduced to demonstrate the application of the concept on RNA-phages of group I. This approach permits a concise characterization of grammatical patterns in genetic information.
核酸序列可以被看作是由核苷酸组成的字母表上的单词。天然存在的DNA和RNA构成了所有可能单词集合的子集。有人提议使用形式语言来描述这些子集的结构。引入由有限自动机定义的正则语言,以展示该概念在第一组RNA噬菌体中的应用。这种方法允许对遗传信息中的语法模式进行简洁的表征。