• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Genome structure described by formal languages.用形式语言描述的基因组结构。
Nucleic Acids Res. 1984 Mar 12;12(5):2561-8. doi: 10.1093/nar/12.5.2561.
2
Complete nucleotide sequence of wound tumor virus genomic segment S7.创伤肿瘤病毒基因组片段S7的完整核苷酸序列
Nucleic Acids Res. 1989 Apr 25;17(8):3300. doi: 10.1093/nar/17.8.3300.
3
The evolution of RNA viruses.RNA病毒的进化
Annu Rev Microbiol. 1982;36:47-73. doi: 10.1146/annurev.mi.36.100182.000403.
4
Evolution of RNA genomes: does the high mutation rate necessitate high rate of evolution of viral proteins?RNA基因组的进化:高突变率是否必然导致病毒蛋白的高进化速率?
J Mol Evol. 1989 Jun;28(6):524-7. doi: 10.1007/BF02602932.
5
The proteins encoded by rice grassy stunt virus RNA5 and RNA6 are only distantly related to the corresponding proteins of other members of the genus Tenuivirus.水稻草状矮化病毒RNA5和RNA6编码的蛋白质与纤细病毒属其他成员的相应蛋白质只有远缘关系。
J Gen Virol. 1997 Sep;78 ( Pt 9):2355-63. doi: 10.1099/0022-1317-78-9-2355.
6
Sequences of 3' end of genome and of 5' end of open reading frame 1a of lactate dehydrogenase-elevating virus and common junction motifs between 5' leader and bodies of seven subgenomic mRNAs.乳酸脱氢酶升高病毒基因组3'末端、开放阅读框1a 5'末端的序列以及七个亚基因组mRNA的5'前导序列与主体之间的共同连接基序。
J Gen Virol. 1993 Apr;74 ( Pt 4):643-59. doi: 10.1099/0022-1317-74-4-643.
7
Nucleotide sequence of rice dwarf virus genome segment 4.
J Gen Virol. 1990 Oct;71 ( Pt 10):2217-22. doi: 10.1099/0022-1317-71-10-2217.
8
Synthesis of subgenomic RNAs by positive-strand RNA viruses.正链RNA病毒亚基因组RNA的合成
Virology. 2000 Jul 20;273(1):1-8. doi: 10.1006/viro.2000.0421.
9
All subgenomic mRNAs of equine arteritis virus contain a common leader sequence.
Nucleic Acids Res. 1990 Jun 11;18(11):3241-7. doi: 10.1093/nar/18.11.3241.
10
Complete nucleotide sequence of a new double-stranded RNA virus from the rice blast fungus, Magnaporthe oryzae.来自稻瘟病菌Magnaporthe oryzae的一种新型双链RNA病毒的完整核苷酸序列。
Arch Virol. 2008;153(2):389-91. doi: 10.1007/s00705-007-1101-3. Epub 2007 Dec 13.

引用本文的文献

1
Language Modelling Techniques for Analysing the Impact of Human Genetic Variation.用于分析人类基因变异影响的语言建模技术
Bioinform Biol Insights. 2025 Sep 2;19:11779322251358314. doi: 10.1177/11779322251358314. eCollection 2025.
2
Compression principle and Zipf's Law of brevity in infochemical communication.信息素通讯中的压缩原理和齐夫简短律。
Biol Lett. 2022 Jul;18(7):20220162. doi: 10.1098/rsbl.2022.0162. Epub 2022 Jul 27.
3
AMPGAN v2: Machine Learning-Guided Design of Antimicrobial Peptides.AMPGAN v2:基于机器学习的抗菌肽设计。
J Chem Inf Model. 2021 May 24;61(5):2198-2207. doi: 10.1021/acs.jcim.0c01441. Epub 2021 Mar 31.
4
Information theoretic perspective on genome clustering.基因组聚类的信息论视角
Saudi J Biol Sci. 2021 Mar;28(3):1867-1889. doi: 10.1016/j.sjbs.2020.12.039. Epub 2020 Dec 31.
5
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome.DNABERT:用于基因组中DNA语言的基于变换器的预训练双向编码器表征模型。
Bioinformatics. 2021 Aug 9;37(15):2112-2120. doi: 10.1093/bioinformatics/btab083.
6
Estimating probabilistic context-free grammars for proteins using contact map constraints.利用接触图约束估计蛋白质的概率上下文无关文法。
PeerJ. 2019 Mar 18;7:e6559. doi: 10.7717/peerj.6559. eCollection 2019.
7
Probabilistic grammatical model for helix-helix contact site classification.用于螺旋-螺旋接触位点分类的概率语法模型。
Algorithms Mol Biol. 2013 Dec 18;8(1):31. doi: 10.1186/1748-7188-8-31.
8
Data Compression Concepts and Algorithms and their Applications to Bioinformatics.数据压缩概念、算法及其在生物信息学中的应用。
Entropy (Basel). 2010 Jan 1;12(1):34. doi: 10.3390/e12010034.
9
A stochastic context free grammar based framework for analysis of protein sequences.基于随机上下文无关语法的蛋白质序列分析框架。
BMC Bioinformatics. 2009 Oct 8;10:323. doi: 10.1186/1471-2105-10-323.
10
A formal language-based approach in biology.生物学中基于形式语言的方法。
Comp Funct Genomics. 2004;5(1):91-4. doi: 10.1002/cfg.364.

本文引用的文献

1
Lysis gene expression of RNA phage MS2 depends on a frameshift during translation of the overlapping coat protein gene.RNA噬菌体MS2的裂解基因表达取决于重叠衣壳蛋白基因翻译过程中的移码。
Nature. 1982 Jan 7;295(5844):35-41. doi: 10.1038/295035a0.
2
The genetic code. 3.遗传密码。3.
Sci Am. 1966 Oct;215(4):55-60 passim. doi: 10.1038/scientificamerican1066-55.
3
Developmental systems without cellular interactions, their languages and grammars.没有细胞相互作用的发育系统、它们的语言和语法。
J Theor Biol. 1971 Mar;30(3):455-84. doi: 10.1016/0022-5193(71)90002-6.
4
The 3'-terminal sequence of Escherichia coli 16S ribosomal RNA: complementarity to nonsense triplets and ribosome binding sites.大肠杆菌16S核糖体RNA的3'末端序列:与无义三联体及核糖体结合位点的互补性
Proc Natl Acad Sci U S A. 1974 Apr;71(4):1342-6. doi: 10.1073/pnas.71.4.1342.
5
Structure and function of phage RNA.噬菌体RNA的结构与功能。
Annu Rev Biochem. 1973;42:303-28. doi: 10.1146/annurev.bi.42.070173.001511.
6
A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase.一种通过DNA聚合酶引发合成来测定DNA序列的快速方法。
J Mol Biol. 1975 May 25;94(3):441-8. doi: 10.1016/0022-2836(75)90213-2.
7
Language processor generation with BNF inputs: methods and implementation.
Comput Programs Biomed. 1977 Jun;7(2):85-98. doi: 10.1016/0010-468x(77)90015-0.
8
An automaton analogue of unicellularity.单细胞性的自动机类似物。
Biosystems. 1979 Aug;11(2-3):133-62. doi: 10.1016/0303-2647(79)90007-8.
9
Initiation mechanisms of protein syntehesis.蛋白质合成的起始机制。
Prog Nucleic Acid Res Mol Biol. 1977;20:209-84. doi: 10.1016/s0079-6603(08)60474-2.
10
A new method for sequencing DNA.一种新的DNA测序方法。
Proc Natl Acad Sci U S A. 1977 Feb;74(2):560-4. doi: 10.1073/pnas.74.2.560.

用形式语言描述的基因组结构。

Genome structure described by formal languages.

作者信息

Brendel V, Busse H G

出版信息

Nucleic Acids Res. 1984 Mar 12;12(5):2561-8. doi: 10.1093/nar/12.5.2561.

DOI:10.1093/nar/12.5.2561
PMID:6200832
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC318685/
Abstract

Nucleic acid sequences may be looked upon as words over the alphabet of nucleotides. Naturally occurring DNAs and RNAs form subsets of the set of all possible words. The use of formal languages is proposed to describe the structure of these subsets. Regular languages defined by finite automata are introduced to demonstrate the application of the concept on RNA-phages of group I. This approach permits a concise characterization of grammatical patterns in genetic information.

摘要

核酸序列可以被看作是由核苷酸组成的字母表上的单词。天然存在的DNA和RNA构成了所有可能单词集合的子集。有人提议使用形式语言来描述这些子集的结构。引入由有限自动机定义的正则语言,以展示该概念在第一组RNA噬菌体中的应用。这种方法允许对遗传信息中的语法模式进行简洁的表征。