Borodovsky Mark, Lomsadze Alex, Ivanov Nikolai, Mills Ryan
School of Biology and School of Biomedical Engineering, Georgia Institute of Technology, Atlanta, Georgia, USA.
Curr Protoc Bioinformatics. 2003 May;Chapter 4:Unit4.6. doi: 10.1002/0471250953.bi0406s01.
This unit presents eukaryotic GeneMark.hmm, a method for detecting genes in eukaryotic DNA sequences. The program uses Markov models of protein-coding and noncoding sequences, as well as positional nucleotide frequency matrices for predicting translational start, translational termination, and splice sites. These models, together with the length distributions of exons, introns, and intergenic regions, are integrated into a single hidden Markov model. The unit describes how to run the program over the Internet and locally on a Unix machine. It also discusses GeneMarkS EV, which can be used to detect genes in eukaryotic viruses.
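As an illustration of the positional nucleotide frequency matrices mentioned in the abstract, the following minimal Python sketch builds such a matrix from a few aligned example sites and uses it to score candidate windows in a sequence. It is not GeneMark.hmm code: the training sites, the pseudocount, the uniform background frequency, and all function names are hypothetical choices made for this example only, and the actual program combines such site models with Markov models and length distributions inside a hidden Markov model.

```python
import math

ALPHABET = "ACGT"

# Hypothetical toy alignment of donor-site-like windows (not real training data).
TRAINING_SITES = ["AGGTAAGT", "CGGTGAGT", "AGGTAAGA", "TGGTAAGT"]


def build_frequency_matrix(sites, pseudocount=1.0):
    """Return one {nucleotide: probability} dict per alignment column."""
    length = len(sites[0])
    matrix = []
    for pos in range(length):
        # Start from a pseudocount so unseen nucleotides keep a nonzero probability.
        counts = {nt: pseudocount for nt in ALPHABET}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        matrix.append({nt: c / total for nt, c in counts.items()})
    return matrix


def log_odds_score(window, matrix, background=0.25):
    """Sum over positions of log2(P_site / P_background); assumes a uniform background."""
    return sum(math.log2(matrix[i][nt] / background) for i, nt in enumerate(window))


def best_site(sequence, matrix):
    """Slide the matrix along the sequence and return (offset, score) of the top window."""
    width = len(matrix)
    scored = [
        (log_odds_score(sequence[i:i + width], matrix), i)
        for i in range(len(sequence) - width + 1)
    ]
    score, offset = max(scored)
    return offset, score


if __name__ == "__main__":
    pfm = build_frequency_matrix(TRAINING_SITES)
    offset, score = best_site("TTTTAGGTAAGTCCCC", pfm)
    print(f"best window starts at offset {offset} with log-odds score {score:.2f}")
```

A positive log-odds score means the window resembles the example sites more than the assumed background does. In a full gene finder, a score of this kind would be only one term among several that determine the most likely gene structure.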