Davuluri Ramana V, Zhang Michael Q
Department of Molecular Virology, Immunology, and Medical Genetics, Human Cancer Genetics Program, The Ohio State University, Columbus, OH, USA.
Methods Mol Biol. 2003;236:87-108. doi: 10.1385/1-59259-413-1:87.
Gene finding is the most important phase of genome annotation. Eukaryotic genomes contain thousands of protein coding genes, and computational gene prediction would rapidly increase the pace of experimental confirmation of expressed genes at the bench. The purpose of this chapter is to discuss the use of different computer programs that identify protein-coding genes in large genomic sequences. We describe most commonly used gene prediction programs that are available on the World Wide Web and demonstrate the use of some of these programs by an example. We provide a list of these programs along with their. Web uniform resource locators (URLs) and suggest guidelines for successful gene finding.
基因发现是基因组注释中最重要的阶段。真核生物基因组包含数千个蛋白质编码基因,而计算基因预测将迅速加快实验室中对表达基因进行实验验证的速度。本章的目的是讨论使用不同的计算机程序来识别大型基因组序列中的蛋白质编码基因。我们描述了万维网上可用的最常用的基因预测程序,并通过一个例子展示其中一些程序的使用方法。我们提供了这些程序的列表及其网络统一资源定位器(URL),并给出了成功进行基因发现的指导原则。