Besemer John, Borodovsky Mark
School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA.
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W451-4. doi: 10.1093/nar/gki487.
The task of gene identification frequently confronting researchers working with both novel and well studied genomes can be conveniently and reliably solved with the help of the GeneMark web software (http://opal.biology.gatech.edu/GeneMark/). The website provides interfaces to the GeneMark family of programs designed and tuned for gene prediction in prokaryotic, eukaryotic and viral genomic sequences. Currently, the server allows the analysis of nearly 200 prokaryotic and >10 eukaryotic genomes using species-specific versions of the software and pre-computed gene models. In addition, genes in prokaryotic sequences from novel genomes can be identified using models derived on the spot upon sequence submission, either by a relatively simple heuristic approach or by the full-fledged self-training program GeneMarkS. A database of reannotations of >1000 viral genomes by the GeneMarkS program is also available from the web site. The GeneMark website is frequently updated to provide the latest versions of the software and gene models.
对于研究新基因组和已深入研究基因组的研究人员而言,经常面临的基因识别任务可以借助GeneMark网络软件(http://opal.biology.gatech.edu/GeneMark/)方便且可靠地解决。该网站提供了与GeneMark程序家族的接口,这些程序是为原核生物、真核生物和病毒基因组序列中的基因预测而设计和优化的。目前,该服务器允许使用软件的物种特异性版本和预先计算的基因模型对近200个原核生物基因组和10多个真核生物基因组进行分析。此外,对于新基因组中的原核生物序列,可以在提交序列时使用当场推导的模型来识别基因,推导方式可以是相对简单的启发式方法,也可以是功能完备的自训练程序GeneMarkS。该网站还提供了一个由GeneMarkS程序对1000多个病毒基因组进行重新注释的数据库。GeneMark网站会频繁更新,以提供软件和基因模型的最新版本。