Delcher A L, Harmon D, Kasif S, White O, Salzberg S L
Department of Computer Science, Loyola College in Maryland, Baltimore, MD 21210, USA.
Nucleic Acids Res. 1999 Dec 1;27(23):4636-41. doi: 10.1093/nar/27.23.4636.
The GLIMMER system for microbial gene identification finds approximately 97-98% of all genes in a genome when compared with published annotation. This paper reports two new results: (i) significant technical improvements to GLIMMER that raise its accuracy still further, and (ii) a comprehensive evaluation demonstrating that the accuracy of the system is likely to be higher than previously recognized. A significant proportion of the genes missed by the system appear to be hypothetical proteins whose existence is supported only by the predictions of other programs. When the analysis is restricted to genes that have significant homology to genes in other organisms, GLIMMER misses fewer than 1% of known genes.
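The accuracy figures above are fractions of annotated genes that the gene finder recovers. As a minimal sketch of how such a comparison could be computed, the Python below counts an annotated gene as found when a prediction shares its strand and 3' end. The `Gene` type, the `fraction_found` function, the 3' end matching rule, and the toy coordinates are all assumptions made for illustration; the abstract does not state the matching criterion used in the paper.

```python
from typing import Iterable, NamedTuple


class Gene(NamedTuple):
    """Hypothetical gene record: 1-based inclusive coordinates plus strand."""
    left: int     # leftmost coordinate on the forward strand
    right: int    # rightmost coordinate on the forward strand
    strand: str   # "+" or "-"


def three_prime_end(gene: Gene) -> tuple[str, int]:
    """Return (strand, coordinate of the 3' end).

    Assumption: two calls describe the same gene if they end at the same
    stop codon, even when the chosen start codons differ.
    """
    end = gene.right if gene.strand == "+" else gene.left
    return (gene.strand, end)


def fraction_found(annotated: Iterable[Gene], predicted: Iterable[Gene]) -> float:
    """Fraction of annotated genes whose 3' end matches some prediction."""
    annotated = list(annotated)
    if not annotated:
        raise ValueError("annotation is empty")
    predicted_ends = {three_prime_end(g) for g in predicted}
    found = sum(1 for g in annotated if three_prime_end(g) in predicted_ends)
    return found / len(annotated)


# Toy data, invented for illustration only.
annotation = [
    Gene(100, 400, "+"),
    Gene(500, 900, "-"),
    Gene(1000, 1300, "+"),
    Gene(1500, 1800, "+"),
]
predictions = [
    Gene(130, 400, "+"),    # same stop codon, different start: counted as found
    Gene(500, 870, "-"),    # reverse strand: 3' end is the left coordinate
    Gene(1000, 1300, "+"),  # exact match
    Gene(2000, 2300, "+"),  # prediction with no annotated counterpart
]

print(f"found: {fraction_found(annotation, predictions):.2%}")
```

Restricting `annotation` to genes with database homologs before calling `fraction_found` would mirror the restricted analysis described in the abstract, although the homology filter itself is outside the scope of this sketch.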