Hagiwara H, Naitou M, Shibata T, Hanaoka F, Eki T, Murakami Y
Laboratory of Cellular Physiology, Institute of Physical and Chemical Research (RIKEN), Ibaraki, Japan.
DNA Res. 1995 Dec 31;2(6):247-53. doi: 10.1093/dnares/2.6.247.
We searched the nucleotide sequence of budding yeast Saccharomyces cerevisiae chromosome VI (270 kb) for candidate coding regions, using the computer program GenMark. One hundred and twenty-nine putative genes were identified, which is almost the same as the number of ORFs on this chromosome. Nineteen new putative genes were identified through the GenMark analysis. Most large ORFs were also correctly identified (87% of the predicted putative genes identified by the GenMark (110 of 127) matched the reported ORFs). The new coding regions were mostly small but they were distinguished from the more than 2000 ORFs identified by Genetyx. GenMark did not predict 17 ORFs that were over 300 bp long. As these ORFs include known genes, their sequence context may differ somewhat from that of typical yeast genes. These analyses revealed the high potential of GenMark to identify putative genes from numerous short ORFs and will produce information on the likelihood of their being actual genes.
我们使用计算机程序GenMark在出芽酵母酿酒酵母第六条染色体(270 kb)的核苷酸序列中搜索候选编码区。共鉴定出129个推定基因,这与该染色体上的开放阅读框数量几乎相同。通过GenMark分析鉴定出19个新的推定基因。大多数大的开放阅读框也被正确鉴定(GenMark鉴定出的预测推定基因中有87%(127个中的110个)与报道的开放阅读框匹配)。新的编码区大多较小,但它们与Genetyx鉴定出的2000多个开放阅读框有所不同。GenMark没有预测出17个长度超过300 bp的开放阅读框。由于这些开放阅读框包含已知基因,它们的序列背景可能与典型酵母基因的序列背景有所不同。这些分析揭示了GenMark从众多短开放阅读框中鉴定推定基因的巨大潜力,并将产生有关它们成为实际基因可能性的信息。