Zheng Yu, Posfai Janos, Morgan Richard D, Vincze Tamas, Roberts Richard J
New England Biolabs, Inc., 240 County Road, Ipswich, MA 01938, USA.
Nucleic Acids Res. 2009 Jan;37(1):e1. doi: 10.1093/nar/gkn883. Epub 2008 Nov 6.
Whole genome shotgun sequence analysis has become the standard method for beginning to determine a genome sequence. The preparation of the shotgun sequence clones is, in fact, a biological experiment. It determines which segments of the genome can be cloned into Escherichia coli and which cannot. By analyzing the complete set of sequences from such an experiment, it is possible to identify genes lethal to E. coli. Among this set are genes encoding restriction enzymes which, when active in E. coli, lead to cell death by cleaving the E. coli genome at the restriction enzyme recognition sites. By analyzing shotgun sequence data sets we show that this is a reliable method to detect active restriction enzyme genes in newly sequenced genomes, thereby facilitating functional annotation. Active restriction enzyme genes have been identified, and their activity demonstrated biochemically, in the sequenced genomes of Methanocaldococcus jannaschii, Bacillus cereus ATCC 10987 and Methylococcus capsulatus.
全基因组鸟枪法测序分析已成为初步确定基因组序列的标准方法。实际上,制备鸟枪法测序克隆是一个生物学实验。它能确定基因组的哪些片段可以克隆到大肠杆菌中,哪些不能。通过分析该实验的完整序列集,有可能鉴定出对大肠杆菌致死的基因。其中包括编码限制酶的基因,这些基因在大肠杆菌中激活时,会通过在限制酶识别位点切割大肠杆菌基因组而导致细胞死亡。通过分析鸟枪法序列数据集,我们表明这是一种在新测序基因组中检测活性限制酶基因的可靠方法,从而有助于功能注释。在詹氏甲烷球菌、蜡样芽孢杆菌ATCC 10987和荚膜甲基球菌的测序基因组中,已鉴定出活性限制酶基因,并通过生化方法证明了它们的活性。