Plunkett G, Burland V, Daniels D L, Blattner F R
Laboratory of Genetics, University of Wisconsin, Madison 53706.
Nucleic Acids Res. 1993 Jul 25;21(15):3391-8. doi: 10.1093/nar/21.15.3391.
The DNA sequence of 96.5 kilobases of the Escherichia coli K-12 genome has been determined, spanning the region between rrnA at 87.2 minutes and katG at 89.2 minutes on the genetic map. The sequence includes 84 open reading frames, of which 46 code for unidentified proteins. Six previously mapped but unsequenced genes have been identified in this span: mob, fdhD, rhaD, rhaA, rhaB, and kdgT. In addition, five new genes have been assigned: the heat shock genes hsIU and hsIV, and the genes fdoG, fdoH, and fdoI, which encode the three subunits of formate dehydrogenase-O. The arrangement of the genes relative to possible promoters and terminators suggests 57 potential transcription units. Other features include the precise location of the bacteriophage P2 attachment site attP2II, and eleven REP elements, including one containing 9 REP sequences--one of the largest such elements known. This segment brings the total length of contiguous finished sequence to 325 kilobases.
已确定大肠杆菌K - 12基因组96.5千碱基的DNA序列,该序列跨越遗传图谱上87.2分钟处的rrnA和89.2分钟处的katG之间的区域。该序列包含84个开放阅读框,其中46个编码未知蛋白质。在此区间内已鉴定出6个先前已定位但未测序的基因:mob、fdhD、rhaD、rhaA、rhaB和kdgT。此外,还确定了5个新基因:热休克基因hsIU和hsIV,以及编码甲酸脱氢酶 - O三个亚基的fdoG、fdoH和fdoI基因。基因相对于可能的启动子和终止子的排列表明有57个潜在的转录单元。其他特征包括噬菌体P2附着位点attP2II的精确位置,以及11个REP元件,其中一个包含9个REP序列——这是已知的最大此类元件之一。该片段使连续完成序列的总长度达到325千碱基。