Department of Biochemistry, University of Wisconsin, Madison, Wisconsin.
Protein Sci. 2020 Apr;29(4):930-940. doi: 10.1002/pro.3807. Epub 2020 Jan 20.
Tens of thousands of bacterial genome sequences are now known due to the development of rapid and inexpensive sequencing technologies. An important key in utilizing these vast amounts of data in a biologically meaningful way is to infer the function of the proteins encoded in the genomes via bioinformatics techniques. Whereas these approaches are absolutely critical to the annotation of gene function, there are still issues of misidentifications, which must be experimentally corrected. For example, many of the bacterial DNA sequences encoding sugar N-formyltransferases have been annotated as l-methionyl-tRNA transferases in the databases. These mistakes may be due in part to the fact that until recently the structures and functions of these enzymes were not well known. Herein we describe the misannotation of two genes, WP_088211966.1 and WP_096244125.1, from Shewanella spp. and Pseudomonas congelans, respectively. Although the proteins encoded by these genes were originally suggested to function as l-methionyl-tRNA transferases, we demonstrate that they actually catalyze the conversion of dTDP-4-amino-4,6-dideoxy-d-glucose to dTDP-4-formamido-4,6-dideoxy-d-glucose utilizing N -formyltetrahydrofolate as the carbon source. For this analysis, the genes encoding these enzymes were cloned and the corresponding proteins purified. X-ray structures of the two proteins were determined to high resolution and kinetic analyses were conducted. Both enzymes display classical Michaelis-Menten kinetics and adopt the characteristic three-dimensional structural fold previously observed for other sugar N-formyltransferases. The results presented herein will aid in the future annotation of these fascinating enzymes.
由于快速且廉价的测序技术的发展,现在已经知道了数以万计的细菌基因组序列。以生物学意义的方式利用这些大量数据的一个重要关键是通过生物信息学技术推断基因组中编码的蛋白质的功能。虽然这些方法对于基因功能的注释绝对至关重要,但仍然存在错误识别的问题,这些问题必须通过实验来纠正。例如,许多编码糖 N-甲酰基转移酶的细菌 DNA 序列在数据库中被注释为 l-甲硫氨酰-tRNA 转移酶。这些错误可能部分归因于直到最近这些酶的结构和功能还不是很清楚。本文描述了来自希瓦氏菌和假单胞菌的 WP_088211966.1 和 WP_096244125.1 两个基因的错误注释。尽管这些基因编码的蛋白质最初被认为具有 l-甲硫氨酰-tRNA 转移酶的功能,但我们证明它们实际上催化 dTDP-4-氨基-4,6-二脱氧-d-葡萄糖转化为 dTDP-4-甲酰基-4,6-二脱氧-d-葡萄糖,利用 N-甲酰四氢叶酸作为碳源。为此分析,克隆了这些酶的编码基因,并纯化了相应的蛋白质。确定了两种蛋白质的 X 射线结构,并进行了动力学分析。这两种酶都表现出经典的米氏动力学,并采用了以前观察到的其他糖 N-甲酰基转移酶的特征三维结构折叠。本文所提供的结果将有助于这些迷人的酶的未来注释。