Johnston Jodie M, Arcus Vickery L, Morton Craig J, Parker Michael W, Baker Edward N
School of Biological Sciences, University of Auckland, Auckland, New Zealand. St. Vincent's Institute of Medical Research, Fitzroy, Victoria 3065, Australia.
J Bacteriol. 2003 Jul;185(14):4057-65. doi: 10.1128/JB.185.14.4057-4065.2003.
Bioinformatic analyses of whole genome sequences highlight the problem of identifying the biochemical and cellular functions of many gene products that are at present uncharacterized. The open reading frame Rv3853 from Mycobacterium tuberculosis has been annotated as menG and assumed to encode an S-adenosylmethionine (SAM)-dependent methyltransferase that catalyzes the final step in menaquinone biosynthesis. The Rv3853 gene product has been expressed, refolded, purified, and crystallized in the context of a structural genomics program. Its crystal structure has been determined by isomorphous replacement and refined at 1.9 A resolution to an R factor of 19.0% and R(free) of 22.0%. The structure strongly suggests that this protein is not a SAM-dependent methyltransferase and that the gene has been misannotated in this and other genomes that contain homologs. The protein forms a tightly associated, disk-like trimer. The monomer fold is unlike that of any known SAM-dependent methyltransferase, most closely resembling the phosphohistidine domains of several phosphotransfer systems. Attempts to bind cofactor and substrate molecules have been unsuccessful, but two adventitiously bound small-molecule ligands, modeled as tartrate and glyoxalate, are present on each monomer. These may point to biologically relevant binding sites but do not suggest a function. In silico screening indicates a range of ligands that could occupy these and other sites. The nature of these ligands, coupled with the location of binding sites on the trimer, suggests that proteins of the Rv3853 family, which are distributed throughout microbial and plant species, may be part of a larger assembly binding to nucleic acids or proteins.
对全基因组序列的生物信息学分析凸显了识别目前尚未表征的许多基因产物的生化和细胞功能这一问题。来自结核分枝杆菌的开放阅读框Rv3853已被注释为menG,并被假定编码一种依赖S-腺苷甲硫氨酸(SAM)的甲基转移酶,该酶催化甲萘醌生物合成的最后一步。在一个结构基因组学项目中,Rv3853基因产物已被表达、复性、纯化并结晶。其晶体结构已通过同晶置换法确定,并在1.9 Å分辨率下进行了精修,R因子为19.0%,R(自由)为22.0%。该结构强烈表明该蛋白质不是依赖SAM的甲基转移酶,并且该基因在包含同源物的这个和其他基因组中被错误注释。该蛋白质形成紧密结合的盘状三聚体。单体折叠与任何已知的依赖SAM的甲基转移酶不同,最类似于几种磷酸转移系统的磷酸组氨酸结构域。结合辅因子和底物分子的尝试未成功,但每个单体上存在两个偶然结合的小分子配体,模拟为酒石酸盐和乙醛酸盐。这些可能指向生物学上相关的结合位点,但未表明其功能。计算机模拟筛选表明一系列配体可以占据这些位点和其他位点。这些配体的性质,加上三聚体上结合位点的位置,表明分布在微生物和植物物种中的Rv3853家族蛋白质可能是与核酸或蛋白质结合的更大组装体的一部分。