Dillon Shane C, Bateman Alex
Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, UK.
BMC Bioinformatics. 2004 Aug 12;5:109. doi: 10.1186/1471-2105-5-109.
The Hotdog fold was initially identified in the structure of Escherichia coli FabA and subsequently in 4-hydroxybenzoyl-CoA thioesterase from Pseudomonas sp. strain CBS. Since that time structural determinations have shown a number of other apparently unrelated proteins also share the Hotdog fold.
Using sequence analysis we unify a large superfamily of HotDog domains. Membership includes numerous prokaryotic, archaeal and eukaryotic proteins involved in several related, but distinct, catalytic activities, from metabolic roles such as thioester hydrolysis in fatty acid metabolism, to degradation of phenylacetic acid and the environmental pollutant 4-chlorobenzoate. The superfamily also includes FapR, a non-catalytic bacterial homologue that is involved in transcriptional regulation of fatty acid biosynthesis. We have defined 17 subfamilies, with some characterisation. Operon analysis has revealed numerous HotDog domain-containing proteins to be fusion proteins, where two genes, once separate but adjacent open-reading frames, have been fused into one open-reading frame to give a protein with two functional domains. Finally we have generated a Hidden Markov Model library from our analysis, which can be used as a tool for predicting the occurrence of HotDog domains in any protein sequence.
The HotDog domain is both an ancient and ubiquitous motif, with members found in the three branches of life.
热狗折叠最初是在大肠杆菌FabA的结构中被鉴定出来的,随后在假单胞菌属菌株CBS的4-羟基苯甲酰辅酶A硫酯酶中也被发现。从那时起,结构测定表明许多其他明显不相关的蛋白质也具有热狗折叠。
通过序列分析,我们统一了一个大型的热狗结构域超家族。其成员包括众多原核生物、古细菌和真核生物蛋白质,这些蛋白质参与了几种相关但不同的催化活性,从脂肪酸代谢中的硫酯水解等代谢作用,到苯乙酸和环境污染物4-氯苯甲酸的降解。该超家族还包括FapR,一种非催化性细菌同源物,它参与脂肪酸生物合成的转录调控。我们定义了17个亚家族,并进行了一些特征描述。操纵子分析表明,许多含有热狗结构域的蛋白质是融合蛋白,即两个曾经分开但相邻的开放阅读框的基因融合成了一个开放阅读框,从而产生了一种具有两个功能结构域的蛋白质。最后,我们通过分析生成了一个隐马尔可夫模型库,可作为预测任何蛋白质序列中热狗结构域出现情况的工具。
热狗结构域是一个既古老又普遍存在的基序,在生命的三个分支中都有发现。