IFM Bioinformatics and SeRC (Swedish e-Science Research Centre), Linköping University, Linköping, Sweden.
Chem Biol Interact. 2013 Feb 25;202(1-3):111-5. doi: 10.1016/j.cbi.2012.11.009. Epub 2012 Nov 29.
The short-chain dehydrogenases/reductases (SDRs) constitute one of the largest protein superfamilies known today. The members are distantly related with typically 20-30% residue identity in pair-wise comparisons. Still, all hitherto structurally known SDRs present a common three-dimensional structure consisting of a Rossmann fold with a parallel beta sheet flanked by three helices on each side. Using hidden Markov models (HMMs), we have developed a semi-automated subclassification system for this huge family. Currently, 75% of all SDR forms have been assigned to one of the 464 families totalling 122,940 proteins. There are 47 human SDR families, corresponding to 75 genes. Most human SDR families (35 families) have only one gene, while 12 have between 2 and 8 genes. For more than half of the human SDR families, the three-dimensional fold is known. The number of SDR members increases considerably every year, but the number of SDR families now starts to converge. The classification method has paved the ground for a sustainable and expandable nomenclature system. Information on the SDR superfamily is continuously updated at http://sdr-enzymes.org/.
短链脱氢酶/还原酶(SDRs)构成了当今已知的最大蛋白质超家族之一。成员之间的亲缘关系较远,两两比较时通常具有 20-30%的残基同一性。尽管如此,所有迄今已知的 SDR 结构都呈现出一种共同的三维结构,由一个具有平行β片层的 Rossmann 折叠组成,两侧各有三个螺旋。我们使用隐马尔可夫模型(HMMs)为这个庞大的家族开发了一种半自动的细分分类系统。目前,75%的 SDR 形式已被分配到总共 122940 个蛋白质的 464 个家族之一。有 47 个人类 SDR 家族,对应 75 个基因。大多数人类 SDR 家族(35 个家族)只有一个基因,而 12 个家族有 2 到 8 个基因。对于超过一半的人类 SDR 家族,三维折叠结构是已知的。每年 SDR 成员的数量都在大幅增加,但 SDR 家族的数量现在开始趋于收敛。分类方法为可持续和可扩展的命名系统奠定了基础。有关 SDR 超家族的信息在 http://sdr-enzymes.org/ 上不断更新。