Department of Genome Oriented Bioinformatics, Technische Universität München, Wissenschaftszentrum Weihenstephan, D-85354 Freising, Germany.
Proteins. 2010 May 15;78(7):1760-73. doi: 10.1002/prot.22692.
For over 2 decades, continuous efforts to organize the jungle of available protein structures have been underway. Although a number of discrepancies between different classification approaches for soluble proteins have been reported, the classification of membrane proteins has so far not been comparatively studied because of the limited amount of available structural data. Here, we present an analysis of alpha-helical membrane protein classification in the SCOP and CATH databases. In the current set of 63 alpha-helical membrane protein chains having between 1 and 13 transmembrane helices, we observed a number of differently classified proteins both regarding their domain and fold assignment. The majority of all discrepancies affect single transmembrane helix, two helix hairpin, and four helix bundle domains, while domains with more than five helices are mostly classified consistently between SCOP and CATH. It thus appears that the structural constraints imposed by the lipid bilayer complicate the classification of membrane proteins with only few membrane-spanning regions. This problem seems to be specific for membrane proteins as soluble four helix bundles, not restrained by the membrane, are more consistently classified by SCOP and CATH. Our findings indicate that the structural space of small membrane helix bundles is highly continuous such that even minor differences in individual classification procedures may lead to a significantly different classification. Membrane proteins with few helices and limited structural diversity only seem to be reasonably classifiable if the definition of a fold is adapted to include more fine-grained structural features such as helix-helix interactions and reentrant regions.
二十多年来,人们一直在努力组织可用蛋白质结构的丛林。尽管已经报道了可溶性蛋白质的不同分类方法之间存在一些差异,但由于可用结构数据有限,膜蛋白的分类迄今尚未进行比较研究。在这里,我们分析了 SCOP 和 CATH 数据库中α螺旋膜蛋白的分类。在当前的 63 条具有 1 到 13 个跨膜螺旋的α螺旋膜蛋白链中,我们观察到许多在结构域和折叠分配方面分类不同的蛋白质。所有差异的大多数都影响单个跨膜螺旋、双螺旋发夹和四螺旋束结构域,而具有超过五个螺旋的结构域在 SCOP 和 CATH 之间大多分类一致。因此,似乎由脂质双层施加的结构约束使具有少数跨膜区域的膜蛋白的分类复杂化。这个问题似乎是膜蛋白特有的,因为不受膜限制的可溶性四螺旋束,通过 SCOP 和 CATH 更一致地分类。我们的研究结果表明,小膜螺旋束的结构空间是高度连续的,以至于即使在个别分类过程中的细微差异也可能导致显著不同的分类。只有当定义一个折叠时适应包括更精细的结构特征,如螺旋-螺旋相互作用和折返区域时,具有少数螺旋和有限结构多样性的膜蛋白似乎才能合理分类。