Alvarez-Venegas Raul, Avramova Zoya
Department of Biological Sciences, Lilly Hall of Life Sciences, Purdue University, West Lafayette, IN 47907-1392, USA.
Gene. 2002 Feb 20;285(1-2):25-37. doi: 10.1016/s0378-1119(02)00401-8.
SET-domain (SET: Su(var)3-9, E(z) and Trithorax)-containing proteins were collected through sequence searches of the available databases. After removing redundancies, the proteins belonging to three families, SU(VAR)3-9, E(Z) and Trithorax, were selected. The relationships among the different members were analyzed by pairwise alignment, compilation, and comparison of their SET-domains. The level of homology of the SET-domains defined the distribution of the proteins into families and into clades within the families. The architecture of the entire protein supported the distribution pattern built upon SET-domain similarity. Parallel cladistic and protein-architecture analyses outlined two plausible criteria for predicting function.