Department of Cellular Biology, University of Georgia, Athens, GA 30602, USA.
Insect Biochem Mol Biol. 2010 Mar;40(3):189-204. doi: 10.1016/j.ibmb.2010.02.001. Epub 2010 Feb 18.
The availability of whole genome sequences of several arthropods has provided new insights into structural cuticular proteins (CPs), in particular the distribution of different families, the recognition that these proteins may comprise almost 2% of the protein coding genes of some species, and the identification of features that should aid in the annotation of new genomes and EST libraries as they become available. Twelve CP families are described: CPR (named after the Rebers and Riddiford Consensus); CPF (named because it has a highly conserved region consisting of about forty-four amino acids); CPFL (like the CPFs in a conserved C-terminal region); the TWDL family, named after a picturesque phenotype of one mutant member; four families in addition to TWDL with a preponderance of low complexity sequence that are not member of the families listed above. These were named after particular diagnostic features as CPLCA, CPLCG, CPLCW, CPLCP. There are also CPG, a lepidopteran family with an abundance of glycines, the apidermin family, named after three proteins in Apis mellifera, and CPAP1 and CPAP3, named because they have features analogous to peritrophins, namely one or three chitin-binding domains. Also described are common motifs and features. Four unusual CPs are discussed in detail. Data that facilitated the analysis of sequence variation of single CP genes in natural populations are analyzed.
几种节肢动物的全基因组序列的可用性为结构表皮蛋白 (CPs) 提供了新的见解,特别是不同家族的分布、这些蛋白质可能构成某些物种编码基因的近 2%的认识,以及识别有助于新基因组和 EST 文库注释的特征,因为它们变得可用。描述了 12 个 CP 家族:CPR(以 Rebers 和 Riddiford 共识命名);CPF(因其高度保守的区域由大约 44 个氨基酸组成而得名);CPFL(与 CPFs 在保守的 C 末端区域相同);以一个突变成员的生动表型命名的 TWDL 家族;除了 TWDL 之外,还有四个家族具有丰富的低复杂度序列,不属于上述家族。这些家族被命名为具有特定诊断特征的 CPLCA、CPLCG、CPLCW 和 CPLCP。还有 CPG,一个富含甘氨酸的鳞翅目家族,以及 apidermin 家族,以蜜蜂中的三种蛋白质命名,以及 CPAP1 和 CPAP3,因为它们具有类似于围食膜蛋白的特征,即一个或三个几丁质结合域。还描述了常见的基序和特征。详细讨论了四个不寻常的 CP。分析了有助于分析自然种群中单 CP 基因序列变异的数据。