Sappington T W, Raikhel A S
Department of Entomology, Michigan State University, East Lansing 48824, USA.
J Mol Evol. 1998 Apr;46(4):476-87. doi: 10.1007/pl00006328.
Insect vitellogenin and yolk protein receptors (VgR/YPR) are newly discovered members of the low-density lipoprotein receptor (LDLR) family, which is characterized by a highly conserved arrangement of repetitive modular elements homologous to functionally unrelated proteins. The insect VgR/YPRs are unique in having two clusters of complement-type cysteine-rich (class A) repeats or modules, with five modules in the first cluster and seven in the second cluster, unlike classical LDLRs which have a single seven-module cluster, vertebrate VgRs and very low density lipoprotein receptors (VLDLR) which have a single eight-module cluster, and LDLR-related proteins (LRPs) and megalins which have four clusters of 2-7, 8, 10, and 11 modules. Alignment of clusters across subfamilies by conventional alignment programs is problematic because of the repetitive nature of the component modules which may have undergone rearrangements, duplications, and deletions during evolution. To circumvent this problem, we "fingerprinted" each class A module in the different clusters by identifying those amino acids that are both relatively conserved and relatively unique within the cluster. Intercluster reciprocal comparisons of fingerprints and aligned sequences allowed us to distinguish four cohorts of modules reflecting shared recent ancestry. All but two of the 57 modules examined could be assigned to one of these four cohorts designated A, B, C, and D. Alignment of clusters based on modular cohorts revealed that all clusters are derived from a single primordial cluster of at least seven modules with a consensus arrangement of CDCADBC. All extant clusters examined are consistent with this consensus, though none matches it perfectly. This analysis also revealed that the eight-module clusters in vertebrate VgRs, insect VgR/YPRs, and LRP/megalins are not directly homologous with one another. Assignment of modules to cohorts permitted us to properly align 32 class A clusters from all four LDLR subfamilies for phylogenetic analysis. The results revealed that smaller one-cluster and two-cluster members of the family did not originate from the breakup of a large two-cluster or four-cluster receptor. Similarly, the LRP/megalins did not arise from the duplication of a two-cluster insect VgR/YPR-like progenitor. Rather, it appears that the multicluster receptors were independently constructed from the same single-cluster ancestor.
昆虫卵黄原蛋白和卵黄蛋白受体(VgR/YPR)是低密度脂蛋白受体(LDLR)家族新发现的成员,该家族的特征是重复模块元件高度保守排列,这些元件与功能不相关的蛋白质同源。昆虫VgR/YPR的独特之处在于有两簇补体类型富含半胱氨酸(A类)重复序列或模块,第一簇有五个模块,第二簇有七个模块,这与经典的LDLR不同,后者有一个单一的七个模块簇,脊椎动物VgR和极低密度脂蛋白受体(VLDLR)有一个单一的八个模块簇,以及LDLR相关蛋白(LRP)和巨膜蛋白有四簇分别为2 - 7、8、10和11个模块。由于组成模块具有重复性,在进化过程中可能经历了重排、复制和缺失,使用传统比对程序对亚家族间的簇进行比对存在问题。为了解决这个问题,我们通过识别在簇内相对保守且相对独特的氨基酸,对不同簇中的每个A类模块进行了“指纹识别”。指纹和比对序列的簇间相互比较使我们能够区分反映共同近期祖先的四类模块。在检查的57个模块中,除了两个之外,其余所有模块都可以归为这四类之一,分别命名为A、B、C和D。基于模块类群的簇比对显示,所有簇都源自至少七个模块的单一原始簇,其共有排列为CDCADBC。所有检查的现存簇都与这个共有排列一致,尽管没有一个完全匹配。该分析还表明脊椎动物VgR、昆虫VgR/YPR和LRP/巨膜蛋白中的八个模块簇彼此并非直接同源。将模块分配到类群使我们能够正确比对来自所有四个LDLR亚家族的32个A类簇,用于系统发育分析。结果表明,该家族中较小的单簇和双簇成员并非源自大型双簇或四簇受体的分裂。同样,LRP/巨膜蛋白也不是由双簇昆虫VgR/YPR样祖先进化而来。相反,似乎多簇受体是由同一个单簇祖先独立构建而成。