Chabasse Christine, Bailly Xavier, Sanchez Sophie, Rousselot Morgane, Zal Franck
Equipe Ecophysiologie, Adaptation et Evolution Moléculaires, UPMC-CNRS UMR 7144, Station Biologique, BP 74, 29682, Roscoff cedex, France.
J Mol Evol. 2006 Sep;63(3):365-74. doi: 10.1007/s00239-005-0198-9. Epub 2006 Jul 12.
Giant extracellular hexagonal bilayer hemoglobin (HBL-Hb), found only in annelids, is an approximately 3500-kDa heteropolymeric structure involved in oxygen transport. HBL-Hbs are composed of globin and linker chains, the latter being required for assembly of the quaternary structure. The linker chains, which range in size from 225 to 283 amino acids, have a conserved cysteine-rich domain within their N-terminal moiety that is homologous to the cysteine-rich modules constituting the ligand-binding domain of the low-density lipoprotein receptor (LDLR) protein family found in many metazoans. We investigated the gene structure of linkers from Arenicola marina, Alvinella pompejana, Nereis diversicolor, Lumbricus terrestris, and Riftia pachyptila. Contrary to results obtained earlier with linker genes from N. diversicolor and L. terrestris, we found that in all of these species the linker LDL-A module is flanked by two phase 1 introns, as in the human LDLR gene, with two more introns on the 3' side whose positions vary with the species. In addition, we obtained 13 linker cDNAs, either determined experimentally or found in the EST database LumbriBASE. A molecular phylogenetic analysis of the linker primary sequences showed that they cluster into two distinct families of linker proteins. We propose that the common ancestor of annelid linker genes had a four-intron, five-exon structure and gave rise to the two families following a duplication event.