Exon-intron structure and evolution of the Lipocalin gene family.

Authors

Sánchez Diego, Ganfornina María D, Gutiérrez Gabriel, Marín Antonio

Affiliation

Departamento de Bioquímica y Fisiología y Genética Molecular-IBGM, Universidad de Valladolid-CSIC, Valladolid, Spain.

Publication information

Mol Biol Evol. 2003 May;20(5):775-83. doi: 10.1093/molbev/msg079. Epub 2003 Apr 2.

Abstract

The Lipocalins are an ancient protein family whose expression is currently confirmed in bacteria, protoctists, plants, arthropods, and chordates. The evolution of this protein family has been assessed previously using amino acid sequence phylogenies. In this report we use an independent set of characters derived from the gene structure (exon-intron arrangement) to infer a new lipocalin phylogeny. We also present the novel gene structure of three insect lipocalins. The position and phase of introns are well preserved among lipocalin clades when mapped onto a protein sequence alignment, suggesting the homologous nature of these introns. Because of this homology, we use the intron position and phase of 23 lipocalin genes to reconstruct a phylogeny by maximum parsimony and distance methods. These phylogenies are very similar to the phylogenies derived from protein sequence. This result is confirmed by congruence analysis, and a consensus tree shows the commonalities between the two source trees. Interestingly, the intron arrangement phylogeny shows that metazoan lipocalins have more introns than other eukaryotic lipocalins, and that intron gains have occurred in the C-termini of chordate lipocalins. We also analyze the relationship of intron arrangement and protein tertiary structure, as well as the relationship of lipocalins with members of the proposed structural superfamily of calycins. Our congruence analysis validates the gene structure data as a source of phylogenetic information and helps to further refine our hypothesis on the evolutionary history of lipocalins.
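As a rough illustration of how intron position and phase can serve as phylogenetic characters, the sketch below codes each (alignment column, phase) pair as a presence/absence character and computes a simple pairwise distance between genes. The abstract does not state the character coding or distance measure the authors used, so both are assumptions here, and the gene names and intron coordinates are hypothetical toy values, not data from the paper.

```python
from itertools import combinations

# Hypothetical toy data (NOT from the paper): each gene maps to the set of
# introns it carries, written as (alignment_column, phase).
introns = {
    "geneA": {(35, 0), (80, 1), (120, 2)},
    "geneB": {(35, 0), (80, 1)},
    "geneC": {(35, 0), (150, 1)},
}

def character_matrix(introns):
    """Code every distinct (position, phase) pair as a 0/1 character per gene."""
    characters = sorted(set().union(*introns.values()))
    matrix = {
        gene: [1 if c in sites else 0 for c in characters]
        for gene, sites in introns.items()
    }
    return characters, matrix

def pairwise_distances(matrix):
    """Proportion of characters at which two genes differ (assumed measure)."""
    dist = {}
    for a, b in combinations(sorted(matrix), 2):
        diffs = sum(x != y for x, y in zip(matrix[a], matrix[b]))
        dist[(a, b)] = diffs / len(matrix[a])
    return dist

characters, matrix = character_matrix(introns)
distances = pairwise_distances(matrix)
```

A matrix like `matrix` could be passed to a parsimony program, and `distances` to a distance-based tree builder. Treating an intron at the same column but a different phase as a separate character is one possible convention among several.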

