Fridmanis Davids, Fredriksson Robert, Kapa Ivo, Schiöth Helgi B, Klovins Janis
Biomedical Research and Study Centre, University of Latvia, Ratsupites 1, Riga, Latvia.
Mol Phylogenet Evol. 2007 Jun;43(3):864-80. doi: 10.1016/j.ympev.2006.11.007. Epub 2006 Nov 17.
Mammalian G protein-coupled receptor (GPCR) genes are characterised by a large proportion of intronless genes or a lower density of introns when compared with GPCRs of invertebrates. It is unclear which mechanisms have influenced intron density in this protein family, which is one of the largest in the mammalian genomes. We used a combination of Hidden Markov Models (HMM) and BLAST searches to establish the comprehensive repertoire of Rhodopsin GPCRs from seven species and performed overall alignments and phylogenetic analysis using the maximum parsimony method for over 1400 receptors in 12 subgroups. We identified 14 different Ancestral Receptor Groups (ARGs) that have members in both vertebrate and invertebrate species. We found that there exists a remarkable difference in the intron density among ancestral and new Rhodopsin GPCRs. The intron density among ARGs members was more than 3.5-fold higher than that within non-ARG members and more than 2-fold higher when considering only the 7TM region. This suggests that the new GPCR genes have been predominantly formed intronless while the ancestral receptors likely accumulated introns during their evolution. Many of the intron positions found in mammalian ARG receptor sequences were found to be present in orthologue invertebrate receptors suggesting that these intron positions are ancient. This analysis also revealed that one intron position is much more frequent than any other position and it is common for a number of phylogenetically different Rhodopsin GPCR groups. This intron position lies within a functionally important, conserved, DRY motif which may form a proto-splice site that could contribute to positional intron insertion. Moreover, we have found that other receptor motifs, similar to DRY, also contain introns between the second and third nucleotide of the arginine codon which also forms a proto-splice site. Our analysis presents compelling evidence that there was not a major loss of introns in mammalian GPCRs and formation of new GPCRs among mammals explains why these have fewer introns compared to invertebrate GPCRs. We also discuss and speculate about the possible role of different RNA- and DNA-based mechanisms of intron insertion and loss.
与无脊椎动物的G蛋白偶联受体(GPCR)相比,哺乳动物的GPCR基因具有大量无内含子基因或较低的内含子密度。目前尚不清楚是哪些机制影响了这个蛋白质家族(它是哺乳动物基因组中最大的家族之一)的内含子密度。我们结合隐马尔可夫模型(HMM)和BLAST搜索,确定了七个物种中视紫红质GPCR的完整目录,并使用最大简约法对12个亚组中的1400多个受体进行了整体比对和系统发育分析。我们确定了14个不同的祖先受体组(ARG),它们在脊椎动物和无脊椎动物物种中都有成员。我们发现,祖先视紫红质GPCR和新的视紫红质GPCR之间的内含子密度存在显著差异。ARG成员中的内含子密度比非ARG成员中的内含子密度高3.5倍以上,仅考虑7TM区域时则高2倍以上。这表明新的GPCR基因主要是无内含子形成的,而祖先受体在进化过程中可能积累了内含子。在哺乳动物ARG受体序列中发现的许多内含子位置,在无脊椎动物的直系同源受体中也存在,这表明这些内含子位置是古老的。该分析还揭示,一个内含子位置比其他任何位置出现得都更频繁,并且在许多系统发育不同的视紫红质GPCR组中都很常见。这个内含子位置位于一个功能重要、保守的DRY基序内,该基序可能形成一个原剪接位点,有助于位置性内含子插入。此外,我们发现其他类似于DRY的受体基序,在精氨酸密码子的第二个和第三个核苷酸之间也含有内含子,这也形成了一个原剪接位点。我们的分析提供了令人信服的证据,表明哺乳动物GPCR中没有大量内含子丢失,哺乳动物中新GPCR的形成解释了为什么与无脊椎动物GPCR相比,它们的内含子较少。我们还讨论并推测了基于RNA和DNA的不同内含子插入和丢失机制的可能作用。