Department of Biological Sciences, Idaho State University, Pocatello, 83209, USA.
BMC Evol Biol. 2010 May 10;10:139. doi: 10.1186/1471-2148-10-139.
CC chemokine receptor proteins (CCR1 through CCR10) are seven-transmembrane G-protein coupled receptors whose signaling pathways are known for their important roles coordinating immune system responses through targeted trafficking of white blood cells. In addition, some of these receptors have been identified as fusion proteins for viral pathogens: for example, HIV-1 strains utilize CCR5, CCR2 and CCR3 proteins to obtain cellular entry in humans. The extracellular domains of these receptor proteins are involved in ligand-binding specificity as well as pathogen recognition interactions.In mammals, the majority of chemokine receptor genes are clustered together; in humans, seven of the ten genes are clustered in the 3p21-24 chromosome region. Gene conversion events, or exchange of DNA sequence between genes, have been reported in chemokine receptor paralogs in various mammalian lineages, especially between the cytogenetically closely located pairs CCR2/5 and CCR1/3. Datasets of mammalian orthologs for each gene were analyzed separately to minimize the potential confounding impact of analyzing highly similar sequences resulting from gene conversion events.Molecular evolution approaches and the software package Phylogenetic Analyses by Maximum Likelihood (PAML) were utilized to investigate the signature of selection that has acted on the mammalian CC chemokine receptor (CCR) gene family. The results of neutral vs. adaptive evolution (positive selection) hypothesis testing using Site Models are reported. In general, positive selection is defined by a ratio of nonsynonymous/synonymous nucleotide changes (dN/dS, or omega) >1.
Of the ten mammalian CC motif chemokine receptor sequence datasets analyzed, only CCR2 and CCR3 contain amino acid codon sites that exhibit evidence of positive selection using site based hypothesis testing in PAML. Nineteen of the twenty codon sites putatively indentified as likely to be under positive selection code for amino acid residues located in extracellular domains of the receptor protein products.
These results suggest that amino acid residues present in intracellular and membrane-bound domains are more selectively constrained for functional signal transduction and homo- or heterodimerization, whereas amino acid residues in extracellular domains of these receptor proteins evolve more quickly, perhaps due to heightened selective pressure resulting from ligand-binding and pathogen interactions of extracellular domains.
CC 趋化因子受体蛋白(CCR1 至 CCR10)是七次跨膜 G 蛋白偶联受体,其信号通路以协调免疫系统通过白细胞靶向运输的重要作用而闻名。此外,其中一些受体已被鉴定为病毒病原体的融合蛋白:例如,HIV-1 株利用 CCR5、CCR2 和 CCR3 蛋白在人类中获得细胞进入。这些受体蛋白的细胞外结构域参与配体结合特异性以及病原体识别相互作用。在哺乳动物中,大多数趋化因子受体基因聚集在一起;在人类中,十个基因中的七个基因聚集在 3p21-24 染色体区域。在各种哺乳动物谱系中,已经报道了趋化因子受体旁系同源物之间的基因转换事件,或 DNA 序列在基因之间的交换,尤其是在细胞遗传学上紧密定位的 CCR2/5 和 CCR1/3 对之间。分别分析每个基因的哺乳动物直系同源物数据集,以最大程度地减少由于基因转换事件导致的高度相似序列分析的潜在混杂影响。利用最大似然法(PAML)的分子进化方法和软件包分析了趋化因子受体(CCR)基因家族的选择特征。报告了使用站点模型进行中性与适应性进化(正选择)假设检验的结果。一般来说,正选择定义为非同义/同义核苷酸变化的比值(dN/dS,或ω)>1。
在分析的十个哺乳动物 CC 基序趋化因子受体序列数据集中,只有 CCR2 和 CCR3 包含使用 PAML 中的基于站点的假设检验显示正选择证据的氨基酸密码子位点。在二十个推测的可能受正选择影响的密码子位点中,有 19 个编码位于受体蛋白产物细胞外结构域的氨基酸残基。
这些结果表明,存在于细胞内和膜结合结构域中的氨基酸残基对于功能信号转导和同型或异型二聚体化的选择性约束更大,而这些受体蛋白的细胞外结构域中的氨基酸残基进化得更快,可能是由于细胞外结构域的配体结合和病原体相互作用导致的选择性压力增加所致。