von Salomé Jenny, Kukkonen Jyrki P
University of Helsinki, Department of Basic Veterinary Sciences, Helsinki, Finland.
BMC Genomics. 2008 May 19;9:228. doi: 10.1186/1471-2164-9-228.
HLA/MHC class II molecules show high degree of polymorphism in the human population. The individual polymorphic motifs have been suggested to be propagated and mixed by transfer of genetic material (recombination, gene conversion) between alleles, but no clear molecular basis for this has been identified as yet. A large number of MHC class II allele sequences is publicly available and could be used to analyze the sequence features behind the recombination, revealing possible basis for such recombination processes both in HLA class II genes and other genes, which recombination acts upon.
In this study we analyzed the vast dataset of human allelic variants (49 full coding sequences, 374 full exon 2 sequences) of the most polymorphic MHC class II locus, HLA-DRB1, and identified many previously unknown sequence features possibly contributing to the recombination. The CpG-dinucleotide content of exon 2 (containing the antigen-binding sites and subsequently a high degree of polymorphism) was much elevated as compared to the other exons despite similar overall G+C content. Furthermore, the CpG pattern was highly conserved. We also identified more complex, highly conserved sequence motifs in exon 2. Some of these can be identified as putative recombination motifs previously found in other genes, but most are previously unidentified.
The identified sequence features could putatively act in recombination allowing either less (CpG dinucleotides) or more specific DNA cleavage (complex sequences) or homologous recombination (complex sequences).
HLA/MHC II类分子在人类群体中表现出高度多态性。有研究表明,个体多态性基序是通过等位基因之间的遗传物质转移(重组、基因转换)进行传播和混合的,但目前尚未确定其明确的分子基础。大量的MHC II类等位基因序列已公开可用,可用于分析重组背后的序列特征,揭示HLA II类基因及其他受重组作用的基因中此类重组过程的可能基础。
在本研究中,我们分析了最具多态性的MHC II类基因座HLA-DRB1的大量人类等位基因变体数据集(49个完整编码序列,374个完整外显子2序列),并确定了许多可能有助于重组的先前未知的序列特征。尽管总体G+C含量相似,但外显子2(包含抗原结合位点且随后具有高度多态性)的CpG二核苷酸含量与其他外显子相比大幅升高。此外,CpG模式高度保守。我们还在外显子2中鉴定出更复杂、高度保守的序列基序。其中一些可被鉴定为先前在其他基因中发现的假定重组基序,但大多数是先前未鉴定的。
所确定的序列特征可能在重组中起作用,允许较少(CpG二核苷酸)或更特异性的DNA切割(复杂序列)或同源重组(复杂序列)。