Falk Brain Tumor Center, Cancer Biology and Epigenomics Program, Children's Memorial Research Center, Chicago, IL 60614-3394, USA.
BMC Genomics. 2011 Dec 20;12:617. doi: 10.1186/1471-2164-12-617.
De novo retrotransposition of Alu elements has been recognized as a major driver for insertion polymorphisms in human populations. In this study, we exploited Alu-anchored bisulfite PCR libraries to identify evolutionarily recent Alu element insertions, and to investigate their genetic and epigenetic variation.
A total of 327 putatively recent Alu insertions were identified, altogether represented by 1,762 sequence reads. Nearly all such de novo retrotransposition events (316/327) were novel. Forty-seven out of forty-nine randomly selected events, corresponding to nineteen genomic loci, were sequence-verified. Alu element insertions remained hemizygous in one or more individuals in sixteen of the nineteen genomic loci. The Alu elements were found to be enriched for young Alu families with characteristic sequence features, such as the presence of a longer poly(A) tail. In addition, we documented the occurrence of a duplication of the AT-rich target site in their immediate flanking sequences, a hallmark of retrotransposition. Furthermore, we found the sequence motif (TT/AAAA) that is recognized by the ORF2P protein encoded by LINE-1 in their 5'-flanking regions, consistent with the fact that Alu retrotransposition is facilitated by LINE-1 elements. While most of these Alu elements were heavily methylated, we identified an Alu localized 1.5 kb downstream of TOMM5 that exhibited a completely unmethylated left arm. Interestingly, we observed differential methylation of its immediate 5' and 3' flanking CpG dinucleotides, in concordance with the unmethylated and methylated statuses of its internal 5' and 3' sequences, respectively. Importantly, TOMM5's CpG island and the 3 Alu repeats and 1 MIR element localized upstream of this newly inserted Alu were also found to be unmethylated. Methylation analyses of two additional genomic loci revealed no methylation differences in CpG dinucleotides flanking the Alu insertion sites in the two homologous chromosomes, irrespective of the presence or absence of the insertion.
We anticipate that the combination of methodologies utilized in this study, which included repeat-anchored bisulfite PCR sequencing and the computational analysis pipeline herein reported, will prove invaluable for the generation of genetic and epigenetic variation maps.
从头反转录转座的 Alu 元件已被认为是人类群体中插入多态性的主要驱动因素。在这项研究中,我们利用 Alu 锚定的亚硫酸氢盐 PCR 文库来识别进化上较新的 Alu 元件插入,并研究它们的遗传和表观遗传变异。
总共鉴定出 327 个假定的新 Alu 插入,总共由 1762 个序列读取表示。几乎所有这些新的反转录转座事件(316/327)都是新的。在 19 个基因组座中的 49 个随机选择的事件中,有 47 个得到了序列验证。在 19 个基因组座中的 16 个中,Alu 元件在一个或多个个体中保持半合子状态。Alu 元件富含具有特征序列特征的年轻 Alu 家族,例如较长的 poly(A)尾巴。此外,我们记录了它们紧邻侧翼序列中 AT 丰富靶位点的重复,这是反转录转座的标志。此外,我们在其 5'侧翼区域发现了 LINE-1 编码的 ORF2P 蛋白识别的序列模体(TT/AAAA),这与 Alu 反转录转座由 LINE-1 元件促进的事实一致。虽然这些 Alu 元件大多数都被高度甲基化,但我们在 TOMM5 下游 1.5kb 处鉴定出一个完全未甲基化的左臂的 Alu。有趣的是,我们观察到其紧邻的 5'和 3'侧翼 CpG 二核苷酸的差异甲基化,与内部 5'和 3'序列的未甲基化和甲基化状态一致。重要的是,TOMM5 的 CpG 岛和位于此新插入 Alu 上游的 3 个 Alu 重复和 1 个 MIR 元件也被发现未甲基化。对另外两个基因组座的甲基化分析显示,在两个同源染色体上,无论是否存在插入,Alu 插入位点侧翼的 CpG 二核苷酸没有甲基化差异。
我们预计,本研究中使用的组合方法学,包括重复锚定的亚硫酸氢盐 PCR 测序和本文报告的计算分析管道,将为遗传和表观遗传变异图谱的生成提供宝贵的资源。