Rowold D J, Herrera R J
Department of Biological Sciences, Florida International University, Miami 33199, USA.
Genetica. 2000;108(1):57-72. doi: 10.1023/a:1004099605261.
Alu insertional elements, the most abundant class of SINEs in humans are dimeric sequences approximately 300 bp in length derived from the 7SL RNA gene. These sequences contain a bipartite RNA pol III promoter, a central poly A tract, a 3' poly A tail, numerous CpG islands and are bracketed by short direct repeats. An estimated 500,000 to 1 x 10(6) units are dispersed throughout the human haploid genome primarily in AT rich neighborhoods located within larger GC dense chromosomal regions via a mechanism known as retroposition. Retroposition activity of Alu elements is determined by both internal and flanking regulatory elements as well as distant genes affecting transcription or transcript stability. Alu elements impact the organization and expression of the human genome at many levels including the processes of recombination, transcription and translation. Twelve subfamilies of Alu are defined by distinct patterns of diagnostic base substitutions. Subfamilies may be classified as young, intermediate or old reflecting the time since the start of retroposition by their members. Some insertions of the youngest subfamilies are not yet fixed in the human species and represent polymorphic loci. Alus are excellent molecular markers for a variety of reasons. They aid in tracing the complex pattern of duplication and rearrangements that occurred during the evolution of primate genome. Unlike other mutations, Alu sequences are rarely lost completely once retroposed, have a defined ancestral state and are free from homoplasy since independent and identical insertions are highly unlikely. Because of these characteristics, Alus are literally molecular fossils. Polymorphic Alu loci are especially useful in studies of human genetic diversity and in pedigree and forensic analysis.
Alu插入元件是人类中最丰富的短散在核元件(SINE)类型,是长度约为300 bp的二聚体序列,源自7SL RNA基因。这些序列包含一个二分RNA聚合酶III启动子、一个中央多聚腺苷酸序列、一个3'多聚腺苷酸尾巴、众多CpG岛,并由短的直接重复序列包围。估计有500,000至1×10⁶个单位通过一种称为逆转座的机制分散在人类单倍体基因组中,主要分布在位于较大GC密集染色体区域内富含AT的区域。Alu元件的逆转座活性由内部和侧翼调控元件以及影响转录或转录稳定性的远距离基因决定。Alu元件在许多层面影响人类基因组的组织和表达,包括重组、转录和翻译过程。Alu的12个亚家族由独特的诊断碱基替换模式定义。亚家族可分为年轻、中间或古老亚家族,反映其成员开始逆转座后的时间。最年轻亚家族的一些插入尚未在人类物种中固定,代表多态性位点。Alu由于多种原因是优秀的分子标记。它们有助于追踪灵长类基因组进化过程中发生的复杂重复和重排模式。与其他突变不同,Alu序列一旦逆转座很少会完全丢失,具有明确的祖先状态,并且由于独立且相同的插入极不可能发生,所以不存在同塑性。由于这些特征,Alu实际上就是分子化石。多态性Alu位点在人类遗传多样性研究以及家系和法医分析中特别有用。