Batzer M A, Kilroy G E, Richard P E, Shaikh T H, Desselle T D, Hoppens C L, Deininger P L
Department of Biochemistry and Molecular Biology, Louisiana State University Medical Center, New Orleans 70112.
Nucleic Acids Res. 1990 Dec 11;18(23):6793-8. doi: 10.1093/nar/18.23.6793.
The HS subfamily of Alu sequences is comprised of a group of nearly identical members. Individual subfamily members share 97.7% nucleotide identity with each other and 98.9% nucleotide identity with the HS consensus sequence. Individual subfamily members are on the average 2.8 million years old, and were probably derived from a single source 'master' gene sometime after the human/great ape divergence. The recent Alu family member insertions provide a better image of the structure of Alu retroposons before they have had the opportunity to change significantly. All of the HS subfamily members are flanked by perfect direct repeats as a result of insertion at staggered nicks. The 'master' gene from which the HS subfamily members were derived had an oligo-dA rich tail at least 40 bases long. The 'master' gene is very rich in CpG dinucleotides, but nucleotide substitutions within subfamily members accumulated in a random manner typical for Alu sequence with CpG substitutions occurring 9.2 fold faster than non-CpG substitutions.
Alu序列的HS亚家族由一组几乎相同的成员组成。各个亚家族成员之间的核苷酸同一性为97.7%,与HS共有序列的核苷酸同一性为98.9%。各个亚家族成员平均有280万年的历史,可能在人类/大猩猩分化后的某个时候源自单一的“主”基因。最近的Alu家族成员插入为Alu逆转座子在有机会发生显著变化之前的结构提供了更好的图像。由于在交错切口处插入,所有HS亚家族成员都侧翼有完美的正向重复序列。HS亚家族成员所源自的“主”基因有一条至少40个碱基长的富含寡聚dA的尾巴。“主”基因富含CpG二核苷酸,但亚家族成员内的核苷酸替换以Alu序列典型的随机方式积累,其中CpG替换的发生速度比非CpG替换快9.2倍。