Bacolla Albino, Collins Jack R, Gold Bert, Chuzhanova Nadia, Yi Ming, Stephens Robert M, Stefanov Stefan, Olsh Adam, Jakupciak John P, Dean Michael, Lempicki Richard A, Cooper David N, Wells Robert D
Institute of Biosciences and Technology, Center for Genome Research, Texas A&M University System Health Science Center, Texas Medical Center, 2121 West Holcombe Blvd, Houston, TX 77030, USA.
Nucleic Acids Res. 2006 May 19;34(9):2663-75. doi: 10.1093/nar/gkl354. Print 2006.
Homo(purine*pyrimidine) sequences (R*Y tracts) with mirror repeat symmetries form stable triplexes that block replication and transcription and promote genetic rearrangements. A systematic search was conducted to map the location of the longest R*Y tracts in the human genome in order to assess their potential function(s). The 814 R*Y tracts with ≥250 uninterrupted base pairs were preferentially clustered in the pseudoautosomal region of the sex chromosomes and located in the introns of 228 annotated genes whose protein products were associated with functions at the cell membrane. These genes were highly expressed in the brain and included, in particular, genes associated with susceptibility to mental disorders such as schizophrenia. The set of 1957 genes harboring the 2886 R*Y tracts with ≥100 uninterrupted base pairs was additionally enriched in proteins associated with phosphorylation, signal transduction, development and morphogenesis. Comparisons of the ≥250 bp R*Y tracts in the mouse and chimpanzee genomes indicated that these sequences have mutated faster than the surrounding regions and are longer in humans than in chimpanzees. These results support a role for long R*Y tracts in promoting recombination and genome diversity during evolution through destabilization of chromosomal DNA, thereby inducing repair and mutation.
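To make the idea of an "uninterrupted" tract concrete, the following is a minimal Python sketch of how such runs could be located on one strand of a sequence. It is not the authors' pipeline: the function name `find_ry_tracts`, the `min_len` parameter, and the regular-expression approach are assumptions made for illustration. It reports only purine-only (A/G) or pyrimidine-only (C/T) runs and does not test for mirror repeat symmetry.

```python
import re


def find_ry_tracts(seq, min_len=250):
    """Find uninterrupted purine-only or pyrimidine-only runs.

    Hypothetical helper for illustration only. Returns a list of
    (start, end, kind) tuples with 0-based, end-exclusive coordinates,
    where kind is "R" (A/G run) or "Y" (C/T run) on the given strand.
    """
    # A run of purines on one strand pairs with a run of pyrimidines
    # on the complementary strand, so either pattern marks a tract.
    pattern = re.compile(r"[AG]{%d,}|[CT]{%d,}" % (min_len, min_len))
    tracts = []
    for match in pattern.finditer(seq.upper()):
        kind = "R" if match.group()[0] in "AG" else "Y"
        tracts.append((match.start(), match.end(), kind))
    return tracts


# Toy example with a short threshold so the runs are easy to see.
print(find_ry_tracts("CTAGGAAGGANCTCTCA", min_len=5))
```

With the default `min_len=250` the same function would apply the length threshold quoted in the abstract. Any base outside A/G or C/T (for example `N`) ends a run.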