Xie Hai-Bing, Irwin David M, Zhang Ya-Ping
State Key Laboratory of Genetic Resource and Evolution, Kunming Institute of Zoology, Kunming 650223, PR China.
BMC Genomics. 2008 Nov 2;9:520. doi: 10.1186/1471-2164-9-520.
Many conserved secondary structures have been identified within conserved elements in the human genome, but only a small fraction of them are known to be functional RNAs. The evolutionary variations of these conserved secondary structures in human populations and their biological functions have not been fully studied.
We searched for polymorphisms within conserved secondary structures and identified a number of SNPs within these elements even though they are highly conserved among species. The density of SNPs in conserved secondary structures is about 65% of that of their flanking, non-conserved, sequences. Classification of sites as stems or as loops/bulges revealed that the density of SNPs in stems is about 62% of that found in loops/bulges. Analysis of derived allele frequency data indicates that sites in stems are under stronger evolutionary constraint than sites in loops/bulges. Intergenic conserved secondary structures tend to associate with transcription factor-encoding genes with genetic distance being the measure of regulator-gene associations. A substantial fraction of intergenic conserved secondary structures overlap characterized binding sites for multiple transcription factors.
Strong purifying selection implies that secondary structures are probably important carriers of biological functions for conserved sequences. The overlap between intergenic conserved secondary structures and transcription factor binding sites further suggests that intergenic conserved secondary structures have essential roles in directing gene expression in transcriptional regulation networks.
在人类基因组的保守元件中已鉴定出许多保守的二级结构,但其中只有一小部分已知是功能性RNA。这些保守二级结构在人类群体中的进化变异及其生物学功能尚未得到充分研究。
我们在保守二级结构中搜索多态性,即使这些元件在物种间高度保守,仍在其中鉴定出了一些单核苷酸多态性(SNP)。保守二级结构中SNP的密度约为其侧翼非保守序列密度的65%。将位点分类为茎或环/凸起表明,茎中SNP的密度约为环/凸起中SNP密度的62%。对衍生等位基因频率数据的分析表明,茎中的位点比环/凸起中的位点受到更强的进化约束。基因间保守二级结构倾向于与转录因子编码基因相关联,遗传距离是调节基因关联的衡量标准。相当一部分基因间保守二级结构与多个转录因子的特征性结合位点重叠。
强烈的纯化选择意味着二级结构可能是保守序列生物学功能的重要载体。基因间保守二级结构与转录因子结合位点之间的重叠进一步表明,基因间保守二级结构在转录调控网络中指导基因表达方面具有重要作用。