Bellora Nicolás, Farré Domènec, Albà M Mar
Research Unit on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain.
BMC Genomics. 2007 Dec 13;8:459. doi: 10.1186/1471-2164-8-459.
The arrangement of regulatory motifs in gene promoters, or promoter architecture, is the result of mutation and selection processes that have operated over many millions of years. In mammals, tissue-specific transcriptional regulation is related to the presence of specific protein-interacting DNA motifs in gene promoters. However, little is known about the relative location and spacing of these motifs. To fill this gap, we have performed a systematic search for motifs that show significant bias at specific promoter locations in a large collection of housekeeping and tissue-specific genes.
We observe that promoters driving housekeeping gene expression are enriched in particular motifs with strong positional bias, such as YY1, which are of little relevance in promoters driving tissue-specific expression. We also identify a large number of motifs that show positional bias in genes expressed in a highly tissue-specific manner. They include well-known tissue-specific motifs, such as HNF1 and HNF4 motifs in liver, kidney and small intestine, or RFX motifs in testis, as well as many potentially novel regulatory motifs. Based on this analysis, we provide predictions for 559 tissue-specific motifs in mouse gene promoters.
The study shows that motif positional bias is an important feature of mammalian proximal promoters and that it affects both general and tissue-specific motifs. Motif positional constraints define very distinct promoter architectures depending on breadth of expression and type of tissue.
基因启动子中调控基序的排列,即启动子结构,是数百万年来突变和选择过程的结果。在哺乳动物中,组织特异性转录调控与基因启动子中特定的蛋白质相互作用DNA基序的存在有关。然而,对于这些基序的相对位置和间距知之甚少。为了填补这一空白,我们在大量管家基因和组织特异性基因中,对在特定启动子位置表现出显著偏向性的基序进行了系统搜索。
我们观察到,驱动管家基因表达的启动子富含具有强烈位置偏向性的特定基序,如YY1,而这些基序在驱动组织特异性表达的启动子中相关性较小。我们还鉴定出大量在高度组织特异性表达的基因中表现出位置偏向性的基序。它们包括众所周知的组织特异性基序,如肝脏、肾脏和小肠中的HNF1和HNF4基序,或睾丸中的RFX基序,以及许多潜在的新型调控基序。基于此分析,我们对小鼠基因启动子中的559个组织特异性基序进行了预测。
该研究表明,基序位置偏向性是哺乳动物近端启动子的一个重要特征,并且它影响一般基序和组织特异性基序。基序位置限制根据表达广度和组织类型定义了非常不同的启动子结构。