Jolma Arttu, Taipale Jussi
Department of Biosciences and Nutrition, SE-171 77, Stockholm, Sweden,
Subcell Biochem. 2011;52:155-73. doi: 10.1007/978-90-481-9069-0_7.
Transcription of genes during development and in response to environmental stimuli is determined by genomic DNA sequence. The DNA sequences regulating transcription are read by sequence-specific transcription factors (TFs) that recognize relatively short sequences, generally between four and twenty base pairs in length. Transcriptional regulation generally requires binding of multiple TFs in close proximity to each other. Mechanistic understanding of transcription in an organism thus requires detailed knowledge of binding affinities of all its TFs to all possible DNA sequences, and the co-operative interactions between the TFs. However, very little is known about such co-operative binding interactions, and even the simple TF-DNA binding information exists only for a very small proportion of all TFs - for example, mammals have approximately 1,300-2,000 TFs [1, 2], yet the largest public databases for TF binding specificity, Jaspar and Uniprobe [3, 4] currently list only approximately 500 moderate to high resolution profiles for human or mouse. This lack of knowledge is in part due to the fact that analysis of TF DNA binding has been laborious and expensive. In this chapter, we review methods that can be used to determine binding specificity of TFs to DNA, mainly focusing on recently developed assays that allow high-resolution analysis of TF binding specificity in relatively high throughput.
基因在发育过程中以及对环境刺激的反应中的转录由基因组DNA序列决定。调节转录的DNA序列由序列特异性转录因子(TFs)读取,这些转录因子识别相对较短的序列,长度一般在4到20个碱基对之间。转录调控通常需要多个TFs彼此紧密结合。因此,要从机制上理解一个生物体中的转录过程,就需要详细了解其所有TFs与所有可能的DNA序列的结合亲和力,以及TFs之间的协同相互作用。然而,对于这种协同结合相互作用知之甚少,甚至对于所有TFs中只有很小一部分存在简单的TF-DNA结合信息——例如,哺乳动物大约有1300-2000个TFs[1,2],然而,目前最大的TF结合特异性公共数据库Jaspar和Uniprobe[3,4]仅列出了大约500个人类或小鼠的中等到高分辨率图谱。这种知识的匮乏部分是由于TF DNA结合分析一直既费力又昂贵。在本章中,我们将综述可用于确定TFs与DNA结合特异性的方法,主要关注最近开发的能够在相对高通量下对TF结合特异性进行高分辨率分析的检测方法。