Ostrin Edwin J, Li Yumei, Hoffman Kristi, Liu Jing, Wang Keqing, Zhang Li, Mardon Graeme, Chen Rui
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA.
Genome Res. 2006 Apr;16(4):466-76. doi: 10.1101/gr.4673006. Epub 2006 Mar 13.
The discovery of direct downstream targets of transcription factors (TFs) is necessary for understanding the genetic mechanisms underlying complex, highly regulated processes such as development. In this report, we have used a combinatorial strategy to conduct a genome-wide search for novel direct targets of Eyeless (Ey), a key transcription factor controlling early eye development in Drosophila. To overcome the lack of high-quality consensus binding site sequences, phylogenetic shadowing of known Ey binding sites in sine oculis (so) was used to construct a position weight matrix (PWM) of the Ey protein. This PWM was then used for in silico prediction of potential binding sites in the Drosophila melanogaster genome. To reduce the false positive rate, conservation of these potential binding sites was assessed by comparing the genomic sequences from seven Drosophila species. In parallel, microarray analysis of wild-type versus ectopic ey-expressing tissue, followed by microarray-based epistasis experiments in an atonal (ato) mutant background, identified 188 genes induced by ey. Intersection of in silico predicted conserved Ey binding sites with the candidate gene list produced through expression profiling yields a list of 20 putative ey-induced, eye-enriched, ato-independent, direct targets of Ey. The accuracy of this list of genes was confirmed using both in vitro and in vivo methods. Initial analysis reveals three genes, eyes absent, shifted, and Optix, as novel direct targets of Ey. These results suggest that the integrated strategy of computational biology, genomics, and genetics is a powerful approach to identify direct downstream targets for any transcription factor genome-wide.
发现转录因子(TFs)的直接下游靶点对于理解诸如发育等复杂、高度调控过程背后的遗传机制至关重要。在本报告中,我们采用了一种组合策略,在全基因组范围内搜索无眼(Ey)的新型直接靶点,Ey是控制果蝇早期眼睛发育的关键转录因子。为了克服高质量共有结合位点序列的缺乏,利用无眼(so)中已知Ey结合位点的系统发育影子来构建Ey蛋白的位置权重矩阵(PWM)。然后将该PWM用于在黑腹果蝇基因组中对潜在结合位点进行计算机预测。为了降低假阳性率,通过比较七种果蝇物种的基因组序列来评估这些潜在结合位点的保守性。同时,对野生型与异位表达ey的组织进行微阵列分析,随后在无调性(ato)突变背景下进行基于微阵列的上位性实验,鉴定出188个由ey诱导的基因。通过表达谱分析产生的候选基因列表与计算机预测的保守Ey结合位点的交集,得到了20个假定的由ey诱导、眼睛富集、ato非依赖性的Ey直接靶点列表。使用体外和体内方法均证实了该基因列表的准确性。初步分析揭示了三个基因,无眼、移位和Optix,作为Ey的新型直接靶点。这些结果表明,计算生物学、基因组学和遗传学的综合策略是在全基因组范围内识别任何转录因子直接下游靶点的有力方法。