Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA.
Proc Natl Acad Sci U S A. 2010 Feb 23;107(8):3930-5. doi: 10.1073/pnas.1000147107. Epub 2010 Feb 8.
Gene expression is controlled by interactions between trans-regulatory factors and cis-regulatory DNA sequences, and these interactions constitute the essential functional linkages of gene regulatory networks (GRNs). Validation of GRN models requires experimental cis-regulatory tests of predicted linkages to authenticate their identities and proposed functions. However, cis-regulatory analysis is, at present, at a severe bottleneck in genomic system biology because of the demanding experimental methodologies currently in use for discovering cis-regulatory modules (CRMs), in the genome, and for measuring their activities. Here we demonstrate a high-throughput approach to both discovery and quantitative characterization of CRMs. The unique aspect is use of DNA sequence tags to "barcode" CRM expression constructs, which can then be mixed, injected together into sea urchin eggs, and subsequently deconvolved. This method has increased the rate of cis-regulatory analysis by >100-fold compared with conventional one-by-one reporter assays. The utility of the DNA-tag reporters was demonstrated by the rapid discovery of 81 active CRMs from 37 previously unexplored sea urchin genes. We then obtained simultaneous high-resolution temporal characterization of the regulatory activities of more than 80 CRMs. On average 2-3 CRMs were discovered per gene. Comparison of endogenous gene expression profiles with those of the CRMs recovered from each gene showed that, for most cases, at least one CRM is active in each phase of endogenous expression, suggesting that CRM recovery was comprehensive. This approach will qualitatively alter the practice of GRN construction as well as validation, and will impact many additional areas of regulatory system biology.
基因表达受转录调节因子和顺式调控 DNA 序列之间相互作用的控制,这些相互作用构成了基因调控网络(GRN)的基本功能联系。GRN 模型的验证需要对预测的联系进行实验性顺式调控测试,以验证它们的身份和提出的功能。然而,由于目前用于发现基因组中顺式调控模块(CRMs)及其活性的实验方法学要求很高,因此基因组系统生物学中的顺式调控分析目前处于严重的瓶颈期。在这里,我们展示了一种高通量的方法来发现和定量表征 CRMs。独特之处在于使用 DNA 序列标签来“标记”CRM 表达构建体,然后可以将它们混合在一起,一起注入海胆卵中,然后进行反卷积。与传统的逐个报告基因检测相比,这种方法将顺式调控分析的速度提高了 100 多倍。DNA 标记报告的实用性通过从 37 个以前未探索的海胆基因中快速发现 81 个活跃的 CRM 得到了证明。然后,我们获得了 80 多个 CRM 的同时进行的高分辨率时间特征描述。平均每个基因发现 2-3 个 CRM。与从每个基因中恢复的 CRM 的内源基因表达谱进行比较表明,在大多数情况下,至少有一个 CRM 在内源表达的每个阶段都是活跃的,这表明 CRM 的恢复是全面的。这种方法将从根本上改变 GRN 构建和验证的实践,并将影响调控系统生物学的许多其他领域。