Alonso M Eva, Pernaute Bárbara, Crespo Miguel, Gómez-Skarmeta José Luis, Manzanares Miguel
Instituto de Investigaciones Biomédicas CSIC-UAM, Madrid, Spain.
Int J Dev Biol. 2009;53(8-10):1367-78. doi: 10.1387/ijdb.072428ma.
The sequencing of the whole genome of multiple species provides us with the instruction book of how to build an organism and make it work, plus a detailed history of how diversity was generated during evolution. Unfortunately, we still understand only a small fraction, which is locating where genes are and deciphering the proteins they code for. The next step is to understand how the correct amount of gene products are produced in space and time to obtain a fully functioning organism, from the egg to the adult. This is what is known as the regulatory genome, a term coined by Eric H. Davidson. In this review, we examine what we know about gene regulation from a genomic point of view, revise the current in silico, in vitro and in vivo methodological approaches to study transcriptional regulation, and point to the power of phylogenetic footprinting as a guide to regulatory element discovery. The advantages and limitations of each approach are considered, with the emerging view that only large-scale studies and data-crunching will give us insight into the language of genomic regulatory systems, and allow the discovery of regulatory codes in the genome.
多个物种全基因组的测序为我们提供了关于如何构建一个生物体并使其运转的说明书,以及一部关于进化过程中多样性是如何产生的详细历史。不幸的是,我们仍然只了解其中的一小部分,即确定基因的位置并解读它们编码的蛋白质。下一步是了解如何在空间和时间上产生正确数量的基因产物,以获得一个从卵子到成年个体都能正常运转的生物体。这就是所谓的调控基因组,这一术语由埃里克·H·戴维森提出。在这篇综述中,我们从基因组的角度审视我们对基因调控的了解,修订当前用于研究转录调控的计算机模拟、体外和体内方法,并指出系统发育足迹法作为发现调控元件的指南的作用。我们考虑了每种方法的优缺点,新出现的观点认为,只有大规模研究和数据处理才能让我们深入了解基因组调控系统的语言,并发现基因组中的调控密码。