Nobrega Marcelo A, Pennacchio Len A
Genome Sciences Department, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA.
J Physiol. 2004 Jan 1;554(Pt 1):31-9. doi: 10.1113/jphysiol.2003.050948.
The recent completion of the human genome sequence has enabled the identification of a large fraction of our gene catalogue and their physical chromosomal position. However, current efforts lag at defining the cis-regulatory sequences that control the spatial and temporal patterns of each gene's expression. This task remains difficult due to our lack of knowledge of the vocabulary controlling gene regulation and the vast genomic search space, with greater than 95% of our genome being noncoding. Recent comparative genomic-based strategies are beginning to aid in the identification of functional sequences based on their high levels of evolutionary conservation. This has proven successful for comparisons between closely related species such as human-primate or human-mouse, but also holds true for distant evolutionary comparisons, such as human-fish or human-bird. In this review we provide support for the utility of cross-species sequence comparisons by illustrating several applications of this strategy, including the identification of new genes and functional non-coding sequences. We also discuss emerging concepts as this field matures, such as how to properly select which species for comparison, which may differ significantly between independent studies.
人类基因组序列的近期完成,使得我们能够鉴定出很大一部分基因目录及其在染色体上的物理位置。然而,目前在确定控制每个基因表达的时空模式的顺式调控序列方面仍进展缓慢。由于我们对控制基因调控的词汇缺乏了解,以及基因组搜索空间巨大(超过95%的基因组是非编码的),这项任务仍然困难重重。最近基于比较基因组学的策略开始有助于基于功能序列的高度进化保守性来识别它们。这已被证明在人类与灵长类动物或人类与小鼠等亲缘关系较近的物种之间的比较中是成功的,但在人类与鱼类或人类与鸟类等远缘进化比较中也同样适用。在本综述中,我们通过举例说明该策略的几种应用,包括新基因和功能性非编码序列的鉴定,来支持跨物种序列比较的实用性。我们还讨论了随着该领域的成熟而出现的新观念,比如如何恰当地选择用于比较的物种,这在不同的独立研究中可能会有很大差异。