Nelson Craig E, Hersh Bradley M, Carroll Sean B
Howard Hughes Medical Institute, University of Wisconsin-Madison, 1525 Linden Drive, Madison, WI 53703, USA.
Genome Biol. 2004;5(4):R25. doi: 10.1186/gb-2004-5-4-r25. Epub 2004 Mar 15.
Factors affecting the organization and spacing of functionally unrelated genes in metazoan genomes are not well understood. Because of the vast size of a typical metazoan genome compared to known regulatory and protein-coding regions, functional DNA is generally considered to have a negligible impact on gene spacing and genome organization. In particular, it has been impossible to estimate the global impact, if any, of regulatory elements on genome architecture.
To investigate this, we examined the relationship between regulatory complexity and gene spacing in Caenorhabditis elegans and Drosophila melanogaster. We found that gene density directly reflects local regulatory complexity, such that the amount of noncoding DNA between a gene and its nearest neighbors correlates positively with that gene's regulatory complexity. Genes with complex functions are flanked by significantly more noncoding DNA than genes with simple or housekeeping functions. Genes of low regulatory complexity are associated with approximately the same amount of noncoding DNA in D. melanogaster and C. elegans, while loci of high regulatory complexity are significantly larger in the more complex animal. Complex genes in C. elegans have larger 5' than 3' noncoding intervals, whereas those in D. melanogaster have roughly equivalent 5' and 3' noncoding intervals.
Intergenic distance, and hence genome architecture, is highly nonrandom. Rather, it is shaped by regulatory information contained in noncoding DNA. Our findings suggest that in compact genomes, the species-specific loss of nonfunctional DNA reveals a landscape of regulatory information by leaving a profile of functional DNA in its wake.
后生动物基因组中影响功能不相关基因的组织和间距的因素尚未得到充分理解。由于典型的后生动物基因组与已知的调控区域和蛋白质编码区域相比规模巨大,功能性DNA通常被认为对基因间距和基因组组织的影响可以忽略不计。特别是,一直无法估计调控元件对基因组结构的全局影响(如果有的话)。
为了研究这一问题,我们研究了秀丽隐杆线虫和黑腹果蝇中调控复杂性与基因间距之间的关系。我们发现基因密度直接反映了局部调控复杂性,因此一个基因与其最邻近基因之间的非编码DNA量与其调控复杂性呈正相关。具有复杂功能的基因两侧的非编码DNA比具有简单功能或看家功能的基因多得多。调控复杂性低的基因在黑腹果蝇和秀丽隐杆线虫中与大致相同数量的非编码DNA相关联,而调控复杂性高的基因座在更复杂的动物中明显更大。秀丽隐杆线虫中的复杂基因5'端的非编码间隔比3'端大,而黑腹果蝇中的复杂基因5'端和3'端的非编码间隔大致相等。
基因间距离以及基因组结构是高度非随机的。相反,它是由非编码DNA中包含的调控信息塑造的。我们的研究结果表明,在紧凑的基因组中,非功能性DNA的物种特异性缺失通过留下功能性DNA的轮廓揭示了调控信息的格局。