Gherman Adrian, Wang Ruihua, Avramopoulos Dimitrios
McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, School of Medicine, Baltimore, MD 21205, USA.
Hum Genomics. 2009 Jan;3(2):143-56. doi: 10.1186/1479-7364-3-2-143.
The sequencing of the human genome has allowed us to observe globally and in detail the arrangement of genes along the chromosomes. There are multiple lines of evidence that this arrangement is not random, both in terms of intergenic distances and orientation of neighbouring genes. We have undertaken a systematic evaluation of the spatial distribution and orientation of known genes across the human genome. We used genome-level information, including phylogenetic conservation, single nucleotide polymorphism density and correlation of gene expression to assess the importance of this distribution. In addition to confirming and extending known properties of the genome, such as the significance of gene deserts and the importance of 'head to head' orientation of gene pairs in proximity, we provide significant new observations that include a smaller average size for intervals separating the 3' ends of neighbouring genes, a correlation of gene expression across tissues for genes as far as 100 kilobases apart and signatures of increasing positive selection with decreasing interval size surprisingly relaxing for intervals smaller than approximately 500 base pairs. Further, we provide extensive graphical representations of the genome-wide data to allow for observations and comparisons beyond what we address.
人类基因组测序使我们能够从整体上详细观察基因在染色体上的排列方式。有多项证据表明,无论是从基因间距离还是相邻基因的方向来看,这种排列都不是随机的。我们对已知基因在人类基因组中的空间分布和方向进行了系统评估。我们利用了基因组层面的信息,包括系统发育保守性、单核苷酸多态性密度以及基因表达的相关性,来评估这种分布的重要性。除了确认和扩展基因组的已知特性,如基因沙漠的重要性以及相邻基因对“头对头”方向的重要性外,我们还提供了重要的新观察结果,包括相邻基因3'端之间间隔的平均尺寸更小、相隔达100千碱基的基因在不同组织中的基因表达具有相关性,以及随着间隔尺寸减小正选择增加的特征在间隔小于约500个碱基对时出人意料地减弱。此外,我们还提供了全基因组数据的大量图形表示,以便进行超出我们所讨论内容的观察和比较。